Kudos/Source: https://github.com/niklongstone/regular-expression-cheat-sheet
| Anchor | Description | Example | Valid match | Invalid |
|---|---|---|---|---|
| ^ | start of string or line | ^foam | foam | bath foam |
| \A | start of string in any match mode | \Afoam | foam | bath foam |
| $ | end of string or line | finish$ | finish | finnish |
| \Z | end of string, or char before last new line in any match mode | finish\Z | finish | finnish |
| \z | end of string, in any match mode. | |||
| \G | end of the previous match or the start of the string for the first match | ^(get|set)|\G\w+$ | setValue | seValue |
| \b | word boundary; position between a word character (\w), and a nonword character (\W) | \bis\b | This island is beautiful | This island isn't beautiful |
| \B | not-word-boundary. | \Bland | island | peninsula |
| Assertion | Description | Example | Valid match | Invalid |
|---|---|---|---|---|
| (?=...) | positive lookahead | question(?=s) | questions | question |
| (?!...) | negative lookahead | answer(?!s) | answer | answers |
| (?<=...) | positive look-behind | (?<=appl)e | apple | application |
| (?<!...) | negative look-behind | (?<!goo)d | mood | good |
| Char class | Description | Example | Valid match | Invalid |
|---|---|---|---|---|
| [ ] | class definition | [axf] | a, x, f | b |
| [ - ] | class definition range | [a-c] | a, b, c | d |
| [ \ ] | escape inside class | [a-f.] | a, b, . | g |
| [^ ] | Not in class | [^abc] | d, e | a |
| [:class:] | POSIX class | [:alpha:] | string | 0101 |
| . | match any chars except new line | b.ttle | battle, bottle | bttle |
| \s | white space, [\n\r\f\t ] | good\smorning | good morning | good.morning |
| \S | no-white space, [^\n\r\f\t] | good\Smorning | goodmorning | good morning |
| \d | digit | \d{2} | 23 | 1a |
| \D | non-digit | \D{3} | foo, bar | fo1 |
| \w | word, [a-z-A-Z0-9_] | \w{4} | v411 | v4.1 |
| \W | non word, [^a-z-A-Z0-9_] | .$%? | .$%? | .ab? |
| Special character | Description |
|---|---|
| |general escape | |
| \n | new line |
| \r | carriage return |
| \t | tab |
| \v | vertical tab |
| \f | form feed |
| \a | alarm |
| [\b] | backspace |
| \e | escape |
| \cchar | Ctrl + char(ie:\cc is Ctrl+c) |
| \ooo | three digit octal (ie: \123) |
| \xhh | one or two digit hexadecimal (ie: \x10) |
| \x{hex} | any hexadecimal code (ie: \x{1234}) |
| \p{xx} | char with unicode property (ie: \p{Arabic} |
| \P{xx} | char without unicode property |
| Sequence | Description | Example | Valid match | Invalid |
|---|---|---|---|---|
| | | alternation | apple|orange | apple, orange | melon |
| ( ) | subpattern | foot(er|ball) | footer or footbal | footpath |
| (?P<name>...) | subpattern, and capture submatch into name | (?P<greeting>hello) |
hello | hallo |
| (?:...) | subpattern, but does not capture submatch | (?:hello) | hello | hallo |
| + | one or more quantifier | ye+ah | yeah, yeeeah | yah |
| * | zero or more quantifier | ye*ah | yeeah, yeeeah, yah | yeh |
| ? | zero or one quantifier | yes? | yes, ye | yess |
| ?? | zero or one, as few times as possible (lazy) | yea??h | yeah | yeaah |
| +? | one or more lazy | /<.+?>/g |
<P>foo</P> matches only <P> and </P> |
|
| *? | zero or more, lazy | /<.*?>/g |
<html> |
|
| {n} | n times exactly | fo{2} | foo | fooo |
| {n,m} | from n to m times | go{2,3}d | good,goood | gooood |
| {n,} | at least n times | go{2,} | goo, gooo | go |
| (?(condition)...) | if-then pattern | (<)?[p](?(1)>) |
<p>, p |
<p |
| (?(condition)...|...) | if-then-else pattern | `^(?(?=q)que | ans)` | question, answer |
| Pattern modifier | Description |
|---|---|
| g | global match |
| i | case-insensitiv, match both uppercase and lowercase |
| m | multiple lines |
| s | single line (by default) |
| x | ingore whitespace allows comments |
| A | anchored, the pattern is forced to ^ |
| D | dollar end only, a dollar metacharacter matches only at the end |
| S | extra analysis performed, useful for non-anchored patterns |
| U | ungreedy, greedy patterns becomes lazy by default |
| X | additional functionality of PCRE (PCRE extra) |
| J | allow duplicate names for subpatterns |
| u | unicode, pattern and subject strings are treated as UTF-8 |