Regex#

Regex is a pipeline module that uses regular expressions to match text data. It is an extremely powerful way of matching complex patterns and extracting enumerable fields from text. For those unfamiliar with regular expressions, a decent starting point is the Wikipedia article; you can also use a regular expression playground to experiment.

Think of the regex module in similar terms to the grep command in Linux: any entries which do not match the regular expression will be dropped (unless the -p flag is set). Building regular expressions is well outside the scope of this document, but one extremely important feature is the (?P<foo>\S+) style syntax, which will assign any matched group into an enumerated value; in this case, it will capture and extract a sequence of non-space characters into an enumerated value named “foo”.

For example, the following search will enumerate the method, user, and ip from an sshd Accepted log entry.

".*sshd.*Accepted (?P<method>\S*) for (?P<user>\S*) from (?P<ip>[0-9]+.[0-9]+.[0-9]+.[0-9]+)"

Because regular expressions can get very long, the regex module takes the -r flag, which specifies a resource containing a regular expression. When populating the resource, do not include “wrapping quotes” around the whole expression as you would when typing directly into a search: e.g. ".*ssh.*Accepted" becomes .*ssh.*Accepted. This is because the quotes are normally stripped out by the search parser prior to being handed to the regex module.

Syntax#

An invocation of the regex module consists of the module name, any arguments which are being used, the regular expression, and then optionally any filters to be applied.

regex <argument list> <regular expression> [filter arguments]

Supported Options#

-e <arg>: The “-e” option operates on an enumerated value instead of on the entire record. For example, a pipeline that showed packets not headed for port 80 but that have HTTP text would be tag=pcap packet ipv4.DstPort!=80 tcp.Payload | regex -e Payload ".*GET \/ HTTP\/1.1.*"
-r <arg>: The “-r” option specifies that the regular expression statement is located in a resource file.
-v: The “-v” option tells regex to operate in inverse mode, dropping any entries which match the regex and passing entries which do not match.
-p: The “-p” option tells regex to allow entries through if the regular expression does not match at all. The permissive flag does not change the operation of filters.
-a: The “-a” option enables array mode, which extracts all matches of each capture group into array enumerated values instead of just the first match. This is useful when an entry contains multiple occurrences of a pattern.

Note

Storing especially large regular expressions in resource files can clean up queries, and allows for easy reuse. If -r is specified, do not specify a regular expression in the query – instead the contents of the resource will be used. Handy!

Raw strings#

To facilitate using escape sequences in regular expressions, you can use backticks to prevent Gravwell from unescaping your input. For example:

tag=syslog grep sshd | regex `shd.*Accepted (?P<method>\S*) for (?P<user>\S*) from (?P<ip>[0-9]+.[0-9]+.[0-9]+.[0-9]+)`

If you used double quotes instead of backticks, \S would have to be double-escaped as \\S. This feature is convenient in that it allows you to copy regular expressions directly from regular expression playgrounds and other sources.

Inline Filtering#

The regex module supports inline filtering for down-selecting data directly within the regex module. The inline filtering also enables regex to engage accelerators to dramatically reduce the amount of data that needs to be processed. Inline filtering is achieved in the same manner as other modules by using comparison operators. If a filter is enabled that specifies equality (“equal”, “not equal”, “contains”, “not contains”) any entry that fails the filter specification will be dropped entirely. If a field is specified as not equal “!=” and the field does not exist, the field is not extracted but the entry won’t be dropped entirely.

Operator	Name	Description
==	Equal	Field must be equal
!=	Not equal	Field must not be equal
~	Subset	Field contains the value
!~	Not Subset	Field does NOT contain the value

Filtering Examples#

The following query extracts auth methods, usernames, and IP addresses from SSH logs and filters down to only those entries where the username is “root” and the IP is in the 192.168.0.0/16 subnet.

tag=syslog regex `shd.*Accepted (?P<method>\S*) for (?P<user>\S*) from (?P<ip>[0-9]+.[0-9]+.[0-9]+.[0-9]+)` user==root ip ~ "192.168"

Example Search#

The following query extracts the authentication method, username, and source IP address from SSH logs as enumerated values named method, user, and ip, which are then displayed in a table.

tag=syslog grep sshd | regex `shd.*Accepted (?P<method>\S*) for (?P<user>\S*) from (?P<ip>[0-9]+.[0-9]+.[0-9]+.[0-9]+)` 
| table method user ip

Array Mode Example#

The -a flag extracts all matches of each capture group into array enumerated values. This is useful when a single entry contains multiple occurrences of a pattern.

For example, to extract all numbers from each entry into an array:

tag=default regex -a `(?P<num>\d+)` | table num

If an entry contains “foo 123 bar 456 baz 789”, the num enumerated value will be an array containing [123, 456, 789].

Full regular expression syntax#

The following is copied from the re2 documentation (see their license)

Regular expressions are a notation for describing sets of character strings. When a particular string is in the set described by a regular expression, we often say that the regular expression matches the string.

The simplest regular expression is a single literal character. Except for the metacharacters like *+?()|, characters match themselves. To match a metacharacter, escape it with a backslash: \+ matches a literal plus character.

Two regular expressions can be alternated or concatenated to form a new regular expression: if e1 matches s and e2 matches t, then e1|e2 matches s or t, and e1e2 matches st.

The metacharacters *, +, and ? are repetition operators: e1* matches a sequence of zero or more (possibly different) strings, each of which match e1; e1+ matches one or more; e1? matches zero or one.

The operator precedence, from weakest to strongest binding, is first alternation, then concatenation, and finally the repetition operators. Explicit parentheses can be used to force different meanings, just as in arithmetic expressions. Some examples: ab|cd is equivalent to (ab)|(cd); ab* is equivalent to a(b*).

The syntax described so far is most of the traditional Unix egrep regular expression syntax. This subset suffices to describe all regular languages: loosely speaking, a regular language is a set of strings that can be matched in a single pass through the text using only a fixed amount of memory. Newer regular expression facilities (notably Perl and those that have copied it) have added many new operators and escape sequences, which make the regular expressions more concise, and sometimes more cryptic, but usually not more powerful.

This page lists the regular expression syntax accepted by RE2. Note that this syntax is a subset of that accepted by PCRE, roughly speaking, and with various caveats.

It also lists some syntax accepted by PCRE, PERL, and VIM.

Kinds of single-character expressions	examples
any character, possibly including newline (s=true)	`.`
character class	`[xyz]`
negated character class	`[^xyz]`
Perl character class (link)	`\d`
negated Perl character class	`\D`
ASCII character class (link)	`[[:alpha:]]`
negated ASCII character class	`[[:^alpha:]]`
Unicode character class (one-letter name)	`\pN`
Unicode character class	`\p{Greek}`
negated Unicode character class (one-letter name)	`\PN`
negated Unicode character class	`\P{Greek}`

	Composites
`xy`	`x` followed by `y`
`x\|y`	`x` or `y` (prefer `x`)

	Repetitions
`x*`	zero or more `x`, prefer more
`x+`	one or more `x`, prefer more
`x?`	zero or one `x`, prefer one
`x{n,m}`	`n` or `n`+1 or … or `m` `x`, prefer more
`x{n,}`	`n` or more `x`, prefer more
`x{n}`	exactly `n` `x`
`x*?`	zero or more `x`, prefer fewer
`x+?`	one or more `x`, prefer fewer
`x??`	zero or one `x`, prefer zero
`x{n,m}?`	`n` or `n`+1 or … or `m` `x`, prefer fewer
`x{n,}?`	`n` or more `x`, prefer fewer
`x{n}?`	exactly `n` `x`
`x{}`	(≡ `x*`) [(NOT SUPPORTED)] [VIM]
`x{-}`	(≡ `x*?`) [(NOT SUPPORTED)] [VIM]
`x{-n}`	(≡ `x{n}?`) [(NOT SUPPORTED)] [VIM]
`x=`	(≡ `x?`) [(NOT SUPPORTED)] [VIM]

Implementation restriction: The counting forms x{n,m}, x{n,}, and x{n} reject forms that create a minimum or maximum repetition count above 1000. Unlimited repetitions are not subject to this restriction.

	Possessive repetitions
`x*+`	zero or more `x`, possessive [(NOT SUPPORTED)]
`x++`	one or more `x`, possessive [(NOT SUPPORTED)]
`x?+`	zero or one `x`, possessive [(NOT SUPPORTED)]
`x{n,m}+`	`n` or … or `m` `x`, possessive [(NOT SUPPORTED)]
`x{n,}+`	`n` or more `x`, possessive [(NOT SUPPORTED)]
`x{n}+`	exactly `n` `x`, possessive [(NOT SUPPORTED)]

	Grouping
`(re)`	numbered capturing group (submatch)
`(?P<name>re)`	named & numbered capturing group (submatch)
`(?<name>re)`	named & numbered capturing group (submatch)
`(?'name're)`	named & numbered capturing group (submatch) [(NOT SUPPORTED)]
`(?:re)`	non-capturing group
`(?flags)`	set flags within current group; non-capturing
`(?flags:re)`	set flags during re; non-capturing
`(?#text)`	comment [(NOT SUPPORTED)]
`(?\|x\|y\|z)`	branch numbering reset [(NOT SUPPORTED)]
`(?>re)`	possessive match of `re` [(NOT SUPPORTED)]
`re@>`	possessive match of `re` [(NOT SUPPORTED)] [VIM]
`%(re)`	non-capturing group [(NOT SUPPORTED)] [VIM]

	Flags
`i`	case-insensitive (default false)
`m`	multi-line mode: `^` and `$` match begin/end line in addition to begin/end text (default false)
`s`	let `.` match `\n` (default false)
`U`	ungreedy: swap meaning of `x` and `x?`, `x+` and `x+?`, etc (default false)

Flag syntax is xyz (set) or -xyz (clear) or xy-z (set xy, clear z).

	Empty strings
`^`	at beginning of text or line (`m`=true)
`$`	at end of text (like `\z` not `\Z`) or line (`m`=true)
`\A`	at beginning of text
`\b`	at ASCII word boundary (`\w` on one side and `\W`, `\A`, or `\z` on the other)
`\B`	not at ASCII word boundary
`\g`	at beginning of subtext being searched [(NOT SUPPORTED)] [PCRE]
`\G`	at end of last match [(NOT SUPPORTED)] [PERL]
`\Z`	at end of text, or before newline at end of text [(NOT SUPPORTED)]
`\z`	at end of text
`(?=re)`	before text matching `re` [(NOT SUPPORTED)]
`(?!re)`	before text not matching `re` [(NOT SUPPORTED)]
`(?<=re)`	after text matching `re` [(NOT SUPPORTED)]
`(?<!re)`	after text not matching `re` [(NOT SUPPORTED)]
`re&`	before text matching `re` [(NOT SUPPORTED)] [VIM]
`re@=`	before text matching `re` [(NOT SUPPORTED)] [VIM]
`re@!`	before text not matching `re` [(NOT SUPPORTED)] [VIM]
`re@<=`	after text matching `re` [(NOT SUPPORTED)] [VIM]
`re@<!`	after text not matching `re` [(NOT SUPPORTED)] [VIM]
`\zs`	sets start of match (= \K) [(NOT SUPPORTED)] [VIM]
`\ze`	sets end of match [(NOT SUPPORTED)] [VIM]
`\%^`	beginning of file [(NOT SUPPORTED)] [VIM]
`\%$`	end of file [(NOT SUPPORTED)] [VIM]
`\%V`	on screen [(NOT SUPPORTED)] [VIM]
`\%#`	cursor position [(NOT SUPPORTED)] [VIM]
`\%'m`	mark `m` position [(NOT SUPPORTED)] [VIM]
`\%23l`	in line 23 [(NOT SUPPORTED)] [VIM]
`\%23c`	in column 23 [(NOT SUPPORTED)] [VIM]
`\%23v`	in virtual column 23 [(NOT SUPPORTED)] [VIM]

	Escape sequences
`\a`	bell (≡ `\007`)
`\f`	form feed (≡ `\014`)
`\t`	horizontal tab (≡ `\011`)
`\n`	newline (≡ `\012`)
`\r`	carriage return (≡ `\015`)
`\v`	vertical tab character (≡ `\013`)
`\*`	literal ``, for any punctuation character ``
`\123`	octal character code (up to three digits)
`\x7F`	hex character code (exactly two digits)
`\x{10FFFF}`	hex character code
`\C`	match a single byte even in UTF-8 mode
`\Q...\E`	literal text `...` even if `...` has punctuation
`\1`	backreference [(NOT SUPPORTED)]
`\b`	backspace [(NOT SUPPORTED)] (use `\010`)
`\cK`	control char ^K [(NOT SUPPORTED)] (use `\001` etc)
`\e`	escape [(NOT SUPPORTED)] (use `\033`)
`\g1`	backreference [(NOT SUPPORTED)]
`\g{1}`	backreference [(NOT SUPPORTED)]
`\g{+1}`	backreference [(NOT SUPPORTED)]
`\g{-1}`	backreference [(NOT SUPPORTED)]
`\g{name}`	named backreference [(NOT SUPPORTED)]
`\g<name>`	subroutine call [(NOT SUPPORTED)]
`\g'name'`	subroutine call [(NOT SUPPORTED)]
`\k<name>`	named backreference [(NOT SUPPORTED)]
`\k'name'`	named backreference [(NOT SUPPORTED)]
`\lX`	lowercase `X` [(NOT SUPPORTED)]
`\ux`	uppercase `x` [(NOT SUPPORTED)]
`\L...\E`	lowercase text `...` [(NOT SUPPORTED)]
`\K`	reset beginning of `$0` [(NOT SUPPORTED)]
`\N{name}`	named Unicode character [(NOT SUPPORTED)]
`\R`	line break [(NOT SUPPORTED)]
`\U...\E`	upper case text `...` [(NOT SUPPORTED)]
`\X`	extended Unicode sequence [(NOT SUPPORTED)]
`\%d123`	decimal character 123 [(NOT SUPPORTED)] [VIM]
`\%xFF`	hex character FF [(NOT SUPPORTED)] [VIM]
`\%o123`	octal character 123 [(NOT SUPPORTED)] [VIM]
`\%u1234`	Unicode character 0x1234 [(NOT SUPPORTED)] [VIM]
`\%U12345678`	Unicode character 0x12345678 [(NOT SUPPORTED)] [VIM]

	Character class elements
`x`	single character
`A-Z`	character range (inclusive)
`\d`	Perl character class
`[:foo:]`	ASCII character class `foo`
`\p{Foo}`	Unicode character class `Foo`
`\pF`	Unicode character class `F` (one-letter name)

	Named character classes as character class elements
`[\d]`	digits (≡ `\d`)
`[^\d]`	not digits (≡ `\D`)
`[\D]`	not digits (≡ `\D`)
`[^\D]`	not not digits (≡ `\d`)
`[[:name:]]`	named ASCII class inside character class (≡ `[:name:]`)
`[^[:name:]]`	named ASCII class inside negated character class (≡ `[:^name:]`)
`[\p{Name}]`	named Unicode property inside character class (≡ `\p{Name}`)
`[^\p{Name}]`	named Unicode property inside negated character class (≡ `\P{Name}`)

	Perl character classes (all ASCII-only)
`\d`	digits (≡ `[0-9]`)
`\D`	not digits (≡ `[^0-9]`)
`\s`	whitespace (≡ `[\t\n\f\r ]`)
`\S`	not whitespace (≡ `[^\t\n\f\r ]`)
`\w`	word characters (≡ `[0-9A-Za-z_]`)
`\W`	not word characters (≡ `[^0-9A-Za-z_]`)
`\h`	horizontal space [(NOT SUPPORTED)]
`\H`	not horizontal space [(NOT SUPPORTED)]
`\v`	vertical space [(NOT SUPPORTED)]
`\V`	not vertical space [(NOT SUPPORTED)]

	ASCII character classes
`[[:alnum:]]`	alphanumeric (≡ `[0-9A-Za-z]`)
`[[:alpha:]]`	alphabetic (≡ `[A-Za-z]`)
`[[:ascii:]]`	ASCII (≡ `[\x00-\x7F]`)
`[[:blank:]]`	blank (≡ `[\t ]`)
`[[:cntrl:]]`	control (≡ `[\x00-\x1F\x7F]`)
`[[:digit:]]`	digits (≡ `[0-9]`)
`[[:graph:]]`	graphical (≡ `[!-~]` ≡ [A-Za-z0-9!"#$%&'()*+,\-./:;<=>?@[\\\]^_`{\|}~])
`[[:lower:]]`	lower case (≡ `[a-z]`)
`[[:print:]]`	printable (≡ `[ -~]` ≡ `[ [:graph:]]`)
`[[:punct:]]`	punctuation (≡ [!-/:-@[-`{-~])
`[[:space:]]`	whitespace (≡ `[\t\n\v\f\r ]`)
`[[:upper:]]`	upper case (≡ `[A-Z]`)
`[[:word:]]`	word characters (≡ `[0-9A-Za-z_]`)
`[[:xdigit:]]`	hex digit (≡ `[0-9A-Fa-f]`)

	Unicode character class names–general category
`C`	other
`Cc`	control
`Cf`	format
`Cn`	unassigned code points [(NOT SUPPORTED)]
`Co`	private use
`Cs`	surrogate
`L`	letter
`LC`	cased letter [(NOT SUPPORTED)]
`L&`	cased letter [(NOT SUPPORTED)]
`Ll`	lowercase letter
`Lm`	modifier letter
`Lo`	other letter
`Lt`	titlecase letter
`Lu`	uppercase letter
`M`	mark
`Mc`	spacing mark
`Me`	enclosing mark
`Mn`	non-spacing mark
`N`	number
`Nd`	decimal number
`Nl`	letter number
`No`	other number
`P`	punctuation
`Pc`	connector punctuation
`Pd`	dash punctuation
`Pe`	close punctuation
`Pf`	final punctuation
`Pi`	initial punctuation
`Po`	other punctuation
`Ps`	open punctuation
`S`	symbol
`Sc`	currency symbol
`Sk`	modifier symbol
`Sm`	math symbol
`So`	other symbol
`Z`	separator
`Zl`	line separator
`Zp`	paragraph separator
`Zs`	space separator

Unicode character class names–scripts
`Adlam`
`Ahom`
`Anatolian_Hieroglyphs`
`Arabic`
`Armenian`
`Avestan`
`Balinese`
`Bamum`
`Bassa_Vah`
`Batak`
`Bengali`
`Bhaiksuki`
`Bopomofo`
`Brahmi`
`Braille`
`Buginese`
`Buhid`
`Canadian_Aboriginal`
`Carian`
`Caucasian_Albanian`
`Chakma`
`Cham`
`Cherokee`
`Chorasmian`
`Common`
`Coptic`
`Cuneiform`
`Cypriot`
`Cypro_Minoan`
`Cyrillic`
`Deseret`
`Devanagari`
`Dives_Akuru`
`Dogra`
`Duployan`
`Egyptian_Hieroglyphs`
`Elbasan`
`Elymaic`
`Ethiopic`
`Georgian`
`Glagolitic`
`Gothic`
`Grantha`
`Greek`
`Gujarati`
`Gunjala_Gondi`
`Gurmukhi`
`Han`
`Hangul`
`Hanifi_Rohingya`
`Hanunoo`
`Hatran`
`Hebrew`
`Hiragana`
`Imperial_Aramaic`
`Inherited`
`Inscriptional_Pahlavi`
`Inscriptional_Parthian`
`Javanese`
`Kaithi`
`Kannada`
`Katakana`
`Kawi`
`Kayah_Li`
`Kharoshthi`
`Khitan_Small_Script`
`Khmer`
`Khojki`
`Khudawadi`
`Lao`
`Latin`
`Lepcha`
`Limbu`
`Linear_A`
`Linear_B`
`Lisu`
`Lycian`
`Lydian`
`Mahajani`
`Makasar`
`Malayalam`
`Mandaic`
`Manichaean`
`Marchen`
`Masaram_Gondi`
`Medefaidrin`
`Meetei_Mayek`
`Mende_Kikakui`
`Meroitic_Cursive`
`Meroitic_Hieroglyphs`
`Miao`
`Modi`
`Mongolian`
`Mro`
`Multani`
`Myanmar`
`Nabataean`
`Nag_Mundari`
`Nandinagari`
`New_Tai_Lue`
`Newa`
`Nko`
`Nushu`
`Nyiakeng_Puachue_Hmong`
`Ogham`
`Ol_Chiki`
`Old_Hungarian`
`Old_Italic`
`Old_North_Arabian`
`Old_Permic`
`Old_Persian`
`Old_Sogdian`
`Old_South_Arabian`
`Old_Turkic`
`Old_Uyghur`
`Oriya`
`Osage`
`Osmanya`
`Pahawh_Hmong`
`Palmyrene`
`Pau_Cin_Hau`
`Phags_Pa`
`Phoenician`
`Psalter_Pahlavi`
`Rejang`
`Runic`
`Samaritan`
`Saurashtra`
`Sharada`
`Shavian`
`Siddham`
`SignWriting`
`Sinhala`
`Sogdian`
`Sora_Sompeng`
`Soyombo`
`Sundanese`
`Syloti_Nagri`
`Syriac`
`Tagalog`
`Tagbanwa`
`Tai_Le`
`Tai_Tham`
`Tai_Viet`
`Takri`
`Tamil`
`Tangsa`
`Tangut`
`Telugu`
`Thaana`
`Thai`
`Tibetan`
`Tifinagh`
`Tirhuta`
`Toto`
`Ugaritic`
`Vai`
`Vithkuqi`
`Wancho`
`Warang_Citi`
`Yezidi`
`Yi`
`Zanabazar_Square`

	Vim character classes
`\i`	identifier character [(NOT SUPPORTED)] [VIM]
`\I`	`\i` except digits [(NOT SUPPORTED)] [VIM]
`\k`	keyword character [(NOT SUPPORTED)] [VIM]
`\K`	`\k` except digits [(NOT SUPPORTED)] [VIM]
`\f`	file name character [(NOT SUPPORTED)] [VIM]
`\F`	`\f` except digits [(NOT SUPPORTED)] [VIM]
`\p`	printable character [(NOT SUPPORTED)] [VIM]
`\P`	`\p` except digits [(NOT SUPPORTED)] [VIM]
`\s`	whitespace character (≡ `[ \t]`) [(NOT SUPPORTED)] [VIM]
`\S`	non-white space character (≡ `[^ \t]`) [(NOT SUPPORTED)] [VIM]
`\d`	digits (≡ `[0-9]`) [VIM]
`\D`	not `\d` [VIM]
`\x`	hex digits (≡ `[0-9A-Fa-f]`) [(NOT SUPPORTED)] [VIM]
`\X`	not `\x` [(NOT SUPPORTED)] [VIM]
`\o`	octal digits (≡ `[0-7]`) [(NOT SUPPORTED)] [VIM]
`\O`	not `\o` [(NOT SUPPORTED)] [VIM]
`\w`	word character [VIM]
`\W`	not `\w` [VIM]
`\h`	head of word character [(NOT SUPPORTED)] [VIM]
`\H`	not `\h` [(NOT SUPPORTED)] [VIM]
`\a`	alphabetic [(NOT SUPPORTED)] [VIM]
`\A`	not `\a` [(NOT SUPPORTED)] [VIM]
`\l`	lowercase [(NOT SUPPORTED)] [VIM]
`\L`	not lowercase [(NOT SUPPORTED)] [VIM]
`\u`	uppercase [(NOT SUPPORTED)] [VIM]
`\U`	not uppercase [(NOT SUPPORTED)] [VIM]
`\_x`	`\x` plus newline, for any `x` [(NOT SUPPORTED)] [VIM]
`\c`	ignore case [(NOT SUPPORTED)] [VIM]
`\C`	match case [(NOT SUPPORTED)] [VIM]
`\m`	magic [(NOT SUPPORTED)] [VIM]
`\M`	nomagic [(NOT SUPPORTED)] [VIM]
`\v`	verymagic [(NOT SUPPORTED)] [VIM]
`\V`	verynomagic [(NOT SUPPORTED)] [VIM]
`\Z`	ignore differences in Unicode combining characters [(NOT SUPPORTED)] [VIM]

	Magic
`(?{code})`	arbitrary Perl code [(NOT SUPPORTED)] [PERL]
`(??{code})`	postponed arbitrary Perl code [(NOT SUPPORTED)] [PERL]
`(?n)`	recursive call to regexp capturing group `n` [(NOT SUPPORTED)]
`(?+n)`	recursive call to relative group `+n` [(NOT SUPPORTED)]
`(?-n)`	recursive call to relative group `-n` [(NOT SUPPORTED)]
`(?C)`	PCRE callout [(NOT SUPPORTED)] [PCRE]
`(?R)`	recursive call to entire regexp (≡ `(?0)`) [(NOT SUPPORTED)]
`(?&name)`	recursive call to named group [(NOT SUPPORTED)]
`(?P=name)`	named backreference [(NOT SUPPORTED)]
`(?P>name)`	recursive call to named group [(NOT SUPPORTED)]
`(?(cond)true\|false)`	conditional branch [(NOT SUPPORTED)]
`(?(cond)true)`	conditional branch [(NOT SUPPORTED)]
`(*ACCEPT)`	make regexps more like Prolog [(NOT SUPPORTED)]
`(*COMMIT)`	[(NOT SUPPORTED)]
`(*F)`	[(NOT SUPPORTED)]
`(*FAIL)`	[(NOT SUPPORTED)]
`(*MARK)`	[(NOT SUPPORTED)]
`(*PRUNE)`	[(NOT SUPPORTED)]
`(*SKIP)`	[(NOT SUPPORTED)]
`(*THEN)`	[(NOT SUPPORTED)]
`(*ANY)`	set newline convention [(NOT SUPPORTED)]
`(*ANYCRLF)`	[(NOT SUPPORTED)]
`(*CR)`	[(NOT SUPPORTED)]
`(*CRLF)`	[(NOT SUPPORTED)]
`(*LF)`	[(NOT SUPPORTED)]
`(*BSR_ANYCRLF)`	set \R convention [(NOT SUPPORTED)] [PCRE]
`(*BSR_UNICODE)`	[(NOT SUPPORTED)] [PCRE]