Quick Start
Tutorial
Tools & Languages
Examples
Reference
Book Reviews
Regex Reference
Introduction
Table of Contents
Quick Reference
Characters
Basic Features
Character Classes
Shorthands
Anchors
Word Boundaries
Quantifiers
Unicode
Capturing Groups & Backreferences
Named Groups & Backreferences
Special Groups
Mode Modifiers
Recursion & Balancing Groups
Replacement Reference
Characters
Matched Text & Backreferences
Context & Case Conversion
Conditionals
More on This Site
Introduction
Regular Expressions Quick Start
Regular Expressions Tutorial
Replacement Strings Tutorial
Applications and Languages
Regular Expressions Examples
Regular Expressions Reference
Replacement Strings Reference
Book Reviews
Printable PDF
About This Site
RSS Feed & Blog
RegexBuddy—Better than a regular expression reference!

Regular Expression Reference: Special Groups

FeatureSyntaxDescriptionExampleJGsoft .NET Java Perl PCRE PCRE2 PHP Delphi R JavaScript VBScript XRegExp Python Ruby std::regex Boost Tcl ARE POSIX BRE POSIX ERE GNU BRE GNU ERE Oracle XML XPath
Comment (?#comment) Everything between (?# and ) is ignored by the regex engine. a(?#foobar)b matches ab YESYESnoYESYESYESYESYESYESnonoYESYESYESnoECMAYESnonononononono
Branch reset group (?|regex) If the regex inside the branch reset group has multiple alternatives with capturing groups, then the capturing group numbers are the same in all the alternatives. (x)(?|(a)|(bc)|(def))\2 matches xaa, xbcbc, or xdefdef with the first group capturing x and the second group capturing a, bc, or def V2nono5.107.2YES5.2.4YESYESnonononononoECMA
1.42–1.85
nononononononono
Atomic group (?>regex) Atomic groups prevent the regex engine from backtracking back into the group after a match has been found for the group. If the remainder of the regex fails, the engine may backtrack over the group if a quantifier or alternation makes it optional. But it will not backtrack into the group to try other permutations of the group. a(?>bc|b)c matches abcc but not abc YESYESYESYESYESYESYESYESYESnonononoYESnoECMAnononononononono
Positive lookahead (?=regex) Matches at a position where the pattern inside the lookahead can be matched. Matches only the position. It does not consume any characters or expand the match. In a pattern like one(?=two)three, both two and three have to match at the position where the match of one ends. t(?=s) matches the second t in streets. YESYESYESYESYESYESYESYESYESYESYESYESYESYESECMAECMAYESnonononononono
Negative lookahead (?!regex) Similar to positive lookahead, except that negative lookahead only succeeds if the regex inside the lookahead fails to match. t(?!s) matches the first t in streets. YESYESYESYESYESYESYESYESYESYESYESYESYESYESECMAECMAYESnonononononono
Positive lookbehind (?<=regex) Matches at a position if the pattern inside the lookbehind can be matched ending at that position. (?<=s)t matches the first t in streets. YESYESYESYESYESYESYESYESYESYESnoYESYES1.9noECMAnononononononono
Negative lookbehind (?<!regex) Matches at a position if the pattern inside the lookbehind cannot be matched ending at that position. (?<!s)t matches the second t in streets. YESYESYESYESYESYESYESYESYESYESnoYESYES1.9noECMAnononononononono
Lookbehind (?<=regex|longer regex) Alternatives inside lookbehind can differ in length. (?<=is|e)t matches the second and fourth t in twisty streets. YESYESYES5.30YESYESYESYESYESYESn/aYESno1.9n/aECMA
1.38–1.43
n/an/an/an/an/an/an/an/a
Lookbehind (?<=x{n,m}) Quantifiers with a finite maximum number of repetitions can be used inside lookbehind. (?<=s\w{1,7})t matches only the fourth t in twisty streets. YESYES6
4 fail
5.30 failnononononoYESn/aYESnonon/anon/an/an/an/an/an/an/an/a
Lookbehind (?<=regex) The full regular expression syntax can be used inside lookbehind. (?<=s\w+)t matches only the fourth t in twisty streets. YESYES13nonononononoYESn/aYESnonon/anon/an/an/an/an/an/an/an/a
Lookbehind (group)(?<=\1) Backreferences can be used inside lookbehind. Syntax prohibited in lookbehind is also prohibited in the referenced capturing group. (\w).+(?<=\1) matches twisty street in twisty streets. YESYESnonono10.237.3.0no4.0.0YESn/aYES3.5non/anon/an/an/an/an/an/an/an/a
Keep text out of the regex match \K The text matched by the part of the regex to the left of the \K is omitted from the overall regex match. Other than that the regex is matched normally from left to right. Capturing groups to the left of the \K capture as usual. s\Kt matches only the first t in streets. V2nono5.107.2YES5.2.4YESYESnononono2.0noECMA
1.42–1.85
nononononononono
Lookaround conditional (?(?=regex)then|else) where (?=regex) is any valid lookaround and then and else are any valid regexes If the lookaround succeeds, the “then” part must match for the overall regex to match. If the lookaround fails, the “else” part must match for the overall regex to match. The lookaround is zero-length. The “then” and “else” parts consume their matches like normal regexes. (?(?<=a)b|c) matches the second b and the first c in babxcac YESYESnoYESYESYESYESYESYESnonononononoECMAnononononononono
Implicit lookahead conditional (?(regex)then|else) where regex, then, and else are any valid regexes and regex is not the name of a capturing group If “regex” is not the name of a capturing group, then it is interpreted as a lookahead as if you had written (?(?=regex)then|else). If the lookahead succeeds, the “then” part must match for the overall regex to match. If the lookahead fails, the “else” part must match for the overall regex to match. The lookaround is zero-length. The “then” and “else” parts consume their matches like normal regexes. (?(\d{2})7|c) matches the first 7 and the c in 747c noYESnononononononononononononononononononononono
Named conditional (?(name)then|else) where name is the name of a capturing group and then and else are any valid regexes If the capturing group with the given name took part in the match attempt thus far, the “then” part must match for the overall regex to match. If the capturing group did not take part in the match thus far, the “else” part must match for the overall regex to match. (?<one>a)?(?(one)b|c) matches ab, the first c, and the second c in babxcac YESYESnono6.7YES5.2.0YESYESnononoYESnonononononononononono
Named conditional (?(<name>)then|else) where name is the name of a capturing group and then and else are any valid regexes If the capturing group with the given name took part in the match attempt thus far, the “then” part must match for the overall regex to match. If the capturing group did not take part in the match thus far, the “else” part must match for the overall regex to match. (?<one>a)?(?(<one>)b|c) matches ab, the first c, and the second c in babxcac V2nono5.107.0YES5.2.2YESYESnononono2.0noECMA
1.42–1.85
nononononononono
Named conditional (?('name')then|else) where name is the name of a capturing group and then and else are any valid regexes If the capturing group with the given name took part in the match attempt thus far, the “then” part must match for the overall regex to match. If the capturing group did not take part in the match thus far, the “else” part must match for the overall regex to match. (?'one'a)?(?('one')b|c) matches ab, the first c, and the second c in babxcac V2nono5.107.0YES5.2.2YESYESnononono2.0noECMA
1.42–1.85
nononononononono
Conditional (?(1)then|else) where 1 is the number of a capturing group and then and else are any valid regexes If the referenced capturing group took part in the match attempt thus far, the “then” part must match for the overall regex to match. If the capturing group did not take part in the match thus far, the “else” part must match for the overall regex to match. (a)?(?(1)b|c) matches ab, the first c, and the second c in babxcac YESYESnoYESYESYESYESYESYESnononoYES2.0noECMAnononononononono
Relative conditional (?(-1)then|else) where -1 is a negative integer and then and else are any valid regexes Conditional that tests the capturing group that can be found by counting as many opening parentheses of named or numbered capturing groups as specified by the number from right to left starting immediately before the conditional. If the referenced capturing group took part in the match attempt thus far, the “then” part must match for the overall regex to match. If the capturing group did not take part in the match thus far, the “else” part must match for the overall regex to match. (a)?(?(-1)b|c) matches ab, the first c, and the second c in babxcac V2nonono7.2YES5.2.4YESYESnonononononononononononononono
Forward conditional (?(+1)then|else) where +1 is a positive integer and then and else are any valid regexes Conditional that tests the capturing group that can be found by counting as many opening parentheses of named or numbered capturing groups as specified by the number from left to right starting at the “then” part of conditional. If the referenced capturing group took part in the match attempt thus far, the “then” part must match for the overall regex to match. If the capturing group did not take part in the match thus far, the “else” part must match for the overall regex to match. ((?(+1)b|c)(d)?){2} matches cc and cdb in bdbdccxcdcxcdb V2nonono7.2YES5.2.4YESYESnonononononononononononononono
Conditional (?(+1)then|else) where 1 is the number of a capturing group and then and else are any valid regexes The + is ignored and the number is taken as an absolute reference to a capturing group. If the referenced capturing group took part in the match attempt thus far, the “then” part must match for the overall regex to match. If the capturing group did not take part in the match thus far, the “else” part must match for the overall regex to match. (a)?(?(+1)b|c) matches ab, the first c, and the second c in babxcac nonononononononononononoYESnonononononononononono
FeatureSyntaxDescriptionExampleJGsoft .NET Java Perl PCRE PCRE2 PHP Delphi R JavaScript VBScript XRegExp Python Ruby std::regex Boost Tcl ARE POSIX BRE POSIX ERE GNU BRE GNU ERE Oracle XML XPath