Question 1

What is the difference between greedy and lazy quantifiers?

Accepted Answer

By default, quantifiers (*, +, ?) are greedy: they match as much text as possible while still allowing the overall pattern to succeed. For example, the pattern '<.+>' applied to 'bold' greedily matches the entire string 'bold' because .+ consumes as many characters as it can. Adding a question mark after the quantifier makes it lazy: '<.+?>' matches '' first, then '' separately, because .+? matches as few characters as possible. Use lazy quantifiers when you want the shortest possible match, which is common when parsing delimited content like HTML tags, quoted strings, or bracketed expressions.

Question 2

What are capture groups and how do I use them?

Accepted Answer

Capture groups are portions of a regex enclosed in parentheses. They serve two purposes: grouping (to apply quantifiers or alternation to a sub-pattern) and capturing (to extract the matched substring for later use). In the pattern '(\d{4})-(\d{2})-(\d{2})', the three capture groups extract the year, month, and day from a date string. In replacement strings, you reference captures with $1, $2, $3, etc. Non-capturing groups (?:...) provide grouping without capturing, which is useful when you need grouping for syntax but do not need the matched text.

Question 3

Why does my regex work in one language but not another?

Accepted Answer

Regular expression implementations vary between languages and engines. JavaScript does not support lookbehind assertions in older engines (though modern engines do), PCRE (used by PHP and many tools) supports features like recursive patterns that JavaScript lacks, and Python's re module has subtle differences from both. This tool uses the JavaScript regex engine, which is the standard for web development. If you are writing regex for a different language, be aware of syntax differences, especially around Unicode support, named groups, and advanced features like atomic groups and possessive quantifiers.

Question 4

How do I match special characters literally?

Accepted Answer

Characters that have special meaning in regex (such as . * + ? ^ $ { } [ ] ( ) | \ /) must be escaped with a backslash to match them literally. For example, to match a literal period, use \. instead of . (which matches any character). To match a literal backslash, use \. Inside a character class ([...]), most special characters lose their special meaning, but ] and \ still need escaping, and ^ has special meaning only at the beginning of the class. A common mistake is forgetting to escape these characters, which causes the regex to match far more text than intended.

Question 5

What are lookahead and lookbehind assertions?

Accepted Answer

Lookahead and lookbehind are zero-width assertions that check whether a pattern exists ahead of or behind the current position without including it in the match. Positive lookahead (?=...) asserts that what follows matches the pattern. Negative lookahead (?!...) asserts that what follows does not match. Positive lookbehind (?<=...) asserts that what precedes matches. Negative lookbehind (?<!...) asserts that what precedes does not match. For example, '\d+(?= dollars)' matches a number only if it is followed by ' dollars', but the word 'dollars' is not part of the match. These are powerful for matching text based on context without consuming the context itself.

Question 6

Can regular expressions match nested structures like balanced parentheses?

Accepted Answer

Standard regular expressions cannot match arbitrarily nested structures because they are equivalent to finite automata, which cannot count nesting depth. This is a fundamental limitation from formal language theory: balanced parentheses form a context-free language, which requires a pushdown automaton (essentially, a stack). Some regex engines like PCRE and .NET support recursive patterns and balancing groups that extend beyond formal regular expressions, but JavaScript's regex engine does not support these features. For parsing nested structures, use a proper parser or iterative approach instead of regex.

Regex Tester

About Regex Tester

How to Use the Regex Tester

Common Use Cases

Validating Input Formats

Extracting Data from Unstructured Text

Search and Replace in Codebases

Parsing and Transforming Log Files

Frequently Asked Questions

Related Tools

Reference