How do lookahead and lookbehind assertions work?

Lookahead (?=...) and lookbehind (?<=...) are zero-width assertions that match a position without consuming characters. Positive lookahead (?=pattern) asserts that what follows matches the pattern. Negative lookahead (?!pattern) asserts it does not. Lookbehind works the same way but checks what precedes the current position. For example, \d+(?= dollars) matches digits only when followed by ' dollars'.

Regex Builder - Free Online Regular Expression Builder

Q: What is a regex builder?

A regex builder is a tool that helps you construct and test regular expression patterns interactively. You enter a pattern and test text, and the tool highlights all matches in real time, shows captured groups, and displays match positions. It reduces trial-and-error by giving you immediate visual feedback as you build your pattern.

Q: What are character classes in regex?

Character classes match any single character from a defined set. Square brackets define custom classes like [aeiou] for vowels or [0-9] for digits. Shorthand classes include \d (digits), \w (word characters: letters, digits, underscore), \s (whitespace), and their negations \D, \W, \S. Ranges like [a-z] match any lowercase letter, and [A-Za-z0-9] matches any alphanumeric character.

Q: What is the difference between greedy and lazy quantifiers?

Greedy quantifiers (*, +, ?) match as much text as possible, while lazy quantifiers (*?, +?, ??) match as little as possible. For example, given the text ' bold ', the greedy pattern matches the entire string, but the lazy pattern matches only ' '. Use lazy quantifiers when you want the shortest possible match.

Q: When should I use a parser instead of regex?

Regex is not suitable for parsing nested or recursive structures like HTML, XML, JSON, or programming languages. Use a proper parser for these. Regex also struggles with context-sensitive grammars and deeply nested parentheses. A good rule of thumb: if your regex has more than two levels of grouping or requires balancing opening and closing delimiters, a parser is the better choice.

Build regular expressions with live match highlighting, common pattern templates, capture groups, and detailed match positions.

How to Use the Regex Builder

Enter your regular expression in the pattern field or click one of the common pattern buttons to start with a pre-built pattern for emails, URLs, phone numbers, IP addresses, or dates. Type or paste your test text in the text area below. As you type, matching portions of the text are highlighted in yellow in real time. The match count updates instantly, and a detailed list shows each match with its position and any captured groups.

Use the flag checkboxes to enable case-insensitive matching or multiline mode. Case-insensitive mode ignores letter case, so the pattern "hello" will match "Hello", "HELLO", and any other case combination. Multiline mode makes the anchors ^ and $ match the start and end of individual lines rather than the entire string. The global flag is always enabled so all matches are found. Click "Copy Regex Pattern" to copy the current pattern to your clipboard.

Regular Expression Fundamentals

Regular expressions are sequences of characters that define search patterns. They are used across virtually every programming language for text searching, validation, extraction, and replacement. While the syntax can appear cryptic at first, regular expressions are built from a small set of building blocks that combine to create powerful patterns. Learning these fundamentals unlocks one of the most versatile tools in a developer's toolkit.

Character Classes and Shorthand

Character classes define sets of characters to match. Square brackets create custom classes: [aeiou] matches any vowel, [0-9] matches any digit, and [A-Za-z] matches any letter. Negated classes like [^0-9] match anything except the specified characters. Shorthand classes provide convenient alternatives: \d matches digits (same as [0-9]), \w matches word characters (letters, digits, underscore), \s matches whitespace (spaces, tabs, newlines), and their uppercase counterparts \D, \W, \S match the opposite. The dot . matches any character except newline.

Quantifiers: Greedy vs. Lazy

Quantifiers control how many times a pattern element repeats. The three basic quantifiers are * (zero or more), + (one or more), and ? (zero or one). Curly braces offer precise control: {3} matches exactly three times, {2,5} matches two to five times, and {3,} matches three or more times. By default, quantifiers are greedy, meaning they match as much text as possible. Adding a ? after any quantifier makes it lazy, matching as little as possible. The distinction matters when your text contains multiple possible endpoints, like matching HTML tags where <.*> greedily captures everything between the first and last angle brackets, but <.*?> lazily captures each individual tag.

Anchors and Boundaries

Anchors match positions rather than characters. The caret ^ matches the start of a line and the dollar sign $ matches the end. Word boundary \b matches the position between a word character and a non-word character, which is useful for matching whole words: \bcat\b matches "cat" but not "concatenate". In multiline mode, ^ and $ match the start and end of each line rather than the entire string.

Groups, Captures, and Backreferences

Parentheses create groups that serve two purposes: they group elements for quantifiers, and they capture matched text for later use. The pattern (\w+)@(\w+)\.(\w+) creates three capture groups from an email address. Named groups like (?<user>\w+) improve readability. Non-capturing groups (?:pattern) group without capturing, saving memory when you only need grouping for alternation or quantifiers. Backreferences like \1 refer to previously captured text, letting you match repeated patterns like (\w+)\s+\1 which finds doubled words.

Lookahead and Lookbehind

Lookahead and lookbehind are zero-width assertions that check for patterns without consuming characters. Positive lookahead (?=pattern) succeeds if the pattern matches ahead. Negative lookahead (?!pattern) succeeds if the pattern does not match ahead. Lookbehind works similarly but checks behind: (?<=\$)\d+ matches digits preceded by a dollar sign without including the dollar sign in the match. These assertions are powerful for extracting text that appears in a specific context without including the context in the result.

Common Pitfalls and When to Use a Parser

The most common regex mistakes include forgetting to escape special characters (use \. for a literal period), catastrophic backtracking from nested quantifiers like (a+)+, and trying to parse nested structures. Regular expressions are fundamentally unable to handle recursive nesting, which means they cannot reliably parse HTML, XML, JSON, or any language with balanced delimiters. For these tasks, use a proper parser. A good rule: if your regex requires more than a few minutes to understand, a parser or a series of simpler string operations may be more maintainable.

Frequently Asked Questions

What is a regex builder?

A tool for constructing and testing regex patterns interactively with live match highlighting, capture group display, and common pattern templates.

What are character classes in regex?

Sets of characters to match. Shorthand classes include \d (digits), \w (word characters), \s (whitespace). Custom classes use brackets like [a-z].

What is the difference between greedy and lazy quantifiers?

Greedy quantifiers (*, +) match as much as possible. Lazy versions (*?, +?) match as little as possible. Add ? after any quantifier to make it lazy.

How do lookahead and lookbehind work?

Zero-width assertions that check for patterns without consuming characters. (?=pattern) looks ahead, (?<=pattern) looks behind. Useful for context-dependent matching.>

When should I use a parser instead of regex?

Use a parser for nested or recursive structures like HTML, XML, JSON, or programming languages. Regex cannot handle balanced delimiters reliably.

Regex Builder

Match Details

Embed This

How to Use the Regex Builder

Regular Expression Fundamentals

Character Classes and Shorthand

Quantifiers: Greedy vs. Lazy

Anchors and Boundaries

Groups, Captures, and Backreferences

Lookahead and Lookbehind

Common Pitfalls and When to Use a Parser

Frequently Asked Questions

What is a regex builder?

What are character classes in regex?

What is the difference between greedy and lazy quantifiers?

How do lookahead and lookbehind work?

When should I use a parser instead of regex?

Related Calculators

Regex Tester

String Encoder/Decoder

Escape/Unescape Tool

You Might Also Need

Regex Tester

Regex to English Explainer

String Encoder/Decoder

Recommended Reading

How Much Should You Tip? A Complete Tipping Guide

GPA Calculator: How to Calculate Your Grade Point Average