Fix match(col, '^…') index analysis with escaped metacharacters to avoid wrong results and crashes#79969
Merged
GrigoryPervakov merged 5 commits intoClickHouse:masterfrom Jun 6, 2025
Merged
Conversation
- Change loop to `while (pos < end)` to guard against overruns - Support all single‐char escapes: `\]`, `\}`, `\-`, `\\` alongside existing metachars - Move `++pos` into the supported‐escape case only, removing stray increments - Prevent garbage reads past `end` and ensure correct fixed‐prefix for escaped metacharacters
filimonov
commented
May 8, 2025
| default: | ||
| /// all other escape sequences are not supported | ||
| pos = end; | ||
| break; |
Contributor
Author
There was a problem hiding this comment.
after that break (which exits the current switch) the increment on line 151 was occurring, moving the pos pointer after end of string.
Contributor
Contributor
Author
|
Bugfix check fails because of Indeed that setting was missing in 25.4... |
GrigoryPervakov
approved these changes
May 12, 2025
Member
GrigoryPervakov
left a comment
There was a problem hiding this comment.
LGTM, but check your test failure. Probably some random settings broke it and it needs no-random-settings to be stable in ci
Co-authored-by: Pervakov Grigorii <[email protected]>
Member
|
Fuzzer error is not related, it reproduces in master #81431 |
Merged
via the queue into
ClickHouse:master
with commit Jun 6, 2025
79902f2
117 of 121 checks passed
zvonand
pushed a commit
to Altinity/ClickHouse
that referenced
this pull request
Jun 17, 2025
Fix match(col, '^…') index analysis with escaped metacharacters to avoid wrong results and crashes
zvonand
added a commit
to Altinity/ClickHouse
that referenced
this pull request
Jun 18, 2025
24.8.14 Backport of ClickHouse#79969 -- Fix match(col, '^…') index analysis with escaped metacharacters to avoid wrong results and crashes
Contributor
Author
|
Backports? It really can lead to crashes. |
robot-ch-test-poll3
added a commit
that referenced
this pull request
Oct 31, 2025
Cherry pick #79969 to 25.3: Fix match(col, '^…') index analysis with escaped metacharacters to avoid wrong results and crashes
robot-clickhouse
added a commit
that referenced
this pull request
Oct 31, 2025
…aped metacharacters to avoid wrong results and crashes
mkmkme
pushed a commit
to Altinity/ClickHouse
that referenced
this pull request
Nov 4, 2025
Fix match(col, '^…') index analysis with escaped metacharacters to avoid wrong results and crashes
25 tasks
zvonand
added a commit
to Altinity/ClickHouse
that referenced
this pull request
Nov 5, 2025
25.3.8 Backport of ClickHouse#79969 - Fix match(col, '^…') index analysis with escaped metacharacters to avoid wrong results and crashes
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Changelog category (leave one):
Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Fixed incorrect query results and out-of-memory crashes when using
match(column, '^…')with backslash-escaped characters.Documentation entry for user-facing changes
Previously, hitting an unsupported “\x” escape in ^… patterns would do
pos = endbut still run a stray++pos, so the code walked past the regex buffer, pulled in garbage bytes (the “�i” etc), and could OOM.Now that stray increment is removed and all single-char escapes are handled correctly.
Backports are needed.
See example of the problem in fiddle: https://fiddle.clickhouse.com/ef60eb3a-8613-4127-b5b4-bb62fc274748
Another symptom is a very quick memory allocations (up to OOM) with a traces like below: