Add additional checks for missing streams in Wide parts by Avogar · Pull Request #92076 · ClickHouse/ClickHouse

Avogar · 2025-12-12T17:29:49Z

Changelog category (leave one):

Not for changelog (changelog entry is not required)

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

If some files in data part are missing, we can end up with inconsistent in-memory state of some columns which can lead to crashes. It's better to throw logical errors in this case

clickhouse-gh · 2025-12-12T17:31:07Z

Workflow [PR], commit [e5e5426]

Summary: ❌

job_name	test_name	status	info	comment
Stateless tests (amd_binary, ParallelReplicas, s3 storage, parallel)		failure
	02319_lightweight_delete_on_merge_tree	FAIL	cidb, issue	ISSUE EXISTS
Stateless tests (amd_debug, distributed plan, s3 storage, parallel)		failure
	02319_lightweight_delete_on_merge_tree	FAIL	cidb, issue	ISSUE EXISTS
Integration tests (arm_binary, distributed plan, 1/4)		failure
	test_s3_plain_rewritable/test.py::test[cache_s3_plain_rewritable-data/]	FAIL	cidb, issue	ISSUE EXISTS
BuzzHouse (amd_debug)		failure
	Logical error: 'Inconsistent AST formatting: the query: (STID: 1941-1bfa)	FAIL	cidb, issue	ISSUE EXISTS
BuzzHouse (arm_asan)		failure
	Logical error: Function writeSlice expects same column types for GenericArraySlice and GenericArraySink. (STID: 3276-468b)	FAIL	cidb	IGNORED

…-check-for-unexpected-streams

azat · 2025-12-21T08:24:34Z

Hm, there are still some issues - https://s3.amazonaws.com/clickhouse-test-reports/json.html?PR=92076&sha=latest&name_0=PR&name_1=Stateless+tests+%28amd_asan%2C+distributed+plan%2C+parallel%2C+2%2F2%29

Logical error: Stream A for column B is not found (STID: 3190-622d)

Avogar · 2025-12-23T13:18:19Z

Tests are good now

azat · 2025-12-23T18:02:14Z

src/DataTypes/Serializations/SerializationNullable.cpp

    if (null_map->size() != nested_column->size())
        throw Exception(
-            ErrorCodes::INCORRECT_DATA,
+            settings.native_format ? ErrorCodes::INCORRECT_DATA : ErrorCodes::LOGICAL_ERROR,


Hm, why it is OK (INCORRECT_DATA over LOGICAL_ERROR) for native format? (here and below)

Because user can try to read corrupted data in Native format and we don't want to throw LOGICAL_ERROR when we try to read some corrupted user data.

But these functions can be called only for Native format? Am I missing something?

I guess native_format == false is when we reading MergeTree data, and in this case we will haveLOGICAL_ERROR with this patch, and for Native over network it will be INCORRECT_DATA?

(Note, StorageMemory also uses NativeReader which will use INCORRECT_DATA as well)

I guess native_format == false is when we reading MergeTree data, and in this case we will haveLOGICAL_ERROR with this patch, and for Native over network it will be INCORRECT_DATA?

Yes. And over TCP protocol we can also receive some corrupted data in Native format from different language clients.

My intention for these changes is actually to throw logical error in reading from MergeTree, because in serializations we have logic of skipping reading data if returned buffer for some stream is nullptr. And without such checks it may lead to inconsistent in-memory state of the columns and crashes in random places if we have some bug or missing file in the data part.

Cherry pick #92076 to 25.12: Add additional checks for missing streams in Wide parts

…n Wide parts

Backport #92076 to 25.12: Add additional checks for missing streams in Wide parts

… Wide parts

Cherry pick #92076 to 25.8: Add additional checks for missing streams in Wide parts

… Wide parts

Cherry pick #92076 to 25.10: Add additional checks for missing streams in Wide parts

…n Wide parts

Cherry pick #92076 to 25.11: Add additional checks for missing streams in Wide parts

…n Wide parts

Cherry pick #92076 to 25.3: Add additional checks for missing streams in Wide parts

Backport #92076 to 25.11: Add additional checks for missing streams in Wide parts

Backport #92076 to 25.10: Add additional checks for missing streams in Wide parts

Backport #92076 to 25.8: Add additional checks for missing streams in Wide parts

Add additional checks for missing streams in Wide parts

18f9620

clickhouse-gh bot added the pr-not-for-changelog This PR should not be mentioned in the changelog label Dec 12, 2025

azat self-assigned this Dec 14, 2025

Avogar and others added 8 commits December 15, 2025 14:25

Fix bad check

d218878

Fix style

f8aecc3

Merge branch 'master' into better-check-for-unexpected-streams

af7591d

Merge branch 'master' of github.com:ClickHouse/ClickHouse into better…

8bf3f75

…-check-for-unexpected-streams

Throw INCORRECT_DATA in Native format instead of LOGICAL_ERROR

57dff4d

Apply change for SerializationVariant

bc9c82c

Merge branch 'master' of github.com:ClickHouse/ClickHouse into better…

90eeb05

…-check-for-unexpected-streams

Merge branch 'master' of github.com:ClickHouse/ClickHouse into better…

7619e26

…-check-for-unexpected-streams

Avogar requested a review from azat December 19, 2025 14:13

Fix check for missing streams

e5e5426

azat reviewed Dec 23, 2025

View reviewed changes

azat approved these changes Dec 29, 2025

View reviewed changes

Avogar added this pull request to the merge queue Dec 29, 2025

Merged via the queue into ClickHouse:master with commit 95e809a Dec 29, 2025
125 of 131 checks passed

Avogar deleted the better-check-for-unexpected-streams branch December 29, 2025 18:23

robot-clickhouse added the pr-synced-to-cloud The PR is synced to the cloud repo label Dec 29, 2025

Avogar added the pr-must-backport Pull request should be backported intentionally. Use this label with great care! label Jan 19, 2026

robot-ch-test-poll3 added the pr-must-backport-synced The `*-must-backport` labels are synced into the cloud Sync PR label Jan 19, 2026

robot-ch-test-poll1 added a commit that referenced this pull request Jan 19, 2026

Merge pull request #94569 from ClickHouse/cherrypick/25.12/92076

fe68ca0

Cherry pick #92076 to 25.12: Add additional checks for missing streams in Wide parts

robot-clickhouse added a commit that referenced this pull request Jan 19, 2026

Backport #92076 to 25.12: Add additional checks for missing streams i…

17cffca

…n Wide parts

robot-ch-test-poll1 mentioned this pull request Jan 19, 2026

Backport #92076 to 25.12: Add additional checks for missing streams in Wide parts #94570

Merged

Avogar added a commit that referenced this pull request Jan 21, 2026

Merge pull request #94570 from ClickHouse/backport/25.12/92076

7817ec9

Backport #92076 to 25.12: Add additional checks for missing streams in Wide parts

robot-clickhouse added a commit that referenced this pull request Jan 21, 2026

Backport #92076 to 25.3: Add additional checks for missing streams in…

d895e90

… Wide parts

robot-ch-test-poll1 mentioned this pull request Jan 21, 2026

Backport #92076 to 25.3: Add additional checks for missing streams in Wide parts #94721

Closed

robot-ch-test-poll1 added a commit that referenced this pull request Jan 21, 2026

Merge pull request #94566 from ClickHouse/cherrypick/25.8/92076

ea38d18

Cherry pick #92076 to 25.8: Add additional checks for missing streams in Wide parts

robot-clickhouse added a commit that referenced this pull request Jan 21, 2026

Backport #92076 to 25.8: Add additional checks for missing streams in…

96c6d0c

… Wide parts

robot-ch-test-poll1 mentioned this pull request Jan 21, 2026

Backport #92076 to 25.8: Add additional checks for missing streams in Wide parts #94722

Merged

robot-ch-test-poll1 added a commit that referenced this pull request Jan 21, 2026

Merge pull request #94567 from ClickHouse/cherrypick/25.10/92076

16fcda2

Cherry pick #92076 to 25.10: Add additional checks for missing streams in Wide parts

robot-clickhouse added a commit that referenced this pull request Jan 21, 2026

Backport #92076 to 25.10: Add additional checks for missing streams i…

ae3daeb

…n Wide parts

robot-ch-test-poll1 mentioned this pull request Jan 21, 2026

Backport #92076 to 25.10: Add additional checks for missing streams in Wide parts #94723

Merged

robot-ch-test-poll1 added a commit that referenced this pull request Jan 21, 2026

Merge pull request #94568 from ClickHouse/cherrypick/25.11/92076

04bac6a

Cherry pick #92076 to 25.11: Add additional checks for missing streams in Wide parts

robot-clickhouse added a commit that referenced this pull request Jan 21, 2026

Backport #92076 to 25.11: Add additional checks for missing streams i…

4b35d8b

…n Wide parts

robot-ch-test-poll1 mentioned this pull request Jan 21, 2026

Backport #92076 to 25.11: Add additional checks for missing streams in Wide parts #94724

Merged

robot-ch-test-poll1 added a commit that referenced this pull request Jan 21, 2026

Merge pull request #94565 from ClickHouse/cherrypick/25.3/92076

764dc7b

Cherry pick #92076 to 25.3: Add additional checks for missing streams in Wide parts

robot-ch-test-poll added the pr-backports-created Backport PRs are successfully created, it won't be processed by CI script anymore label Jan 21, 2026

Avogar added a commit that referenced this pull request Jan 23, 2026

Merge pull request #94724 from ClickHouse/backport/25.11/92076

02737dc

Backport #92076 to 25.11: Add additional checks for missing streams in Wide parts

clickhouse-gh bot added a commit that referenced this pull request Jan 23, 2026

Merge pull request #94723 from ClickHouse/backport/25.10/92076

aadfe6d

Backport #92076 to 25.10: Add additional checks for missing streams in Wide parts

clickhouse-gh bot added a commit that referenced this pull request Jan 23, 2026

Merge pull request #94722 from ClickHouse/backport/25.8/92076

aecd130

Backport #92076 to 25.8: Add additional checks for missing streams in Wide parts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add additional checks for missing streams in Wide parts#92076

Add additional checks for missing streams in Wide parts#92076
Avogar merged 10 commits intoClickHouse:masterfrom
Avogar:better-check-for-unexpected-streams

Avogar commented Dec 12, 2025 •

edited by azat

Loading

Uh oh!

clickhouse-gh bot commented Dec 12, 2025 •

edited by Avogar

Loading

Uh oh!

azat commented Dec 21, 2025

Uh oh!

Avogar commented Dec 23, 2025

Uh oh!

azat Dec 23, 2025 •

edited

Loading

Uh oh!

Avogar Dec 29, 2025

Uh oh!

azat Dec 29, 2025

Uh oh!

Avogar Dec 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

Avogar commented Dec 12, 2025 • edited by azat Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Uh oh!

clickhouse-gh bot commented Dec 12, 2025 • edited by Avogar Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

azat commented Dec 21, 2025

Uh oh!

Avogar commented Dec 23, 2025

Uh oh!

azat Dec 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Avogar Dec 29, 2025

Choose a reason for hiding this comment

Uh oh!

azat Dec 29, 2025

Choose a reason for hiding this comment

Uh oh!

Avogar Dec 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Avogar commented Dec 12, 2025 •

edited by azat

Loading

clickhouse-gh bot commented Dec 12, 2025 •

edited by Avogar

Loading

azat Dec 23, 2025 •

edited

Loading