
S3Queue ordered mode with hive partitioning#81040

Merged
kssenii merged 55 commits into ClickHouse:master from ianton-ru:storage_s3_queue_prefix
Jan 2, 2026
Conversation

@ianton-ru
Contributor

@ianton-ru ianton-ru commented May 30, 2025

Changelog category (leave one):

  • Improvement

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Tracking hive partitioning for ordered mode in S3Queue. Resolves #71161

Documentation entry for user-facing changes

Creates a record in Keeper with the last successfully processed file for every hive partition. The bucket mechanism is not modified; the logic changes only inside a bucket, so no separate synchronization is required.
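For illustration, per-partition tracking means the queue must derive a hive partition key from each object path; files that share the same partition prefix then share one last-processed record. A minimal sketch of such key extraction (a hypothetical helper, not the actual ClickHouse code):

```python
def hive_partition(object_key: str) -> str:
    """Return the hive partition prefix: the `key=value` path segments of an object key."""
    return "/".join(part for part in object_key.split("/") if "=" in part)

# Files in the same partition share a last-processed record;
# files in different partitions are tracked independently.
assert hive_partition("data/date=2025-05-30/host=a/file11.parquet") == "date=2025-05-30/host=a"
assert hive_partition("data/date=2025-05-31/host=a/file12.parquet") == "date=2025-05-31/host=a"
```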

@ianton-ru ianton-ru marked this pull request as ready for review May 30, 2025 11:48
@GrigoryPervakov GrigoryPervakov added the can be tested Allows running workflows for external contributors label May 30, 2025
@clickhouse-gh
Contributor

clickhouse-gh bot commented May 30, 2025

Workflow [PR], commit [547e34e]

@clickhouse-gh clickhouse-gh bot added the pr-improvement Pull request with some product improvements label May 30, 2025
@kssenii kssenii self-assigned this Jun 1, 2025
@ianton-ru ianton-ru force-pushed the storage_s3_queue_prefix branch from db2db26 to 2f7d37e Compare June 4, 2025 15:18
@ianton-ru
Contributor Author

Integration tests for aarch64 were interrupted by a timeout; this looks like an infrastructure-related issue.

@clickhouse-gh
Contributor

clickhouse-gh bot commented Jun 16, 2025

Workflow [PR], commit [92edb60]

Summary:

job_name / test_name                                                   status   info         comment
Stateless tests (arm_asan, targeted)                                   failure
  02473_multistep_prewhere                                             FAIL     cidb         IGNORED
  02473_multistep_split_prewhere                                       FAIL     cidb         IGNORED
Integration tests (amd_tsan, 1/6)                                      failure
  test_storage_s3_queue/test_migration.py::test_migration[1-]          FAIL     cidb         IGNORED
  test_storage_s3_queue/test_migration.py::test_migration[1-s3queue_]  FAIL     cidb         IGNORED
BuzzHouse (amd_msan)                                                   failure
  Logical error: '!table_alias.empty()' (STID: None)                   FAIL     cidb, issue  ISSUE CREATED
BuzzHouse (amd_ubsan)                                                  failure
  Logical error: Bad cast from type A to B (STID: 1635-4058)           FAIL     cidb, issue  ISSUE CREATED
AST fuzzer (arm_asan)                                                  error                 IGNORED

@ianton-ru ianton-ru force-pushed the storage_s3_queue_prefix branch from 9471804 to bdce2c4 Compare June 16, 2025 23:12
@ianton-ru
Contributor Author

Failed fast test 03373_named_session_try_recreate_before_timeout looks unrelated.

@kssenii
Member

kssenii commented Dec 1, 2025

Yes, it looks unrelated, but it blocks other tests from starting, so at the moment no other tests have actually run and none will. Could you please push an empty commit to restart the checks?

Comment on lines +92 to +97
@pytest.mark.parametrize("hosts", [1, 2])
@pytest.mark.parametrize("processing_threads_num", [1, 16])
@pytest.mark.parametrize("buckets", [1, 4])
@pytest.mark.parametrize("engine_name", ["S3Queue",
                                         "AzureQueue",
                                         ])
Member

Could you please split this test into separate ones? Flaky check fails because a single test runs for too long https://s3.amazonaws.com/clickhouse-test-reports/PRs/81040/27bf90da21b8557c55b73a9ae98cdfc01707b626//integration_tests_amd_asan_flaky/job.log
Also please avoid force-push from now on, as I've fixed private synchronization conflicts and a force push will reset my fixes.
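For context, stacking the four `parametrize` decorators above on one function produces 2 x 2 x 2 x 2 = 16 cases inside a single test, which is what makes the flaky check run too long. A quick sketch of the combinatorics, showing how splitting the `engine_name` axis into separate test functions halves the cases each function runs (illustrative only, not the actual test code):

```python
from itertools import product

hosts = [1, 2]
processing_threads_num = [1, 16]
buckets = [1, 4]
engines = ["S3Queue", "AzureQueue"]

# One test function parametrized over all four axes: 16 cases in one test.
combined = list(product(hosts, processing_threads_num, buckets, engines))
assert len(combined) == 16

# Split into one test function per engine: 8 cases each.
per_engine = {e: list(product(hosts, processing_threads_num, buckets)) for e in engines}
assert all(len(cases) == 8 for cases in per_engine.values())
```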

@ianton-ru
Contributor Author

We need to wait after each data portion.
I need to find another way to speed up the tests...

@ianton-ru
Contributor Author

Found a bug in the test; it should be faster now.

@ianton-ru
Contributor Author

ianton-ru commented Jan 2, 2026

Failed tests look unrelated.

@kssenii kssenii added this pull request to the merge queue Jan 2, 2026
Merged via the queue into ClickHouse:master with commit fefb9e1 Jan 2, 2026
125 of 131 checks passed
@robot-ch-test-poll2 robot-ch-test-poll2 added the pr-synced-to-cloud The PR is synced to the cloud repo label Jan 2, 2026
Comment on lines +1029 to +1032
if (code.value() == Coordination::Error::ZBADVERSION && retry_count < retry_limit)
{
++retry_count;
LOG_INFO(log, "Keeper Bad Version error, other node wrote something, retry {}", retry_count);
Member

@ianton-ru, hi, could you please clarify why we need this check? As we use persistent nodes for the processing/bucket nodes, this error should not happen; otherwise it would mean duplicated data, which must not happen, and this retry could just hide some bugs.

Contributor Author

@ianton-ru ianton-ru Jan 8, 2026

Scenario with multiple nodes:

  • two hive partitions: part1 with file file11 and part2 with file file21
  • records in Keeper have version 1
  • node1 processes files file11 and file21
  • meanwhile a new file file12 is added to part1
  • node2 processes file file12
  • node1 completes processing and starts to build its records list; it extracts the versions for both parts, and both have version 1
  • node2 completes processing and starts to build its records list, also with version 1
  • node2 commits its records without errors, and the record in Keeper for part1 gets version 2
  • node1 tries to commit its records as a single transaction and gets a Bad Version error for part1
    Without hive partitions, node1 would not need to write anything after that, because node2 already wrote more recent information about the last processed file.
    But with hive partitions, node1 still needs to write the last processed file for part2.
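The scenario above can be modeled with a toy in-memory Keeper. Everything here is hypothetical (the FakeKeeper class, the part1/part2 paths, and using lexicographic file-name order as a stand-in for "newer file"); it only sketches the optimistic versioned multi-write and the Bad Version retry that recommits the surviving partition records:

```python
class BadVersion(Exception):
    pass

class FakeKeeper:
    """Toy stand-in for Keeper: path -> (value, version), with an all-or-nothing multi()."""
    def __init__(self, nodes):
        self.nodes = dict(nodes)

    def get(self, path):
        return self.nodes.get(path, ("", 0))

    def multi(self, ops):
        # Check every expected version first, like a Keeper multi request...
        for path, _, expected in ops:
            if self.get(path)[1] != expected:
                raise BadVersion(path)
        # ...then apply all writes atomically.
        for path, value, expected in ops:
            self.nodes[path] = (value, expected + 1)

keeper = FakeKeeper({"part1": ("file10", 1), "part2": ("file20", 1)})

# node1 snapshots versions (both 1) before committing file11 and file21.
node1_ops = [("part1", "file11", 1), ("part2", "file21", 1)]

# Meanwhile node2 commits the newer file12 for part1: part1's version becomes 2.
keeper.multi([("part1", "file12", 1)])

retry_count, retry_limit = 0, 5
while True:
    try:
        keeper.multi(node1_ops)
        break
    except BadVersion:
        retry_count += 1
        assert retry_count < retry_limit
        # Re-read and drop records another node already superseded,
        # but keep the rest: part2 still has to record file21.
        node1_ops = [
            (path, value, keeper.get(path)[1])
            for path, value, _ in node1_ops
            if keeper.get(path)[0] < value  # lexicographic order as "older file"
        ]

assert keeper.get("part1") == ("file12", 2)  # node2's newer file wins
assert keeper.get("part2") == ("file21", 2)  # node1 still recorded part2
```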

Member

@kssenii kssenii Jan 8, 2026

So you mean node1 was processing file /part1/file11 concurrently with node2 processing file /part1/file12?
But if file11 and file12 share the same processed_node_path, then they are in the same bucket, and a single bucket can only be processed by a single processor, which does not allow any concurrent processing (the persistent bucket lock must maintain this invariant). So the situation you describe should not happen.

Contributor Author

@ianton-ru ianton-ru Jan 9, 2026

Hmm. It looks like ClickHouse locks buckets only when the number of buckets is greater than 1 (when useBucketsForProcessing returns true).
Is a configuration with multiple nodes and a single bucket valid?

Member

@kssenii kssenii Jan 9, 2026

Is a configuration with multiple nodes and a single bucket valid?

Not valid, as it can lead to inconsistent processing (in some failure scenarios we can end up with files that will never be processed), and retries will not work properly. Currently, if processing_threads_num is >1 (it will almost always be so, as we take the default from the number of CPU cores) and buckets is disabled, then buckets are enforced automatically anyway. Moreover, in the cloud we do not allow creating tables without bucket-based processing:

if (!warned && settings_ref[Setting::cloud_mode]
    && table_metadata.getMode() == ObjectStorageQueueMode::ORDERED
    && table_metadata.buckets <= 1 && table_metadata.processing_threads_num <= 1)
{
    const std::string message = "Ordered mode in cloud without "
        "either `buckets`>1 or `processing_threads_num`>1 (works as `buckets` if it's not specified) "
        "will not work properly. Please specify them in the CREATE query. See documentation for more details.";

Contributor Author

Oh. I use "buckets=1" in the tests, so getBucketsNum returns 1, because buckets is not zero.

Contributor Author

It's a gray zone. useBucketsForProcessing checks that buckets_num>1.

Member

useBucketsForProcessing checks that buckets_num>1

Yes, but ObjectStorageQueueMetadata sets buckets_num as table_metadata->getBucketsNum, which returns what I've sent in the above message https://github.com/ClickHouse/clickhouse-private/blob/d442af4b207906e5c3600e35eb82742a1d0f2d97/src/Storages/ObjectStorageQueue/ObjectStorageQueueTableMetadata.h#L102-L106

I use "buckets=1" in tests,

In this case, yes, it will not work; this is an omission which needs to be fixed. But by default buckets = 0, so if the user does not touch this setting and only changes processing_threads_num, then buckets will be correctly set to the value of processing_threads_num.
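The resolution rule described in this thread can be condensed into a small sketch (hypothetical function names, a toy model of the described behavior rather than the actual ClickHouse code):

```python
def effective_buckets(buckets: int, processing_threads_num: int) -> int:
    # buckets == 0 means "unset": fall back to processing_threads_num.
    return buckets if buckets else processing_threads_num

def use_buckets_for_processing(buckets: int, processing_threads_num: int) -> bool:
    return effective_buckets(buckets, processing_threads_num) > 1

# Default buckets=0 with many threads: bucket-based processing is enforced.
assert use_buckets_for_processing(0, 16) is True
# Explicit buckets=1 disables bucket locking even with 16 threads: the omission discussed above.
assert use_buckets_for_processing(1, 16) is False
```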

Contributor Author

Removed here: #93843


Labels

  • can be tested: Allows running workflows for external contributors
  • pr-improvement: Pull request with some product improvements
  • pr-synced-to-cloud: The PR is synced to the cloud repo

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Per-prefix tracking for Ordered S3Queue

5 participants