Utilizing a Shared Parsing Pool for Multiple Parquet Streams#66253
Utilizing a Shared Parsing Pool for Multiple Parquet Streams#66253Clark0 wants to merge 5 commits intoClickHouse:masterfrom
Conversation
|
generate test files, 1 file with 50 row groups and 3 files with 2 row groups Before 2.408 s |
There was a problem hiding this comment.
Please take a look at IO/SharedThreadPools.cpp
There was a problem hiding this comment.
Hi, just checked IO/SharedThreadPool, the idea is different. This SharedParsingThreadPool is to replace the individual thread pools opened by each ParquetBlockInputFormat.
|
This is an automated comment for commit 294a88e with description of existing statuses. It's updated for the latest CI running ❌ Click here to open a full report in a separate page
Successful checks
|
|
@Clark0, a test for Delta Lake has failed. |
48de35a to
3214911
Compare
6a602ab to
e63b74e
Compare
0ab0d85 to
71a4fe9
Compare
Hi Alexey, most of the test cases are fixed. Do the remaining failed cases relate to this change? |
It was fixed recently in the master. Let's merge with the master and ensure all the checks are green. |
71a4fe9 to
294a88e
Compare
| SharedParsingThreadPool(size_t max_threads_, size_t num_streams_) | ||
| : max_threads(max_threads_), num_streams(num_streams_) | ||
| { | ||
| threads_per_stream = std::max(1ul, max_threads / std::max(num_streams, 1ul)); |
There was a problem hiding this comment.
Move implementations to .cpp
|
Dear @antonio2368, this PR hasn't been updated for a while. You will be unassigned. Will you continue working on it? If so, please feel free to reassign yourself. |
|
Dear @al13n321, this PR hasn't been updated for a while. You will be unassigned. Will you continue working on it? If so, please feel free to reassign yourself. |
|
Obsoleted by #80931 |
Changelog category (leave one):
Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Utilizing a Shared Parsing Pool for Multiple Parquet Streams, fix #65963
Documentation entry for user-facing changes
CI Settings (Only check the boxes if you know what you are doing):