Skip to content

Improve right table insert for Partial MergeJoin on disk#10467

Merged
4ertus2 merged 10 commits intoClickHouse:masterfrom
4ertus2:joins
Apr 28, 2020
Merged

Improve right table insert for Partial MergeJoin on disk#10467
4ertus2 merged 10 commits intoClickHouse:masterfrom
4ertus2:joins

Conversation

@4ertus2
Copy link
Contributor

@4ertus2 4ertus2 commented Apr 24, 2020

I hereby agree to the terms of the CLA available at: https://yandex.ru/legal/cla/?lang=en

Changelog category (leave one):

  • Performance Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Better insert logic for right table for Partial MergeJoin.

Detailed description / Documentation draft:
Do not lock inserts and disk writes: allow 1 disk stream and parallel inserts in memory for right PMJ table (allowed memory is splitted into 2 pieces, while we flush one anoter is allowed for inserts). Use another merge-on-disk strategy (less disk i/o). Allow to set how much files to use for merge-sort on disk. Then bigger then more memory used and then less disk i/o needed. Write bigger intermediate files.

@blinkov blinkov added the pr-performance Pull request with some performance improvements label Apr 24, 2020
@4ertus2 4ertus2 changed the title [wip] Improve right table insert for Partial MergeJoin on disk Improve right table insert for Partial MergeJoin on disk Apr 25, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

pr-performance Pull request with some performance improvements

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants