Some more stats, made with this script: https://gist.github.com/akuzm/d1677a2810c2a6f7cbd056de1384c7ec
A cumulative graph of relative metric changes: for each value of change, it plots the number of queries whose change is smaller than that value. We can see a significant reduction in page faults and system time, with no change in query time.
The query stability metrics didn't change much, though (these are quantiles(0, 0.25, 0.5, 0.75, 0.95, 0.99, 1) of the stddev of query run time, for queries that are not "short" (> 0.02 s)). There is some change in the higher quantiles, but this is likely noise.
I intend to merge this. Hopefully it will make the testing more predictable, if only because we won't have to look at crazy SoftPageFaults fluctuations and wonder what they mean. @alexey-milovidov @azat any comments?
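The cumulative graph described above can be sketched as follows. This is not the linked gist, only a minimal illustration of the idea: the helper names `relative_change` and `cumulative_counts` and the sample numbers are hypothetical, and the sketch assumes every old value is non-zero.

```python
from bisect import bisect_left


def relative_change(old, new):
    """Relative change of a metric between two runs, e.g. 0.1 means +10%."""
    return (new - old) / old


def cumulative_counts(changes, thresholds):
    """For each threshold, count the queries whose change is strictly less."""
    ordered = sorted(changes)
    return [bisect_left(ordered, t) for t in thresholds]


# Hypothetical per-query values of one metric (old run, new run).
old = [100.0, 200.0, 50.0, 80.0]
new = [90.0, 200.0, 25.0, 88.0]

changes = [relative_change(o, n) for o, n in zip(old, new)]
counts = cumulative_counts(changes, [-0.25, 0.0, 0.25])
print(counts)
```

Plotting the thresholds on the x axis against these counts on the y axis gives the cumulative curve; a curve shifted to the left means the metric decreased for most queries.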
Ah, forgot the most important thing. Here's a graph for MemoryResident (it's already in the test results archive, 'metrics/MemoryResident.png'): You can see that memory usage is somewhat higher, but for long tests that use a constant amount of memory, it stabilizes around the same value. Looking at the summary metrics, the median of MemoryResident is 20% higher: https://clickhouse-test-reports.s3.yandex.net/11993/8b82ff215fe4b42216aa41adf9907aba37a3817d/performance_comparison/report.html#8 . This is to be expected -- muzzy pages are memory that is freed lazily, so freeing them does not immediately reduce the RSS. But it may have some influence on our query limiting behavior, when we limit server RSS to a percentage of total available memory (
# avoid spurious latencies and additional work associated with
# MADV_DONTNEED. See
# https://github.com/ClickHouse/ClickHouse/issues/11121 for motivation.
set (JEMALLOC_CONFIG_MALLOC_CONF "percpu_arena:percpu,oversize_threshold:0,muzzy_decay_ms:10000")
Maybe it's worth enabling it for other OSes too, just in case? (Although I'm not sure they have analogs.)
I think macOS has a similar madvise call for this; I don't know about other systems. Let's enable it.
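For readers unfamiliar with the option string in the snippet under review: it is a comma-separated list of key:value pairs. The sketch below only illustrates that layout. `parse_malloc_conf` is a hypothetical helper, not jemalloc's own parser, and it assumes no option value contains a comma.

```python
def parse_malloc_conf(conf):
    """Split a 'key:value,key:value' option string into a dict of strings."""
    options = {}
    for item in conf.split(","):
        # Split on the first colon only; the rest is the value.
        key, _, value = item.partition(":")
        options[key.strip()] = value.strip()
    return options


conf = "percpu_arena:percpu,oversize_threshold:0,muzzy_decay_ms:10000"
options = parse_malloc_conf(conf)

# The decay time is given in milliseconds; convert it to seconds.
muzzy_decay_seconds = int(options["muzzy_decay_ms"]) / 1000

print(options)
print(muzzy_decay_seconds)
```

Read this way, the change sets a muzzy decay time of 10 seconds, alongside the `percpu_arena` and `oversize_threshold` options.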


Changelog category (leave one):
As a reference for other tests we've made, see #11401