Conversation
```diff
- auto itr = std::find(m_vecepochs.begin(), m_vecepochs.end(), m_epochNext+1);
- if (itr == m_vecepochs.end())
+ auto itr = std::find(m_listepochs.begin(), m_listepochs.end(), m_epochNext+1);
```
Would it be possible to use an unordered set? You could still iterate in O(N) and insert/delete in O(1), but this find operation would be O(1) instead of O(N).
We rely on sorted order, though.
What about an ordered set? Insertion/deletion would be logarithmic instead of constant, but so would this find.
Deletion is O(log n) though; that would regress us to O(n log n) performance instead of the O(n) we currently have, so it's not a net benefit.
I verified this fix on my cloud environment. It is now much better; the '0 qps' issue cannot be reproduced during the fullsync replication cycle. Thanks John. By the way, the following changes would also help minimize CPU usage during the GC cycle; I suggest we add the change to the KeyDB core as well: (gc.h) class GarbageCollector
Because deletion from a vector is a linear operation, the loop inside endEpoch() is O(n^2). It was written this way to be safe while deleting from the vector, but it can cause serious consequences during large GC epochs where large numbers of epochs need to be cleared.

The fix is to move to a list data structure with O(1) deletion, which is the more appropriate data structure for this operation.