Use less memory for constraint lookup radix cache by droe · Pull Request #860 · zmap/zmap

droe · 2024-04-26T20:24:43Z

Reduce the size of the precomputed array of prefixes from 4 MB to 1 MB in order to fit into the cache of CPUs with lower amounts of cache. Improves send rate on systems where send rate is CPU/memory-bound and cache is limited.

With the default blocklist, this changes the array/tree tradeoff from 3702243328 IPs in radix array, 15104 IPs in tree to 3702194176 IPs in radix array, 64256 IPs in tree. That seems like a reasonable price to pay for the perf boost of reducing the memory footprint to one fourth. On a system with 8 core Intel Atom, 8*2 MB L2 cache, 10 GbE NIC, netmap, AES-NI, before this change, send rate was 11% below what the NIC can do, while with this change, zmap pushes packets faster than the NIC can send them. You may want to test this change on higher end systems before merging, to assert that it does not perform substantially worse on different system configurations.

Reduce size of precomputed array of prefixes from 4 MB to 1 MB in order to fit into the cache of CPUs with lower amounts of cache. Improves send rate on systems where send rate is CPU/memory-bound.

droe · 2024-04-26T20:29:09Z

Test failures look unrelated to change under test.

zakird · 2024-04-27T14:40:57Z

Sounds fairly reasonable. @phillip-stephens can you confirm whether there are any performance implications on more resourced systems (e.g., one of our boxes)?

phillip-stephens · 2024-04-29T15:40:02Z

@zakird Doesn't look like any negative performance implications on my VM with plenty of cores/RAM and a large bandwidth uplink.
Test Command - sudo ./src/zmap -p 80 -t 20 -T 6 -B 10G -o /dev/null

main Branch Results

0:29 100% (0s left); send: 52595072 done (2.63 Mp/s avg); recv: 695316 0 p/s (24.0 Kp/s avg); drops: 0 p/s (0 p/s avg); hitrate: 1.32%
0:29 100% (0s left); send: 56374656 done (2.82 Mp/s avg); recv: 743188 0 p/s (25.6 Kp/s avg); drops: 0 p/s (0 p/s avg); hitrate: 1.32%

Pull Request #860 Results

0:28 100% (0s left); send: 54433902 done (2.72 Mp/s avg); recv: 719076 3 p/s (25.6 Kp/s avg); drops: 0 p/s (0 p/s avg); hitrate: 1.32%
0:28 100% (0s left); send: 59560768 done (2.98 Mp/s avg); recv: 785961 5 p/s (28.0 Kp/s avg); drops: 0 p/s (0 p/s avg); hitrate: 1.32%

phillip-stephens

Tested on a more resourced VM, (not using netmap), doesn't seem to negatively impact performance there.

Use less memory for constraint lookup radix cache

9516eea

Reduce size of precomputed array of prefixes from 4 MB to 1 MB in order to fit into the cache of CPUs with lower amounts of cache. Improves send rate on systems where send rate is CPU/memory-bound.

zakird assigned phillip-stephens Apr 27, 2024

zakird added this to the ZMap 4.2 milestone Apr 27, 2024

Merge branch 'main' into droe/constraint-less-cache

1ff3db5

phillip-stephens self-requested a review April 29, 2024 15:40

phillip-stephens approved these changes Apr 29, 2024

View reviewed changes

phillip-stephens merged commit 3e5f387 into zmap:main Apr 29, 2024

droe deleted the droe/constraint-less-cache branch June 8, 2024 09:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use less memory for constraint lookup radix cache#860

Use less memory for constraint lookup radix cache#860
phillip-stephens merged 2 commits intozmap:mainfrom
droe:droe/constraint-less-cache

droe commented Apr 26, 2024

Uh oh!

droe commented Apr 26, 2024

Uh oh!

zakird commented Apr 27, 2024

Uh oh!

phillip-stephens commented Apr 29, 2024

Uh oh!

phillip-stephens left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

droe commented Apr 26, 2024

Uh oh!

droe commented Apr 26, 2024

Uh oh!

zakird commented Apr 27, 2024

Uh oh!

phillip-stephens commented Apr 29, 2024

main Branch Results

Pull Request #860 Results

Uh oh!

phillip-stephens left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants