Conversation

@aldenks (Contributor) commented Jan 20, 2026

Parameterize the benchmarks with shards and no shards. Also parameterize with local and memory stores, so we have an example of a store with some modest latency.

While testing performance optimizations for sharded Zarr reads, I found that the memory store is too fast to show any difference, but the local store makes the performance improvements there very visible. The extra parameterizations do expand the test space: test_slice_indexing with benchmarking enabled now takes ~30 seconds vs ~7 seconds on main.

Parameterize with shards and no shards.
Parameterize with local + memory store to have
an example of a store which has some modest latency.
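The expanded parameter grid can be sketched roughly as follows. Note that `SHARD_SHAPES`, `STORE_KINDS`, and `expanded_cases` are hypothetical illustrative names, not the PR's actual code:

```python
import itertools

# Hypothetical sketch of the benchmark grid this PR introduces:
# every slice selection now runs with and without shards, on two stores.
SHARD_SHAPES = [None, (50, 50, 50)]  # no sharding vs a 50^3 shard shape
STORE_KINDS = ["memory", "local"]    # fast in-memory store vs filesystem store

def expanded_cases(selections):
    """Cross slice selections with shard and store parameters,
    the way pytest.mark.parametrize would."""
    return list(itertools.product(SHARD_SHAPES, selections, STORE_KINDS))

cases = expanded_cases(["full", "strided", "small"])
# 2 shard options x 3 selections x 2 stores = 12 cases
```

This 2x2 expansion (shards x stores) is what quadruples each selection's benchmark count in the report below.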
@github-actions github-actions bot added the needs release notes Automatically applied to PRs which haven't added release notes label Jan 20, 2026
@codspeed-hq (bot) commented Jan 20, 2026

Merging this PR will not alter performance

✅ 24 untouched benchmarks
🆕 24 new benchmarks
⏩ 6 skipped benchmarks¹

Performance Changes

| Mode | Benchmark | BASE | HEAD | Efficiency |
| --- | --- | --- | --- | --- |
| 🆕 WallTime | test_slice_indexing[None-(slice(0, None, 4), slice(0, None, 4), slice(0, None, 4))-memory] | N/A | 500.6 ms | N/A |
| 🆕 WallTime | test_slice_indexing[None-(slice(None, None, None), slice(None, None, None), slice(None, None, None))-memory] | N/A | 502.7 ms | N/A |
| 🆕 WallTime | test_slice_indexing[None-(slice(None, None, None), slice(None, None, None), slice(None, None, None))-local] | N/A | 937.7 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(0, None, 4), slice(0, None, 4), slice(0, None, 4))-memory] | N/A | 541.9 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(10, -10, 4), slice(10, -10, 4), slice(10, -10, 4))-memory] | N/A | 286.5 ms | N/A |
| 🆕 WallTime | test_slice_indexing[None-(slice(0, None, 4), slice(0, None, 4), slice(0, None, 4))-local] | N/A | 930.3 ms | N/A |
| 🆕 WallTime | test_slice_indexing[None-(slice(None, 10, None), slice(None, 10, None), slice(None, 10, None))-memory] | N/A | 938.9 µs | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(None, None, None), slice(None, None, None), slice(None, None, None))-memory] | N/A | 546.7 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(0, None, 4), slice(0, None, 4), slice(0, None, 4))-local] | N/A | 658.9 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(None, None, None), slice(None, None, None), slice(None, None, None))-local] | N/A | 671.5 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(None, None, None), slice(0, 3, 2), slice(0, 10, None))-memory] | N/A | 7.6 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(None, 10, None), slice(None, 10, None), slice(None, 10, None))-local] | N/A | 2.7 ms | N/A |
| 🆕 WallTime | test_slice_indexing[None-(slice(10, -10, 4), slice(10, -10, 4), slice(10, -10, 4))-memory] | N/A | 274.2 ms | N/A |
| 🆕 WallTime | test_slice_indexing[None-(slice(None, 10, None), slice(None, 10, None), slice(None, 10, None))-local] | N/A | 1.2 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(10, -10, 4), slice(10, -10, 4), slice(10, -10, 4))-local] | N/A | 486 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(0, 0, 0)-local] | N/A | 2.6 ms | N/A |
| 🆕 WallTime | test_slice_indexing[None-(slice(10, -10, 4), slice(10, -10, 4), slice(10, -10, 4))-local] | N/A | 510.9 ms | N/A |
| 🆕 WallTime | test_slice_indexing[None-(slice(None, None, None), slice(0, 3, 2), slice(0, 10, None))-memory] | N/A | 4.7 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(0, 0, 0)-memory] | N/A | 1.9 ms | N/A |
| 🆕 WallTime | test_slice_indexing[(50, 50, 50)-(slice(None, None, None), slice(0, 3, 2), slice(0, 10, None))-local] | N/A | 12.7 ms | N/A |
| ... | ... | ... | ... | ... |

ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.


Comparing aldenks:shard-slice-indexing-benchmark (0542993) with main (68359bc)


Footnotes

  1. 6 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports.

@d-v-b (Contributor) commented Jan 20, 2026

If we want more control over the latency, we could wrap the memory store with a LatencyStore here. That would keep the benchmark more isolated from whatever the OS is doing than LocalStore does. Happy to implement this if you don't mind me pushing to this branch.
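The wrapping idea can be sketched minimally as below, assuming a simple async key-value store interface. `DictStore`, the method signatures, and the latency value are illustrative stand-ins, not zarr-python's actual store classes:

```python
import asyncio

class DictStore:
    """Toy in-memory async store, for illustration only."""
    def __init__(self):
        self._data = {}

    async def set(self, key, value):
        self._data[key] = value

    async def get(self, key):
        return self._data.get(key)

class LatencyStore:
    """Delegate to an inner store, adding a fixed delay to each read so
    benchmarks see a per-request cost even on an in-memory store."""
    def __init__(self, inner, get_latency_s=0.001):
        self._inner = inner
        self._get_latency_s = get_latency_s

    async def set(self, key, value):
        await self._inner.set(key, value)

    async def get(self, key):
        await asyncio.sleep(self._get_latency_s)  # simulated round-trip
        return await self._inner.get(key)

async def demo():
    store = LatencyStore(DictStore(), get_latency_s=0.001)
    await store.set("c/0/0/0", b"\x00" * 8)
    return await store.get("c/0/0/0")
```

Because the delay is injected deterministically rather than depending on the filesystem, the benchmark isolates the read-coalescing behavior from OS page-cache effects.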

@aldenks (Contributor, Author) commented Jan 20, 2026

@d-v-b go ahead! If you make it I can't be accused of juicing my numbers by picking high latencies ;p

In all seriousness, that would be quite useful for picking a good default coalesce_max_gap_bytes.
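For context, gap-based coalescing of chunk byte ranges can be sketched like this. `coalesce_ranges` is a hypothetical illustration of what a parameter like `coalesce_max_gap_bytes` would control, not the PR's actual implementation:

```python
def coalesce_ranges(ranges, max_gap_bytes):
    """Merge (start, stop) byte ranges whose gap to the previous merged
    range is at most max_gap_bytes, trading wasted bytes for fewer requests."""
    merged = []
    for start, stop in sorted(ranges):
        if merged and start - merged[-1][1] <= max_gap_bytes:
            # Close enough: extend the previous range across the gap.
            merged[-1] = (merged[-1][0], max(merged[-1][1], stop))
        else:
            merged.append((start, stop))
    return merged

coalesce_ranges([(0, 10), (12, 20), (100, 110)], max_gap_bytes=5)
# -> [(0, 20), (100, 110)]
```

A latency-controlled store makes it straightforward to sweep the gap threshold and see where fewer, larger reads stop paying off.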
