refactor the global analytics bus to use a generic async batching util #13279
base: main
Conversation
Test Results - Preflight, Unit
22 363 tests ±0   20 613 ✅ ±0   16m 15s ⏱️ +29s
Results for commit b89dd37. ± Comparison against base commit b4c510a.
This pull request removes 14 and adds 14 tests. Note that renamed tests count towards both.
♻️ This comment has been updated with latest results.
Force-pushed from a9fd893 to 3037d97
Test Results (amd64) - Integration, Bootstrap
5 files   5 suites   2h 39m 23s ⏱️
Results for commit b89dd37.
♻️ This comment has been updated with latest results.
Force-pushed from 3037d97 to 080a8b4
Thanks for the addition and refactor here!
Just some questions regarding usage of the utility, suggestions for how we could integrate the current Batcher into the AsyncBatcher, and some additional context around the initial implementation.
```python
return list(filter(lambda _: condition, items))


def iter_chunks(items: list[_E], chunk_size: int) -> Generator[list[_E], None, None]:
```
Does this not have identical behaviour to itertools.batched?
(sidenote: now that we're on Python 3.13 we can finally use this utility which makes me so happy 🤓)
cool, i did not know this existed. i added this before 3.13 :-) i think it's worth keeping until the only thing we support is 3.13+ or until we use this code exclusively for the runtime
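For reference, a small sketch of the near-equivalence. The `iter_chunks` body below is an assumed implementation based only on its signature, and `itertools.batched` (available since Python 3.12) yields tuples rather than lists:

```python
import itertools
from typing import Generator, TypeVar

_E = TypeVar("_E")


def iter_chunks(items: list[_E], chunk_size: int) -> Generator[list[_E], None, None]:
    """Yield successive chunks of ``chunk_size`` items from ``items`` (assumed implementation)."""
    for i in range(0, len(items), chunk_size):
        yield items[i : i + chunk_size]


assert list(iter_chunks([1, 2, 3, 4, 5], 2)) == [[1, 2], [3, 4], [5]]
# itertools.batched yields tuples, so the results differ only in element type
assert [list(b) for b in itertools.batched([1, 2, 3, 4, 5], 2)] == [[1, 2], [3, 4], [5]]
```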
```python
max_batch_size: int
handler: BatchHandler[T]

_buffer: list[T]
```
Could we not use the original Batcher utility to collect the batches instead of re-implementing the logic? There are quite a few edge cases that the original batcher handles that could be useful here IMO,
i.e.:
```python
batcher: Batcher[T]

def add(self, item: T):
    with self._flush_lock:
        if self._closed:
            raise ValueError("Batcher is stopped, can no longer add items")
        # This call triggers the batcher's policy, meaning we should flush.
        if self.batcher.add(item):
            self._flush_lock.notify_all()
```
my main reason was to not entangle the two implementations. the batching logic is ultimately relatively simple and this way it's clearer when reading the code what exactly happens.
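For readers following along, a minimal sketch of what that self-contained batching logic can look like, using the attribute names visible in the diff. The background flush thread and interval handling are omitted; this is an illustration, not the actual implementation:

```python
import threading
from typing import Callable, Generic, TypeVar

T = TypeVar("T")


class AsyncBatcher(Generic[T]):
    """Collect items into a buffer and signal a flush thread when a batch is full (sketch)."""

    def __init__(self, handler: Callable[[list[T]], None], max_batch_size: int):
        self.handler = handler
        self.max_batch_size = max_batch_size
        self._buffer: list[T] = []
        self._flush_lock = threading.Condition()
        self._closed = False

    def add(self, item: T):
        with self._flush_lock:
            if self._closed:
                raise ValueError("Batcher is stopped, can no longer add items")
            self._buffer.append(item)
            if len(self._buffer) >= self.max_batch_size:
                # wake up the background flush thread (not shown), which drains the
                # buffer and invokes the handler outside the lock
                self._flush_lock.notify_all()
```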
```python
self._flush_lock = threading.Condition()
self._closed = False

def add(self, item: T):
```
suggestion: I think it'd be useful to know whether a call has triggered a batch, or perhaps some signal indicating that a batch has been sent. Else, how would I know whether the function has been called?
```python
vals = []
buffer = AsyncBatcher(lambda x: vals.append(x), max_batch_size=2, max_flush_interval=1000)

for data in stream_generator():
    if not (is_triggered := buffer.add(data)):
        continue
    # The BatchHandler should have been called
    do_stuff(vals)
```
> some signal indicating that a batch has been sent
the invocation of the handler is the signal. this is fundamentally the difference to Batcher. AsyncBatcher calls the handler for you.
in your case, i would just pass do_stuff as a handler, as it seems that's what you want. no need to collect values first.
```python
AsyncBatcher(do_stuff, max_batch_size=2, max_flush_interval=1000)
```

```python
# we can call the processor outside the lock so we can continue adding items into the next batch without
# waiting on the processor to return.
try:
    self.handler(batch)
```
If the handler fails, that means we lose the entire batch of data. Is this desirable?
i suppose an exception handler would be a nice mechanism to give more flexibility on what should happen with batches that fail. this is not so easy to generalize in a way that is flexible, and would introduce a lot of additional complexity that there's currently no use case for. the assumption therefore is currently that fault tolerance is the handler's responsibility.
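As an illustration of that assumption, a minimal sketch of a handler wrapper that implements its own fault tolerance. The wrapper name and retry/logging behavior are hypothetical, not part of the utility:

```python
import logging
from typing import Callable, TypeVar

T = TypeVar("T")
LOG = logging.getLogger(__name__)


def with_fault_tolerance(handler: Callable[[list[T]], None], retries: int = 2) -> Callable[[list[T]], None]:
    """Wrap a batch handler so failed batches are retried and, as a last resort, logged
    instead of being silently dropped by the batcher (hypothetical helper)."""

    def _handle(batch: list[T]) -> None:
        for attempt in range(retries + 1):
            try:
                handler(batch)
                return
            except Exception:
                LOG.warning("batch handler failed (attempt %d/%d)", attempt + 1, retries + 1, exc_info=True)
        # keep at least a record of the batch we are about to lose
        LOG.error("dropping batch of %d items after %d failed attempts", len(batch), retries + 1)

    return _handle
```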
localstack-core/localstack/services/lambda_/event_source_mapping/pollers/stream_poller.py
LGTM!
I'm going to see if I can use this batcher for some of our ESM pollers. Since it's quite a complicated use case, it'll aid some of our discussions on usage of the new utility 🙂
Motivation
While in the process of fixing an issue with the way AWS request events are batched from the pro version, I cleaned up some of the implementation by pulling out a generic batching utility and re-using it across both the GlobalAnalyticsBus and the ProServiceRequestAggregator. I also noticed that the CLI tests would break when importing the batcher because of the pydantic requirement, which I found was not really necessary if we just used a regular constructor for Batcher. Basically just cleaning up the implementation and de-duplicating some code, nothing major.
Changes
- Extracted the PublisherBuffer concept from the analytics client into an AsyncBatcher utility
- Renamed batch_policy.py to batching.py to be more consistent with our utils naming
- Gave Batcher a regular constructor and removed the pydantic dependency
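A minimal usage sketch of the new utility as described above. The import path and the time unit of max_flush_interval are assumptions based on the discussion, not confirmed by this PR:

```python
# assumed import path; the PR only states that batch_policy.py was renamed to batching.py
from localstack.utils.batching import AsyncBatcher


def publish(batch: list[dict]) -> None:
    # called by the batcher with a full batch, or with whatever accumulated when the flush interval elapses
    print(f"publishing {len(batch)} analytics events")


# flush after 10 events or after the (assumed) flush interval, whichever comes first
batcher = AsyncBatcher(publish, max_batch_size=10, max_flush_interval=5)

for i in range(25):
    batcher.add({"name": "api_call", "i": i})
```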