tests/datalake: deflake partition_movement_test #24661

Open
wants to merge 3 commits into
base: dev

bharathv
Contributor

@bharathv bharathv commented Dec 28, 2024

  • Fixes a subtle deadlock: if fetch positions are not updated before the memory fills up, the query thread cannot make progress (it needs correct fetch positions), so nothing drains the memory semaphore.

  • Optimizes the verification process by batching enough messages, which reduces the number of queries and lowers the frequency of fetch position updates.
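To make the deadlock described above concrete, here is a minimal single-threaded sketch. It is not the code from this PR: `BoundedBuffer`, `try_consume`, `update_position`, `try_query`, and `run` are hypothetical names, and the model assumes each buffered message holds one unit of a memory semaphore that only the query side can release.

```python
import threading


class BoundedBuffer:
    """Hypothetical stand-in for a memory-limited message buffer."""

    def __init__(self, capacity: int):
        # Assumption: each buffered message holds one semaphore unit.
        self._mem = threading.Semaphore(capacity)
        self.buffered = 0
        self.fetch_position = None  # unknown until explicitly updated

    def try_consume(self) -> bool:
        """Buffer one message; fail if no memory units are left."""
        if not self._mem.acquire(blocking=False):
            return False
        self.buffered += 1
        return True

    def update_position(self, offset: int) -> None:
        self.fetch_position = offset

    def try_query(self) -> int:
        """Release buffered messages; requires a known fetch position."""
        if self.fetch_position is None:
            return 0  # the query side cannot make progress
        drained = self.buffered
        for _ in range(drained):
            self._mem.release()
        self.buffered = 0
        return drained


def run(update_before_full: bool, capacity: int = 4) -> bool:
    """Return True if the buffer drains, False if it is stuck."""
    buf = BoundedBuffer(capacity)
    offset = 0
    while buf.try_consume():
        offset += 1
        if update_before_full:
            buf.update_position(offset)
    # Memory is now exhausted; only the query side can free it.
    return buf.try_query() > 0
```

In this model, `run(False)` gets stuck because the position is never known by the time the memory is exhausted, while `run(True)` drains because the position is updated before the buffer fills.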

Fixes https://redpandadata.atlassian.net/browse/CORE-8581

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v24.3.x
  • v24.2.x
  • v24.1.x

Release Notes

  • none

@bharathv
Contributor Author

/ci-repeat 1
dt-repeat 50
tests/rptest/tests/datalake/partition_movement_test.py::PartitionMovementTest\

This fixes a subtle deadlock where positions are not updated before the
memory fills up and the query thread cannot make progress without correct
fetch positions.

Removed the timer-based logic that periodically updates positions; it will
be further refined in the next commits.

Two main optimizations here:

- Batch enough messages so each SQL query spans a bigger range of offsets
  (reduces the number of queries).
- Avoid updating fetch positions too often: since the loop sweeps
  through every partition, the update is debounced until enough messages
  are batched.
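
The two optimizations above can be sketched roughly as follows. This is an illustration only, not the implementation in this PR: `BatchingVerifier`, `min_batch`, `on_message`, and `sweep` are hypothetical names, and the SQL query is represented by recording the offset range it would cover.

```python
from collections import defaultdict


class BatchingVerifier:
    """Hypothetical sketch: batch per-partition offsets before querying."""

    def __init__(self, min_batch: int):
        self.min_batch = min_batch
        self.pending = defaultdict(list)  # partition -> buffered offsets
        self.queries = []                 # (partition, first, last) ranges
        self.position_updates = 0

    def on_message(self, partition: int, offset: int) -> None:
        self.pending[partition].append(offset)

    def sweep(self) -> None:
        """One pass over all partitions, flushing only full batches."""
        flushed = False
        for partition, offsets in self.pending.items():
            if len(offsets) < self.min_batch:
                continue  # not enough messages yet; skip the query
            # One query covers the whole batched offset range.
            self.queries.append((partition, offsets[0], offsets[-1]))
            offsets.clear()
            flushed = True
        # Debounce: update fetch positions only when a batch was flushed.
        if flushed:
            self.position_updates += 1


verifier = BatchingVerifier(min_batch=3)
for off in range(7):
    verifier.on_message(0, off)
    verifier.sweep()
```

With `min_batch=3`, seven messages on one partition result in two range queries and two position updates rather than one of each per message; the seventh message stays pending until its batch fills.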
@bharathv
Contributor Author

/ci-repeat 1
skip-units
dt-repeat 50
tests/rptest/tests/datalake/partition_movement_test.py::PartitionMovementTest

@bharathv
Contributor Author

/ci-repeat 1
skip-units
skip-redpanda-build
dt-repeat=50
tests/rptest/tests/datalake/partition_movement_test.py::PartitionMovementTest

@bharathv bharathv marked this pull request as ready for review January 1, 2025 00:12
@bharathv bharathv requested review from mmaslankaprv and ztlpn January 1, 2025 16:46