LaggingReplicasReassignmentGoal

@mavemuri mavemuri released this 13 Nov 22:43
· 1 commit to reassign-lagging-replicas since this release
feat: reassign stuck partition replicas

Sometimes replicas stop fetching and stop updating their state in ZooKeeper for no apparent reason. So far we have seen this happen during rolling restarts, but because the root cause of the bug is still unknown, other scenarios may trigger it as well.

This is problematic because the replica stays out of the ISR until we manually reassign the partition to a broker (even the same one).

This release adds a new LaggingReplicaReassignmentGoal that tracks such replicas and reassigns them to the same brokers once the MAX_LAGGING_REPLICA_REASSIGN_MS threshold (default: 30 minutes) is reached.
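
The tracking idea can be illustrated with a minimal sketch. This is not the goal's actual implementation: the class name `LaggingReplicaTracker`, the `update` method, and the string keys used to identify replicas are all hypothetical, and the sketch assumes the threshold is measured from the first time a replica is observed out of the ISR.

```java
import java.time.Duration;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper: remembers when each replica was first seen out of the ISR
// and reports the ones that have been lagging for at least the configured threshold.
final class LaggingReplicaTracker {
    // Assumed to mirror the 30-minute default mentioned in the release notes.
    static final long DEFAULT_MAX_LAGGING_REPLICA_REASSIGN_MS =
            Duration.ofMinutes(30).toMillis();

    private final long maxLagMs;
    // Replica key (e.g. "topic-partition@brokerId", an illustrative format)
    // mapped to the timestamp at which it was first observed out of the ISR.
    private final Map<String, Long> firstSeenLaggingMs = new HashMap<>();

    LaggingReplicaTracker(long maxLagMs) {
        this.maxLagMs = maxLagMs;
    }

    // Takes the replicas currently out of the ISR and returns those that are due
    // for reassignment to the same broker.
    List<String> update(Set<String> outOfIsrReplicas, long nowMs) {
        // Forget replicas that have rejoined the ISR since the last check.
        firstSeenLaggingMs.keySet().retainAll(outOfIsrReplicas);

        List<String> due = new ArrayList<>();
        for (String replica : outOfIsrReplicas) {
            long firstSeen = firstSeenLaggingMs.computeIfAbsent(replica, r -> nowMs);
            if (nowMs - firstSeen >= maxLagMs) {
                due.add(replica);
            }
        }
        // Restart the clock for replicas that are about to be reassigned.
        firstSeenLaggingMs.keySet().removeAll(due);
        return due;
    }
}
```

A caller would invoke `update` periodically with the current set of out-of-ISR replicas and submit a same-broker reassignment for each entry returned. Resetting the clock after a replica is reported is a design choice of this sketch, so that a replica that stays stuck is retried only after another full threshold period.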

See also: https://lists.apache.org/thread.html/rbfe9557a4dd8604cffce369e76cc74f90ff8f717f934e6e8b5141053%40%3Cusers.kafka.apache.org%3E