LaggingReplicasReassignmentGoal
Latest
·
1 commit
to reassign-lagging-replicas
since this release
feat: reassign stuck partition replicas
Sometimes replicas stop fetching/updating state to ZK without any reason- so far we have seen it happen on rolling restarts but as the source of this bug is as yet unknown, there could be other scenarios as well
This is problematic as the replica will be out of ISR until we manually reassign the partition to a broker(even same one)
Adding a new LaggingReplicaReassignmentGoal that will track such replicas and reassign them to same brokers once the MAX_LAGGING_REPLICA_REASSIGN_MS(default 30 mins) is reached
See also: https://lists.apache.org/thread.html/rbfe9557a4dd8604cffce369e76cc74f90ff8f717f934e6e8b5141053%40%3Cusers.kafka.apache.org%3E