PartitionReassignmentTimeout
mavemuri
released this
06 Apr 17:58
fix/cruisecontrol: add partition movement timeout to executor

There is an edge case in which a partition leadership re-election occurs after a partition reassignment has been submitted to Kafka but before it has finished. This causes the reassignment to stall until there is another re-election. However, we do see cases where no further re-election is triggered, which leaves the partition reassignment IN_PROGRESS indefinitely and can cause new anomalies to be missed because the executor state remains INTER_BROKER_REPLICA_ACTION.

By adding a max timeout, we avoid this state by cancelling such reassignments and retrying them later.

Includes minor cleanup.
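The timeout idea can be sketched roughly as follows. This is a minimal illustration, not the code in this change: the class `ReassignmentTimeoutTracker`, its methods, and the millisecond timeout parameter are all hypothetical names, and the caller is assumed to cancel and re-queue whatever the tracker reports as timed out.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/**
 * Hypothetical helper (not part of Cruise Control): records when each
 * reassignment was submitted and reports those exceeding a max duration.
 */
final class ReassignmentTimeoutTracker {
    private final long maxMovementTimeMs;
    // Partition name -> timestamp (ms) at which its reassignment was submitted.
    private final Map<String, Long> submittedAtMs = new HashMap<>();

    ReassignmentTimeoutTracker(long maxMovementTimeMs) {
        this.maxMovementTimeMs = maxMovementTimeMs;
    }

    /** Call when a reassignment is submitted to Kafka. */
    void onSubmitted(String partition, long nowMs) {
        submittedAtMs.put(partition, nowMs);
    }

    /** Call when a reassignment finishes normally. */
    void onCompleted(String partition) {
        submittedAtMs.remove(partition);
    }

    /**
     * Removes and returns the partitions whose reassignment has been in
     * progress for longer than the timeout. The caller is expected to
     * cancel these reassignments and retry them later.
     */
    List<String> drainTimedOut(long nowMs) {
        List<String> timedOut = new ArrayList<>();
        for (Map.Entry<String, Long> e : submittedAtMs.entrySet()) {
            if (nowMs - e.getValue() > maxMovementTimeMs) {
                timedOut.add(e.getKey());
            }
        }
        for (String partition : timedOut) {
            submittedAtMs.remove(partition);
        }
        return timedOut;
    }

    /** Number of reassignments still being tracked. */
    int inProgressCount() {
        return submittedAtMs.size();
    }
}
```

Passing the current time in as an argument, rather than reading the system clock inside the tracker, keeps the sketch deterministic and easy to test. An executor loop would call `drainTimedOut` on each progress check, so a stalled reassignment cannot hold the executor in INTER_BROKER_REPLICA_ACTION past the configured limit.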