You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
we have ADO pipelines those are configured to use self-hosted agents and those running on AKS as a pod, we have used KEDA to auto scale based on no.of jobs from ADO and each job it triggers new pod or agent on AKS.
it works perfectly for small jobs where those jobs completed with in 10-15 or 30 mins, if job taking too long like more than 40 mins or 102 hrs, those jobs are failing with stopped hearing from agent without any reason.
we don't have time set limits to kill the pod, we have bigger AKS cluster with multiple nodes, still pods getting killed after 40 mins sometime 1-2 hours and so on.
Expected Behavior
it supposed to kill the agent or pod only when there is no activity on agent for more then 15 mins.
Actual Behavior
agent supposed run job fully without killing pod.
Steps to Reproduce the Problem
Setup ADO pipeline that will run job on AKS with KEDA auto scaling
KEDA will be using ADO PAT token to connect
KEDA pod should be killed if no activity for more than 15 mins
Logs from KEDA operator
##[error]We stopped hearing from agent deployment-687dd7d466-vtmx4. Verify the agent machine is running and has a healthy network connection. Anything that terminates an agent process, starves it for CPU, or blocks its network access can cause this error. For more information, see: https://go.microsoft.com/fwlink/?linkid=846610
KEDA Version
2.9.3
Kubernetes Version
1.25
Platform
Microsoft Azure
Scaler Details
Azure Pipelines
Anything else?
No response
The text was updated successfully, but these errors were encountered:
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.
stalebot
added
the
stale
All issues that are marked as stale due to inactivity
label
Dec 4, 2023
Report
we have ADO pipelines those are configured to use self-hosted agents and those running on AKS as a pod, we have used KEDA to auto scale based on no.of jobs from ADO and each job it triggers new pod or agent on AKS.
it works perfectly for small jobs where those jobs completed with in 10-15 or 30 mins, if job taking too long like more than 40 mins or 102 hrs, those jobs are failing with stopped hearing from agent without any reason.
we don't have time set limits to kill the pod, we have bigger AKS cluster with multiple nodes, still pods getting killed after 40 mins sometime 1-2 hours and so on.
Expected Behavior
it supposed to kill the agent or pod only when there is no activity on agent for more then 15 mins.
Actual Behavior
agent supposed run job fully without killing pod.
Steps to Reproduce the Problem
Logs from KEDA operator
KEDA Version
2.9.3
Kubernetes Version
1.25
Platform
Microsoft Azure
Scaler Details
Azure Pipelines
Anything else?
No response
The text was updated successfully, but these errors were encountered: