-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
3.5: Slow Archived Workflows query in Controller during estimation #13382
Comments
Root cause sounds like a duplicate of #13295 I'm surprised that it impacts the Controller though; it looks like the estimator is listing out archived workflows? |
The executed query is different, but it looks like the same cause.
I think so. Here is the source code. argo-workflows/workflow/controller/estimation/estimator_factory.go Lines 79 to 85 in 7357a1b
|
Huh, I didn't know there was a fallback to archived workflows there. TIL. How often does this slow query happen? Since it's a |
From looking at Query Insights, it seems to occur as frequently as the number of workflows executed.
The workflow is not big, just a 2-step workflow. |
@linzhengen try #13131 |
Does it have a big loop or anything? As in, does it produce a large graph? If not that's bizarre that a 2-step workflow is causing that... 🤔
From #13295, it looks like the JSON extraction is not re-used and happens once for each of the fields. So that and the JSON extraction being slow in general seems to cause the slowdown. The list query changed in 3.5 for the combined Workflow List UI+API. The estimator should probably not be using the same exact query, I don't think all those fields are necessary for it. |
Adding an option to skip workflow duration estimation might be one way to work around the issue. |
…13295 Signed-off-by: linzhengen <[email protected]>
Signed-off-by: linzhengen <[email protected]>
Signed-off-by: linzhengen <[email protected]>
Signed-off-by: linzhengen <[email protected]>
…r during estimation. Fixes argoproj#13382 Signed-off-by: linzhengen <[email protected]>
…r during estimation. Fixes argoproj#13382 Signed-off-by: linzhengen <[email protected]>
…r during estimation. Fixes argoproj#13382 Signed-off-by: linzhengen <[email protected]>
…r during estimation. Fixes argoproj#13382 Signed-off-by: linzhengen <[email protected]>
…r estimation. Fixes #13382 (#13394) Signed-off-by: linzhengen <[email protected]>
I think that's a good feature to have as well, but that PR did not pass CI. If you want to get CI passing in a new PR of that, I can review it too |
…r estimation. Fixes #13382 (#13394) Signed-off-by: linzhengen <[email protected]> (cherry picked from commit 7cc20bb)
Pre-requisites
:latest
image tag (i.e.quay.io/argoproj/workflow-controller:latest
) and can confirm the issue still exists on:latest
. If not, I have explained why, in detail, in my description below.What happened? What did you expect to happen?
I have confirmed that the following logs frequently occur from the workflow-controller. After downgrading to version v3.4.17, the slow query issue did not occur.
I have not confirmed version 3.5.8 yet, but based on the differences, it seems likely that the same issue will occur.
Version(s)
v3.5.7
Paste a minimal workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflows that uses private images.
Referring to the above log
Logs from the workflow controller
Logs from in your workflow's wait container
The text was updated successfully, but these errors were encountered: