-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
v3.4.9 podGC: onPodCompletion
not working properly
#11588
Comments
Can you check your K8s apiserver logs? |
Yeah they are available to me, but not sure what to check exactly? Just to verify nobody asked to delete the pod or (even tho Argo controller should have)? |
Can you turn on debug level log for workflow controller? |
Ah right, good idea 👍. I'll turn debug level for logs and see if I can find anything suspicious in there. |
$ k logs workflow-controller-ddfdc8d8b-zmxd7 | grep -i "Delete pods 404" -C3 | grep -i "cleaning up pod"
time="2023-08-17T10:32:42.994Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-jlj95-1340600742-agent/deletePod
time="2023-08-17T10:32:47.032Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-g98sx-1340600742-agent/deletePod
time="2023-08-17T10:32:58.886Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-575zv-1340600742-agent/deletePod
time="2023-08-17T10:33:10.937Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-txwjh-1340600742-agent/deletePod
time="2023-08-17T10:33:15.209Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-dnqzb-1340600742-agent/deletePod
time="2023-08-17T10:33:25.018Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-9k5h9-1340600742-agent/deletePod
time="2023-08-17T10:33:38.055Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-j8zvl-1340600742-agent/deletePod
time="2023-08-17T10:35:12.182Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-b4zx9-1340600742-agent/deletePod
time="2023-08-17T10:47:37.024Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/processing-sar-auto-qa-stage-gforms-czxgk-1340600742-agent/deletePod
time="2023-08-17T10:49:56.984Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-9j92x-1340600742-agent/deletePod
time="2023-08-17T10:49:57.969Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-8gjw4-1340600742-agent/deletePod
time="2023-08-17T10:51:22.181Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-5rm2x-1340600742-agent/deletePod
time="2023-08-17T10:51:24.192Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-hwczx-1340600742-agent/deletePod
time="2023-08-17T10:51:27.344Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-4wkcc-1340600742-agent/deletePod
time="2023-08-17T10:51:31.223Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-wzb9f-1340600742-agent/deletePod
time="2023-08-17T10:51:32.294Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-nvn4f-1340600742-agent/deletePod
time="2023-08-17T10:51:32.462Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-5gtt4-1340600742-agent/deletePod
time="2023-08-17T10:51:36.453Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-xw8x6-1340600742-agent/deletePod
time="2023-08-17T11:13:28.895Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-staging/ca-keystone-feature-parity-testing-s66l5-1340600742-agent/deletePod What is this But seems that this is some Argo internal naming? When I googled this thing there are few argo-workflows issues with that name and also the repo has it in some test: https://github.com/argoproj/argo-workflows/blob/v3.4.8/workflow/controller/operator_agent_test.go#L74 edit: This might not be an issue, at least there are few similar lines I see in our production environment (which is running |
Those are agent pods responsible for executing HTTP templates. |
Regarding the "Queueing Succeeded workflow" log line: does that have anything to do with this? I would think that might pertain if you were using |
These are changes to one of the workflows which didn't properly clean up all it's pods [Edit: I think these patches are not relevant to the issue, since it seems there are 409s later also when only the controller itself is doing updates.] We are using some patches in our workflow and I think those are for the rows 5-8 in this picture? There is a 409 immediately after it, but there are also some 409s later (rows 16, 18, 20) when only the argo-controller is making updates to the workflow. We are running 2 controller pods, but the leader election should make sure only one of them is making changes, so I'm not sure why it's getting these 409's (or where to find more information): - name: label-man
inputs:
parameters:
- name: label-name
- name: label-value
resource:
action: patch
mergeStrategy: json
flags:
- workflow
- "{{workflow.name}}"
manifest: |
- op: add
path: "/metadata/labels/{{inputs.parameters.label-name}}"
value: "{{inputs.parameters.label-value}}" |
Looking at the workflow-controller logs related this workflow, it seems that the time="2023-08-15T09:40:52.152Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.686Z" level=info msg="Updated phase -> Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.686Z" level=info msg="Creating pvc sqs-sar-processor-production-8pwns-workvol" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.695Z" level=info msg="Creating pvc sqs-sar-processor-production-8pwns-workvol-qa" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.711Z" level=info msg="DAG node sqs-sar-processor-production-8pwns initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.711Z" level=info msg="All of node sqs-sar-processor-production-8pwns.save-dcr-metadata dependencies [] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.712Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-4116272344 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.745Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.save-dcr-metadata (sqs-sar-processor-production-8pwns-read-s3-metadata-4116272344)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.746Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.746Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:40:53.789Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Running resourceVersion=1228756717 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.752Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.752Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.752Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.754Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-4116272344 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.767Z" level=info msg="All of node sqs-sar-processor-production-8pwns.generate-parameters dependencies [save-dcr-metadata] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.768Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-1406809507 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.810Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.generate-parameters (sqs-sar-processor-production-8pwns-generate-parameters-1406809507)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.818Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.818Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:03.869Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Running resourceVersion=1228757002 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:08.875Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-read-s3-metadata-4116272344/deletePod
time="2023-08-15T09:41:13.822Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.823Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.823Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.823Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-1406809507 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.836Z" level=info msg="All of node sqs-sar-processor-production-8pwns.download-dcr dependencies [generate-parameters] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.837Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-2695146199 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.885Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.download-dcr (sqs-sar-processor-production-8pwns-download-dcr-cli-2695146199)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.885Z" level=info msg="All of node sqs-sar-processor-production-8pwns.do-dcr-config-download dependencies [generate-parameters] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.885Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-3154072718 initialized Skipped (message: when ''false' == 'false' && 'false' == 'true'' evaluated false)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.885Z" level=info msg="All of node sqs-sar-processor-production-8pwns.save-command-api-config dependencies [generate-parameters] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.886Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-2055644340 initialized Skipped (message: when ''false' == 'true'' evaluated false)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.886Z" level=info msg="All of node sqs-sar-processor-production-8pwns.add-id-label dependencies [generate-parameters] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.886Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-3607818475 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.918Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.add-id-label (sqs-sar-processor-production-8pwns-label-man-3607818475)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.918Z" level=info msg="All of node sqs-sar-processor-production-8pwns.add-run-label dependencies [generate-parameters] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.918Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-1096920377 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.948Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.add-run-label (sqs-sar-processor-production-8pwns-label-man-1096920377)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.950Z" level=info msg="All of node sqs-sar-processor-production-8pwns.add-trace-label dependencies [generate-parameters] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.951Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-2636514637 initialized Skipped (message: when ''false' == 'true'' evaluated false)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.951Z" level=info msg="All of node sqs-sar-processor-production-8pwns.add-tracing-enabled-label dependencies [generate-parameters] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.952Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1833346002 initialized Skipped (message: when ''false' == 'true'' evaluated false)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.952Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:13.952Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:14.000Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Running resourceVersion=1228757277 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:19.007Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-generate-parameters-1406809507/deletePod
time="2023-08-15T09:41:23.889Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:23.889Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:23.889Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message=PodInitializing new.phase=Pending new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2695146199 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:23.890Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Running new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-3607818475 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:23.890Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Running new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-1096920377 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:23.907Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:23.907Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:23.949Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Running resourceVersion=1228757555 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.709Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.710Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.710Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.710Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2695146199 old.message=PodInitializing old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.710Z" level=info msg="node unchanged" namespace=argo-managed-processor-production nodeID=sqs-sar-processor-production-8pwns-3607818475 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.710Z" level=info msg="node unchanged" namespace=argo-managed-processor-production nodeID=sqs-sar-processor-production-8pwns-1096920377 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.722Z" level=info msg="All of node sqs-sar-processor-production-8pwns.pulsepower dependencies [download-dcr] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.723Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-2066690381 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.776Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.pulsepower (sqs-sar-processor-production-8pwns-pulsepower-2066690381)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.788Z" level=info msg="All of node sqs-sar-processor-production-8pwns.preprocess-qa dependencies [download-dcr] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.788Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-2513189394 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.825Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.preprocess-qa (sqs-sar-processor-production-8pwns-preprocess-qa-2513189394)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.826Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.826Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.860Z" level=warning msg="Error updating workflow: Operation cannot be fulfilled on workflows.argoproj.io \"sqs-sar-processor-production-8pwns\": the object has been modified; please apply your changes to the latest version and try again Conflict" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.860Z" level=info msg="Re-applying updates on latest version and retrying update" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.955Z" level=info msg="Update retry attempt 1 successful" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:39.955Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Running resourceVersion=1228757963 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:44.961Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-download-dcr-cli-2695146199/deletePod
time="2023-08-15T09:41:49.778Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:49.779Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:49.779Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message=PodInitializing new.phase=Pending new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2066690381 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:49.779Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-1096920377 old.message= old.phase=Running old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:49.779Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message=PodInitializing new.phase=Pending new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2513189394 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:49.779Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-3607818475 old.message= old.phase=Running old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:49.792Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:49.792Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:49.835Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Running resourceVersion=1228758221 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:41:54.839Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-label-man-3607818475/deletePod
time="2023-08-15T09:41:54.839Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-label-man-1096920377/deletePod
time="2023-08-15T09:42:00.100Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.100Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.100Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.101Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2513189394 old.message=PodInitializing old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.101Z" level=info msg="Pod failed: Error (exit code 1)" displayName=pulsepower namespace=argo-managed-processor-production pod=sqs-sar-processor-production-8pwns-pulsepower-2066690381 templateName=pulsepower workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.101Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.101Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message="Error (exit code 1)" new.phase=Failed new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2066690381 old.message=PodInitializing old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.114Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-3649619691 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.115Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-3051136182 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.115Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1555690789 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.115Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1590775209 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.116Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1977079561 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.117Z" level=info msg="All of node sqs-sar-processor-production-8pwns.slack-notify-abort-preprocess-check-failure dependencies [pulsepower] completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.118Z" level=info msg="Retry node sqs-sar-processor-production-8pwns-515552909 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.118Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-2469401052 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.171Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.slack-notify-abort-preprocess-check-failure(0) (sqs-sar-processor-production-8pwns-slack-send-2469401052)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.172Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-3480997215 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.172Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-2003323528 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.172Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-690644360 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.172Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1277658783 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.172Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-3569950184 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.172Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1038190901 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.173Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-2793774353 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.173Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-566304367 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.173Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-2813658222 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.173Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-975619886 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.173Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1335356588 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.173Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-31247671 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.173Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1293817112 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.174Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-395271223 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.174Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1309994339 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.174Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-724898383 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.174Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1567701854 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.174Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-1961045326 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.174Z" level=info msg="Skipped node sqs-sar-processor-production-8pwns-2334882381 initialized Omitted (message: omitted: depends condition not met)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.174Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.174Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:00.227Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Running resourceVersion=1228758483 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:05.235Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-pulsepower-2066690381/deletePod
time="2023-08-15T09:42:05.236Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-preprocess-qa-2513189394/deletePod
time="2023-08-15T09:42:10.185Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.185Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.186Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.186Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2469401052 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.201Z" level=info msg="node sqs-sar-processor-production-8pwns-515552909 phase Running -> Succeeded" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.201Z" level=info msg="node sqs-sar-processor-production-8pwns-515552909 finished: 2023-08-15 09:42:10.201540523 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.210Z" level=info msg="Outbound nodes of sqs-sar-processor-production-8pwns set to [sqs-sar-processor-production-8pwns-1555690789 sqs-sar-processor-production-8pwns-3607818475 sqs-sar-processor-production-8pwns-1590775209 sqs-sar-processor-production-8pwns-1096920377 sqs-sar-processor-production-8pwns-1977079561 sqs-sar-processor-production-8pwns-2636514637 sqs-sar-processor-production-8pwns-1833346002 sqs-sar-processor-production-8pwns-2513189394 sqs-sar-processor-production-8pwns-2469401052 sqs-sar-processor-production-8pwns-2003323528 sqs-sar-processor-production-8pwns-1038190901 sqs-sar-processor-production-8pwns-2793774353 sqs-sar-processor-production-8pwns-566304367 sqs-sar-processor-production-8pwns-2813658222 sqs-sar-processor-production-8pwns-975619886 sqs-sar-processor-production-8pwns-1335356588 sqs-sar-processor-production-8pwns-395271223 sqs-sar-processor-production-8pwns-724898383 sqs-sar-processor-production-8pwns-1567701854 sqs-sar-processor-production-8pwns-2334882381]" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.211Z" level=info msg="node sqs-sar-processor-production-8pwns phase Running -> Failed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.211Z" level=info msg="node sqs-sar-processor-production-8pwns finished: 2023-08-15 09:42:10.211025708 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.211Z" level=info msg="Checking daemoned children of sqs-sar-processor-production-8pwns" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.211Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.211Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.211Z" level=info msg="Running OnExit handler: exit-handler" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.211Z" level=info msg="Steps node sqs-sar-processor-production-8pwns-411787230 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.211Z" level=info msg="StepGroup node sqs-sar-processor-production-8pwns-793041056 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.212Z" level=info msg="Retry node sqs-sar-processor-production-8pwns-3379985614 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.212Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-1389842557 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.252Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.onExit[0].exit-notification(0) (sqs-sar-processor-production-8pwns-slack-send-1389842557)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.252Z" level=info msg="Workflow step group node sqs-sar-processor-production-8pwns-793041056 not yet completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:10.308Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Running resourceVersion=1228758769 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:15.316Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-slack-send-2469401052/deletePod
time="2023-08-15T09:42:20.264Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.265Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.265Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.265Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-1389842557 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.276Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.276Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.276Z" level=info msg="Running OnExit handler: exit-handler" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.277Z" level=info msg="node sqs-sar-processor-production-8pwns-3379985614 phase Running -> Succeeded" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.277Z" level=info msg="node sqs-sar-processor-production-8pwns-3379985614 finished: 2023-08-15 09:42:20.277468408 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.277Z" level=info msg="Step group node sqs-sar-processor-production-8pwns-793041056 successful" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.277Z" level=info msg="node sqs-sar-processor-production-8pwns-793041056 phase Running -> Succeeded" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.277Z" level=info msg="node sqs-sar-processor-production-8pwns-793041056 finished: 2023-08-15 09:42:20.277955807 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.278Z" level=info msg="Outbound nodes of sqs-sar-processor-production-8pwns-3379985614 is [sqs-sar-processor-production-8pwns-1389842557]" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.278Z" level=info msg="Outbound nodes of sqs-sar-processor-production-8pwns-411787230 is [sqs-sar-processor-production-8pwns-1389842557]" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.278Z" level=info msg="node sqs-sar-processor-production-8pwns-411787230 phase Running -> Succeeded" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.278Z" level=info msg="node sqs-sar-processor-production-8pwns-411787230 finished: 2023-08-15 09:42:20.278547127 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.278Z" level=info msg="Checking daemoned children of sqs-sar-processor-production-8pwns-411787230" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.278Z" level=info msg="Updated phase Running -> Failed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.278Z" level=info msg="Marking workflow completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.279Z" level=info msg="Marking workflow as pending archiving" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.279Z" level=info msg="Deleting PVC sqs-sar-processor-production-8pwns-workvol" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.290Z" level=info msg="Deleting PVC sqs-sar-processor-production-8pwns-workvol-qa" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.296Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-1340600742-agent/deletePod
time="2023-08-15T09:42:20.301Z" level=info msg="Removing PVC \"kubernetes.io/pvc-protection\" finalizer" claimName=sqs-sar-processor-production-8pwns-workvol namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.319Z" level=info msg="Removing PVC \"kubernetes.io/pvc-protection\" finalizer" claimName=sqs-sar-processor-production-8pwns-workvol-qa namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.335Z" level=info msg="Deleted 2/2 PVCs" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.335Z" level=info msg="Checking daemoned children of " namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:20.389Z" level=info msg="Workflow update successful" namespace=argo-managed-processor-production phase=Failed resourceVersion=1228759025 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:25.401Z" level=info msg="cleaning up pod" action=deletePod key=argo-managed-processor-production/sqs-sar-processor-production-8pwns-slack-send-1389842557/deletePod
time="2023-08-15T09:42:38.854Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:38.856Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:38.856Z" level=info msg="Workflow pod is missing" namespace=argo-managed-processor-production nodeName="sqs-sar-processor-production-8pwns.slack-notify-abort-preprocess-check-failure(0)" nodePhase=Pending recentlyStarted=false workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:38.928Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.slack-notify-abort-preprocess-check-failure(0) (sqs-sar-processor-production-8pwns-slack-send-2469401052)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:38.938Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:38.938Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.932Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.932Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.933Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.933Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2469401052 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.947Z" level=info msg="node sqs-sar-processor-production-8pwns-515552909 phase Running -> Succeeded" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.947Z" level=info msg="node sqs-sar-processor-production-8pwns-515552909 finished: 2023-08-15 09:42:48.947974914 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg="Outbound nodes of sqs-sar-processor-production-8pwns set to [sqs-sar-processor-production-8pwns-1555690789 sqs-sar-processor-production-8pwns-3607818475 sqs-sar-processor-production-8pwns-1590775209 sqs-sar-processor-production-8pwns-1096920377 sqs-sar-processor-production-8pwns-1977079561 sqs-sar-processor-production-8pwns-2636514637 sqs-sar-processor-production-8pwns-1833346002 sqs-sar-processor-production-8pwns-2513189394 sqs-sar-processor-production-8pwns-2469401052 sqs-sar-processor-production-8pwns-2003323528 sqs-sar-processor-production-8pwns-1038190901 sqs-sar-processor-production-8pwns-2793774353 sqs-sar-processor-production-8pwns-566304367 sqs-sar-processor-production-8pwns-2813658222 sqs-sar-processor-production-8pwns-975619886 sqs-sar-processor-production-8pwns-1335356588 sqs-sar-processor-production-8pwns-395271223 sqs-sar-processor-production-8pwns-724898383 sqs-sar-processor-production-8pwns-1567701854 sqs-sar-processor-production-8pwns-2334882381]" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg="node sqs-sar-processor-production-8pwns phase Running -> Failed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg="node sqs-sar-processor-production-8pwns finished: 2023-08-15 09:42:48.956507704 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg="Checking daemoned children of sqs-sar-processor-production-8pwns" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg="Running OnExit handler: exit-handler" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg="Steps node sqs-sar-processor-production-8pwns-411787230 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.956Z" level=info msg="StepGroup node sqs-sar-processor-production-8pwns-793041056 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.957Z" level=info msg="Retry node sqs-sar-processor-production-8pwns-3379985614 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.957Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-1389842557 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.997Z" level=info msg="Created pod: sqs-sar-processor-production-8pwns.onExit[0].exit-notification(0) (sqs-sar-processor-production-8pwns-slack-send-1389842557)" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:48.997Z" level=info msg="Workflow step group node sqs-sar-processor-production-8pwns-793041056 not yet completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:49.075Z" level=warning msg="Error updating workflow: Operation cannot be fulfilled on workflows.argoproj.io \"sqs-sar-processor-production-8pwns\": the object has been modified; please apply your changes to the latest version and try again Conflict" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:49.075Z" level=info msg="Re-applying updates on latest version and retrying update" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:49.121Z" level=info msg="Failed to re-apply update" error="must never update completed workflows" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.002Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.004Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.005Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.005Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2469401052 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.020Z" level=info msg="node sqs-sar-processor-production-8pwns-515552909 phase Running -> Succeeded" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.020Z" level=info msg="node sqs-sar-processor-production-8pwns-515552909 finished: 2023-08-15 09:42:59.020616834 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.029Z" level=info msg="Outbound nodes of sqs-sar-processor-production-8pwns set to [sqs-sar-processor-production-8pwns-1555690789 sqs-sar-processor-production-8pwns-3607818475 sqs-sar-processor-production-8pwns-1590775209 sqs-sar-processor-production-8pwns-1096920377 sqs-sar-processor-production-8pwns-1977079561 sqs-sar-processor-production-8pwns-2636514637 sqs-sar-processor-production-8pwns-1833346002 sqs-sar-processor-production-8pwns-2513189394 sqs-sar-processor-production-8pwns-2469401052 sqs-sar-processor-production-8pwns-2003323528 sqs-sar-processor-production-8pwns-1038190901 sqs-sar-processor-production-8pwns-2793774353 sqs-sar-processor-production-8pwns-566304367 sqs-sar-processor-production-8pwns-2813658222 sqs-sar-processor-production-8pwns-975619886 sqs-sar-processor-production-8pwns-1335356588 sqs-sar-processor-production-8pwns-395271223 sqs-sar-processor-production-8pwns-724898383 sqs-sar-processor-production-8pwns-1567701854 sqs-sar-processor-production-8pwns-2334882381]" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.030Z" level=info msg="node sqs-sar-processor-production-8pwns phase Running -> Failed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.030Z" level=info msg="node sqs-sar-processor-production-8pwns finished: 2023-08-15 09:42:59.030082039 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.030Z" level=info msg="Checking daemoned children of sqs-sar-processor-production-8pwns" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.030Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.030Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.030Z" level=info msg="Running OnExit handler: exit-handler" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.030Z" level=info msg="Steps node sqs-sar-processor-production-8pwns-411787230 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.030Z" level=info msg="StepGroup node sqs-sar-processor-production-8pwns-793041056 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.031Z" level=info msg="Retry node sqs-sar-processor-production-8pwns-3379985614 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.031Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-1389842557 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.031Z" level=info msg="Workflow step group node sqs-sar-processor-production-8pwns-793041056 not yet completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.060Z" level=warning msg="Error updating workflow: Operation cannot be fulfilled on workflows.argoproj.io \"sqs-sar-processor-production-8pwns\": the object has been modified; please apply your changes to the latest version and try again Conflict" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.060Z" level=info msg="Re-applying updates on latest version and retrying update" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:42:59.099Z" level=info msg="Failed to re-apply update" error="must never update completed workflows" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.065Z" level=info msg="Processing workflow" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.066Z" level=info msg="Task-result reconciliation" namespace=argo-managed-processor-production numObjs=0 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.066Z" level=warning msg="workflow uses legacy/insecure pod patch, see https://argoproj.github.io/argo-workflows/workflow-rbac/" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.066Z" level=info msg="node changed" namespace=argo-managed-processor-production new.message= new.phase=Succeeded new.progress=0/1 nodeID=sqs-sar-processor-production-8pwns-2469401052 old.message= old.phase=Pending old.progress=0/1 workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.081Z" level=info msg="node sqs-sar-processor-production-8pwns-515552909 phase Running -> Succeeded" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.081Z" level=info msg="node sqs-sar-processor-production-8pwns-515552909 finished: 2023-08-15 09:43:09.081803628 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.093Z" level=info msg="Outbound nodes of sqs-sar-processor-production-8pwns set to [sqs-sar-processor-production-8pwns-1555690789 sqs-sar-processor-production-8pwns-3607818475 sqs-sar-processor-production-8pwns-1590775209 sqs-sar-processor-production-8pwns-1096920377 sqs-sar-processor-production-8pwns-1977079561 sqs-sar-processor-production-8pwns-2636514637 sqs-sar-processor-production-8pwns-1833346002 sqs-sar-processor-production-8pwns-2513189394 sqs-sar-processor-production-8pwns-2469401052 sqs-sar-processor-production-8pwns-2003323528 sqs-sar-processor-production-8pwns-1038190901 sqs-sar-processor-production-8pwns-2793774353 sqs-sar-processor-production-8pwns-566304367 sqs-sar-processor-production-8pwns-2813658222 sqs-sar-processor-production-8pwns-975619886 sqs-sar-processor-production-8pwns-1335356588 sqs-sar-processor-production-8pwns-395271223 sqs-sar-processor-production-8pwns-724898383 sqs-sar-processor-production-8pwns-1567701854 sqs-sar-processor-production-8pwns-2334882381]" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.093Z" level=info msg="node sqs-sar-processor-production-8pwns phase Running -> Failed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.093Z" level=info msg="node sqs-sar-processor-production-8pwns finished: 2023-08-15 09:43:09.093162956 +0000 UTC" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.093Z" level=info msg="Checking daemoned children of sqs-sar-processor-production-8pwns" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.093Z" level=info msg="TaskSet Reconciliation" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.093Z" level=info msg=reconcileAgentPod namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.093Z" level=info msg="Running OnExit handler: exit-handler" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.094Z" level=info msg="Steps node sqs-sar-processor-production-8pwns-411787230 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.094Z" level=info msg="StepGroup node sqs-sar-processor-production-8pwns-793041056 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.095Z" level=info msg="Retry node sqs-sar-processor-production-8pwns-3379985614 initialized Running" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.096Z" level=info msg="Pod node sqs-sar-processor-production-8pwns-1389842557 initialized Pending" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.096Z" level=info msg="Workflow step group node sqs-sar-processor-production-8pwns-793041056 not yet completed" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.121Z" level=warning msg="Error updating workflow: Operation cannot be fulfilled on workflows.argoproj.io \"sqs-sar-processor-production-8pwns\": the object has been modified; please apply your changes to the latest version and try again Conflict" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.122Z" level=info msg="Re-applying updates on latest version and retrying update" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:09.151Z" level=info msg="Failed to re-apply update" error="must never update completed workflows" namespace=argo-managed-processor-production workflow=sqs-sar-processor-production-8pwns
time="2023-08-15T09:43:50.594Z" level=info msg="archiving workflow" namespace=argo-managed-processor-production uid=994d8894-9f0b-4206-bd29-8d49926c7e21 workflow=sqs-sar-processor-production-8pwns |
We see a similar issue after upgrading from 3.4.4 to 3.4.9. In our case, workflows occasionally complete successfully but somehow enter error/failure state afterwards and the exit handler gets triggered again and sends a failure notification. Here is our workflow (simplified): apiVersion: argoproj.io/v1alpha1
kind: Workflow
metadata:
generateName: example-workflow-
spec:
entrypoint: main
onExit: exit-handler
podGC:
strategy: OnWorkflowCompletion
templates:
- name: main
dag: ...
- name: exit-handler
steps:
- - name: notify-failed
templateRef:
name: slack
template: notify-failed
when: "{{workflow.status}} != Succeeded" From the workflow controller logs below you can see that the workflow completes at
|
Thanks @jmaicher We ended up migrating to |
I'm seeing a very similar issue in v3.4.10 as above: Here's an example debug level log trace from the workflow controller:
From what I can see in the pod log, the pod actually succeeded without any error, but the workflow controller thought the pod was missing and thus updated the workflow status to failure. |
I can see the same issue happens in v3.4.7, v3.4.5, v3.4.4 as well. I'm using EKS with k8s version 1.24. |
Actually my error might be irrelevant because our cluster has an operator that actively cleans up pods in the Succeeded phase before workflow controller can find the pods and update the status. |
Has anyone confirmed that this is still an issue in later 3.4.x versions, e.g. 3.4.16, 3.4.17? |
podGC: onPodCompletion
not working properly
Thanks for checking! It turned out my issue was irrelevant to the workflow controller and after some changes on our end it worked fine. |
@popsu are u getting this on recent version? |
Pre-requisites
:latest
What happened/what you expected to happen?
Workflow-controller garbage collection doesn't work properly.
After we updated our Argo-workflows from
3.3.8
to3.4.9
there was issue with the argo controller failing to remove complete pods. We are usingpodGC
:OnPodCompletion
strategy for all workflows in our controller-configmap. Most of the time the pod was removed properly, but maybe 1-5% of the workflows it didn't remove the finished pod properly and the pod just stayed in the cluster as completed.When looking at the workflow-controller logs there seems to be some issues with the garbage collection, it seems to keep putting same workflow to the garbage collection queue multiple times, even tho it should only be put there once. Now the logs are constantly spamming lines like these, while previously with
3.3.8
this wasn't the case.I don't get why this workflow is put into the queue multiple times? Once should be enough, and looking at previous version logs that's what was the case.
Here's the number of log lines from the controller from our DataDog (ignore the color changes). The number of log lines was stable low in the previous version, but when we updated to
3.4.9
the number of log lines went up. At the end of the image we decided to go back to3.3.8
and the number of log lines are back to previous level.Version
v3.4.9
Paste a small workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflows that uses private images.
Logs from the workflow controller
Logs from in your workflow's wait container
The text was updated successfully, but these errors were encountered: