#alice
2024-07-05 10:39:04.354 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-fed] (5.923µs)
2024-07-05 10:39:04.354 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[alice/job-psi-3-partner-0-spu] (9.069µs)
2024-07-05 10:39:04.354 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-spu] (7.534µs)
2024-07-05 10:39:04.359 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-fed] (5.552µs)
2024-07-05 10:39:04.359 INFO controller/endpoints.go:189 Updating endpoint alice/job-psi-3-partner-0-fed/453915
2024-07-05 10:39:04.359 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[alice/job-psi-3-partner-0-fed] (6.859µs)
2024-07-05 10:39:04.359 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-global] (6.055µs)
2024-07-05 10:39:04.363 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-global] (6.415µs)
2024-07-05 10:39:04.363 INFO controller/endpoints.go:189 Updating endpoint alice/job-psi-3-partner-0-global/453917
2024-07-05 10:39:04.363 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[alice/job-psi-3-partner-0-global] (8.504µs)
2024-07-05 10:39:06.565 INFO source/apiserver.go:58 Receive pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" add event from apiserver
2024-07-05 10:39:06.565 INFO source/config.go:100 Pod change merged, source=api, adds=[job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)], updates=[], deletes=[], removes=[], reconciles=[]
2024-07-05 10:39:06.565 INFO framework/pods_controller.go:263 SyncLoop ADD, source=api, pods=[job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)]
2024-07-05 10:39:06.565 INFO framework/pods_controller.go:515 Sync pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" enter
2024-07-05 10:39:06.566 INFO pod/cri_provider.go:756 CRIProvider start syncing pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)"
2024-07-05 10:39:06.566 INFO resource/volume_manager.go:178 Mount (dump) file success, path=/home/kuscia/var/pods/611303bd-7eb4-4b90-a594-a26e9f1341aa/volumes/kubernetes.io~configmap/job-psi-3-configtemplate/task-config.conf, mode=420, size=176
2024-07-05 10:39:06.566 INFO resource/volume_manager.go:103 Mount volumes map[config-template:{HostPath:/home/kuscia/var/pods/611303bd-7eb4-4b90-a594-a26e9f1341aa/volumes/kubernetes.io~configmap/job-psi-3-configtemplate ReadOnly:true Managed:true SELinuxRelabel:true}] for pod "job-psi-3-partner-0" succeed
2024-07-05 10:39:06.566 INFO kuberuntime/kuberuntime_manager.go:325 No sandbox for pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" can be found. Need to start a new one
2024-07-05 10:39:06.566 INFO kuberuntime/kuberuntime_manager.go:547 ComputePodActions got for pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)": {true true 0 nil [0] map[] []}
2024-07-05 10:39:06.567 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Node", Namespace:"alice", Name:"root-kuscia-lite-alice", UID:"root-kuscia-lite-alice", APIVersion:"", ResourceVersion:"", FieldPath:""}): type: 'Warning' reason: 'MissingClusterDNS' kubelet does not have ClusterDNS IP configured and cannot create Pod using "ClusterFirst" policy. Falling back to "Default" policy.
2024-07-05 10:39:06.567 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Pod", Namespace:"alice", Name:"job-psi-3-partner-0", UID:"611303bd-7eb4-4b90-a594-a26e9f1341aa", APIVersion:"v1", ResourceVersion:"453956", FieldPath:""}): type: 'Warning' reason: 'MissingClusterDNS' pod: "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)". kubelet does not have ClusterDNS IP configured and cannot create Pod using "ClusterFirst" policy. Falling back to "Default" policy.
2024-07-05 10:39:06.641 INFO status/status_manager.go:625 Patch status for pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)", patch={"metadata":{"uid":"611303bd-7eb4-4b90-a594-a26e9f1341aa"},"status":{"$setElementOrder/conditions":[{"type":"Initialized"},{"type":"Ready"},{"type":"ContainersReady"},{"type":"PodScheduled"}],"conditions":[{"lastProbeTime":null,"lastTransitionTime":"2024-07-05T02:39:06Z","status":"True","type":"Initialized"},{"lastProbeTime":null,"lastTransitionTime":"2024-07-05T02:39:06Z","message":"containers with unready status: [secretflow]","reason":"ContainersNotReady","status":"False","type":"Ready"},{"lastProbeTime":null,"lastTransitionTime":"2024-07-05T02:39:06Z","message":"containers with unready status: [secretflow]","reason":"ContainersNotReady","status":"False","type":"ContainersReady"}],"containerStatuses":[{"image":"secretflow-registry.cn-hangzhou.cr.aliyuncs.com/secretflow/secretflow-lite-anolis8:1.3.0b0","imageID":"","lastState":{},"name":"secretflow","ready":false,"restartCount":0,"started":false,"state":{"waiting":{"reason":"ContainerCreating"}}}],"hostIP":"172.18.0.3","qosClass":null,"startTime":"2024-07-05T02:39:06Z"}}
2024-07-05 10:39:06.649 INFO source/apiserver.go:65 Receive pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" update event from apiserver
2024-07-05 10:39:06.650 INFO source/config.go:100 Pod change merged, source=api, adds=[], updates=[], deletes=[], removes=[], reconciles=[job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)]
2024-07-05 10:39:06.650 INFO framework/pods_controller.go:276 SyncLoop RECONCILE, source=api, pods=[job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)]
2024-07-05 10:39:06.709 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Node", Namespace:"alice", Name:"root-kuscia-lite-alice", UID:"root-kuscia-lite-alice", APIVersion:"", ResourceVersion:"", FieldPath:""}): type: 'Warning' reason: 'MissingClusterDNS' kubelet does not have ClusterDNS IP configured and cannot create Pod using "ClusterFirst" policy. Falling back to "Default" policy.
2024-07-05 10:39:06.709 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Pod", Namespace:"alice", Name:"job-psi-3-partner-0", UID:"611303bd-7eb4-4b90-a594-a26e9f1341aa", APIVersion:"v1", ResourceVersion:"453956", FieldPath:""}): type: 'Warning' reason: 'MissingClusterDNS' pod: "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)". kubelet does not have ClusterDNS IP configured and cannot create Pod using "ClusterFirst" policy. Falling back to "Default" policy.
2024-07-05 10:39:06.710 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Pod", Namespace:"alice", Name:"job-psi-3-partner-0", UID:"611303bd-7eb4-4b90-a594-a26e9f1341aa", APIVersion:"v1", ResourceVersion:"453956", FieldPath:"spec.containers{secretflow}"}): type: 'Normal' reason: 'Pulled' Container image "secretflow-registry.cn-hangzhou.cr.aliyuncs.com/secretflow/secretflow-lite-anolis8:1.3.0b0" already present on machine
2024-07-05 10:39:06.865 INFO pod/cri_provider.go:348 Receive pleg event: &{ID:611303bd-7eb4-4b90-a594-a26e9f1341aa Type:ContainerStarted Data:f5e1f8500a827258dce03511e5e8ab3adb89bcf23be11bfb21a9cbac20f3066b}
2024-07-05 10:39:06.885 INFO certissuance/cert_issuance.go:302 Successfully issued certificate(server=true,client=true) forcontainer "secretflow"in pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)"
2024-07-05 10:39:06.885 INFO configrender/config_render.go:245 Render config template forcontainer "secretflow"in pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" succeed, templatePath=/home/kuscia/var/pods/611303bd-7eb4-4b90-a594-a26e9f1341aa/volumes/kubernetes.io~configmap/job-psi-3-configtemplate/task-config.conf, configPath=/home/kuscia/var/pods/611303bd-7eb4-4b90-a594-a26e9f1341aa/volumes/config-render/secretflow/config-template/task-config.conf
2024-07-05 10:39:06.885 INFO pod/cri_provider.go:573 Successfully generated the run options of container "secretflow"in pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)"
2024-07-05 10:39:06.909 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Pod", Namespace:"alice", Name:"job-psi-3-partner-0", UID:"611303bd-7eb4-4b90-a594-a26e9f1341aa", APIVersion:"v1", ResourceVersion:"453956", FieldPath:"spec.containers{secretflow}"}): type: 'Normal' reason: 'Created' Created container secretflow
2024-07-05 10:39:06.944 INFO framework/pods_controller.go:517 Sync pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" exit, isTerminal=false
2024-07-05 10:39:06.945 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Pod", Namespace:"alice", Name:"job-psi-3-partner-0", UID:"611303bd-7eb4-4b90-a594-a26e9f1341aa", APIVersion:"v1", ResourceVersion:"453956", FieldPath:"spec.containers{secretflow}"}): type: 'Normal' reason: 'Started' Started container secretflow
2024-07-05 10:39:06.946 INFO framework/pods_controller.go:515 Sync pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" enter
2024-07-05 10:39:06.946 INFO pod/cri_provider.go:756 CRIProvider start syncing pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)"
2024-07-05 10:39:06.946 INFO resource/volume_manager.go:178 Mount (dump) file success, path=/home/kuscia/var/pods/611303bd-7eb4-4b90-a594-a26e9f1341aa/volumes/kubernetes.io~configmap/job-psi-3-configtemplate/task-config.conf, mode=420, size=176
2024-07-05 10:39:06.946 INFO resource/volume_manager.go:103 Mount volumes map[config-template:{HostPath:/home/kuscia/var/pods/611303bd-7eb4-4b90-a594-a26e9f1341aa/volumes/kubernetes.io~configmap/job-psi-3-configtemplate ReadOnly:true Managed:true SELinuxRelabel:true}] for pod "job-psi-3-partner-0" succeed
2024-07-05 10:39:06.946 INFO kuberuntime/kuberuntime_manager.go:547 ComputePodActions got for pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)": {false false f5e1f8500a827258dce03511e5e8ab3adb89bcf23be11bfb21a9cbac20f3066b 0 nil [] map[] []}
2024-07-05 10:39:06.946 INFO framework/pods_controller.go:517 Sync pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" exit, isTerminal=false
2024-07-05 10:39:06.947 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Node", Namespace:"alice", Name:"root-kuscia-lite-alice", UID:"root-kuscia-lite-alice", APIVersion:"", ResourceVersion:"", FieldPath:""}): type: 'Warning' reason: 'MissingClusterDNS' kubelet does not have ClusterDNS IP configured and cannot create Pod using "ClusterFirst" policy. Falling back to "Default" policy.
2024-07-05 10:39:06.947 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Pod", Namespace:"alice", Name:"job-psi-3-partner-0", UID:"611303bd-7eb4-4b90-a594-a26e9f1341aa", APIVersion:"v1", ResourceVersion:"453956", FieldPath:""}): type: 'Warning' reason: 'MissingClusterDNS' pod: "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)". kubelet does not have ClusterDNS IP configured and cannot create Pod using "ClusterFirst" policy. Falling back to "Default" policy.
2024-07-05 10:39:06.965 INFO status/status_manager.go:625 Patch status for pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)", patch={"metadata":{"uid":"611303bd-7eb4-4b90-a594-a26e9f1341aa"},"status":{"$setElementOrder/conditions":[{"type":"Initialized"},{"type":"Ready"},{"type":"ContainersReady"},{"type":"PodScheduled"}],"conditions":[{"message":null,"reason":null,"status":"True","type":"Ready"},{"message":null,"reason":null,"status":"True","type":"ContainersReady"}],"containerStatuses":[{"containerID":"containerd://1050bbbee6ee9906b5952122cdfc5b25a545a5ef98f1b28a41488eca640365a2","image":"secretflow-registry.cn-hangzhou.cr.aliyuncs.com/secretflow/secretflow-lite-anolis8:1.3.0b0","imageID":"sha256:b7b67c366e17624c5c8b44d1e76f49e135a6f3cbc822df1b4cba1b0fe5a04fef","lastState":{},"name":"secretflow","ready":true,"restartCount":0,"started":true,"state":{"running":{"startedAt":"2024-07-05T02:39:06Z"}}}],"phase":"Running","podIP":"10.88.0.5","podIPs":[{"ip":"10.88.0.5"}]}}
2024-07-05 10:39:06.966 INFO source/apiserver.go:65 Receive pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" update event from apiserver
2024-07-05 10:39:06.966 INFO source/config.go:100 Pod change merged, source=api, adds=[], updates=[], deletes=[], removes=[], reconciles=[job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)]
2024-07-05 10:39:06.966 INFO framework/pods_controller.go:276 SyncLoop RECONCILE, source=api, pods=[job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)]
2024-07-05 10:39:06.970 INFO controller/endpoints.go:189 Updating endpoint alice/job-psi-3-partner-0-spu/453979
2024-07-05 10:39:06.970 INFO xds/xds.go:434 Add cluster:service-job-psi-3-partner-0-spu
2024-07-05 10:39:06.971 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-spu] (8.006µs)
2024-07-05 10:39:06.971 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-global] (4.116µs)
2024-07-05 10:39:06.970 INFO controller/endpoints.go:189 Updating endpoint alice/job-psi-3-partner-0-global/453982
2024-07-05 10:39:06.971 INFO xds/xds.go:434 Add cluster:service-job-psi-3-partner-0-global
2024-07-05 10:39:06.975 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-fed] (7.631µs)
2024-07-05 10:39:06.975 INFO controller/endpoints.go:189 Updating endpoint alice/job-psi-3-partner-0-fed/453985
2024-07-05 10:39:06.976 INFO xds/xds.go:434 Add cluster:service-job-psi-3-partner-0-fed
2024-07-05 10:39:06.979 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-spu] (5.536µs)
2024-07-05 10:39:06.979 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[alice/job-psi-3-partner-0-spu] (8.390869ms)
2024-07-05 10:39:06.982 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[alice/job-psi-3-partner-0-global] (10.406168ms)
2024-07-05 10:39:06.982 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-global] (19.75µs)
2024-07-05 10:39:06.988 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[alice/job-psi-3-partner-0-fed] (5.745µs)
2024-07-05 10:39:06.989 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[alice/job-psi-3-partner-0-fed] (13.106722ms)
2024-07-05 10:39:07.867 INFO pod/cri_provider.go:348 Receive pleg event: &{ID:611303bd-7eb4-4b90-a594-a26e9f1341aa Type:ContainerStarted Data:1050bbbee6ee9906b5952122cdfc5b25a545a5ef98f1b28a41488eca640365a2}
2024-07-05 10:39:07.868 INFO framework/pods_controller.go:515 Sync pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" enter
2024-07-05 10:39:07.868 INFO status/status_manager.go:490 Ignoring same status for pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)"
2024-07-05 10:39:07.868 INFO pod/cri_provider.go:756 CRIProvider start syncing pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)"
2024-07-05 10:39:07.868 INFO resource/volume_manager.go:178 Mount (dump) file success, path=/home/kuscia/var/pods/611303bd-7eb4-4b90-a594-a26e9f1341aa/volumes/kubernetes.io~configmap/job-psi-3-configtemplate/task-config.conf, mode=420, size=176
2024-07-05 10:39:07.869 INFO resource/volume_manager.go:103 Mount volumes map[config-template:{HostPath:/home/kuscia/var/pods/611303bd-7eb4-4b90-a594-a26e9f1341aa/volumes/kubernetes.io~configmap/job-psi-3-configtemplate ReadOnly:true Managed:true SELinuxRelabel:true}] for pod "job-psi-3-partner-0" succeed
2024-07-05 10:39:07.869 INFO kuberuntime/kuberuntime_manager.go:547 ComputePodActions got for pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)": {false false f5e1f8500a827258dce03511e5e8ab3adb89bcf23be11bfb21a9cbac20f3066b 0 nil [] map[] []}
2024-07-05 10:39:07.869 INFO framework/pods_controller.go:517 Sync pod "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)" exit, isTerminal=false
2024-07-05 10:39:07.869 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Node", Namespace:"alice", Name:"root-kuscia-lite-alice", UID:"root-kuscia-lite-alice", APIVersion:"", ResourceVersion:"", FieldPath:""}): type: 'Warning' reason: 'MissingClusterDNS' kubelet does not have ClusterDNS IP configured and cannot create Pod using "ClusterFirst" policy. Falling back to "Default" policy.
2024-07-05 10:39:07.869 INFO record/event.go:285 Event(v1.ObjectReference{Kind:"Pod", Namespace:"alice", Name:"job-psi-3-partner-0", UID:"611303bd-7eb4-4b90-a594-a26e9f1341aa", APIVersion:"v1", ResourceVersion:"453956", FieldPath:""}): type: 'Warning' reason: 'MissingClusterDNS' pod: "job-psi-3-partner-0_alice(611303bd-7eb4-4b90-a594-a26e9f1341aa)". kubelet does not have ClusterDNS IP configured and cannot create Pod using "ClusterFirst" policy. Falling back to "Default" policy.
2024-07-05 10:39:18.883 INFO pleg/generic.go:303 Generic (PLEG): container finished, podID=611303bd-7eb4-4b90-a594-a26e9f1341aa, containerID=1050bbbee6ee9906b5952122cdfc5b25a545a5ef98f1b28a41488eca640365a2, exitCode=1
#bob
2024-07-05 10:39:29.299 INFO framework/pods_controller.go:661 Sync terminating pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)"exit
2024-07-05 10:39:29.307 INFO framework/pods_controller.go:669 Sync terminated pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)" enter
2024-07-05 10:39:29.344 INFO pod/cri_provider.go:811 CRIProvider start deleting pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)"
2024-07-05 10:39:29.344 INFO framework/pods_controller.go:685 Sync terminated pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)"exit
2024-07-05 10:39:29.449 INFO source/apiserver.go:65 Receive pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)" update event from apiserver
2024-07-05 10:39:29.450 INFO source/config.go:100 Pod change merged, source=api, adds=[], updates=[], deletes=[], removes=[], reconciles=[job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)]
2024-07-05 10:39:29.450 INFO framework/pods_controller.go:276 SyncLoop RECONCILE, source=api, pods=[job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)]
2024-07-05 10:39:29.454 INFO status/status_manager.go:625 Patch status for pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)", patch={"metadata":{"uid":"a1aa6f55-1f52-45fd-96bc-525b36d74dde"},"status":{"$setElementOrder/conditions":[{"type":"Initialized"},{"type":"Ready"},{"type":"ContainersReady"},{"type":"PodScheduled"}],"conditions":[{"lastTransitionTime":"2024-07-05T02:39:29Z","reason":"PodFailed","status":"False","type":"Ready"},{"lastTransitionTime":"2024-07-05T02:39:29Z","reason":"PodFailed","status":"False","type":"ContainersReady"}],"containerStatuses":[{"containerID":"containerd://d9872dba311405bb3ba83355c6aff2419af1f0be3db3aba1bb685ee0d3971a5a","image":"secretflow-registry.cn-hangzhou.cr.aliyuncs.com/secretflow/secretflow-lite-anolis8:1.3.0b0","imageID":"sha256:b7b67c366e17624c5c8b44d1e76f49e135a6f3cbc822df1b4cba1b0fe5a04fef","lastState":{},"name":"secretflow","ready":false,"restartCount":0,"started":false,"state":{"terminated":{"containerID":"containerd://d9872dba311405bb3ba83355c6aff2419af1f0be3db3aba1bb685ee0d3971a5a","exitCode":15,"finishedAt":"2024-07-05T02:39:28Z","message":"WARNING:root:Since the GPL-licensed package `unidecode` is not installed, using Python's `unicodedata` package which yields worse results.\n2024-07-05 02:39:15,099|bob|INFO|secretflow|entry.py:start_ray:56| ray_conf: RayConfig(ray_node_ip_address='job-psi-3-partner-0-global.bob.svc', ray_node_manager_port=23820, ray_object_manager_port=23821, ray_client_server_port=23822, ray_worker_ports=[], ray_gcs_port=23819)\n2024-07-05 02:39:15,099|bob|INFO|secretflow|entry.py:start_ray:60| Trying to start ray head node at job-psi-3-partner-0-global.bob.svc, start command: RAY_BACKEND_LOG_LEVEL=debug RAY_grpc_enable_http_proxy=true OMP_NUM_THREADS=2 ray start --head --include-dashboard=false --disable-usage-stats --num-cpus=32 --node-ip-address=job-psi-3-partner-0-global.bob.svc --port=23819 --node-manager-port=23820 --object-manager-port=23821 
--ray-client-server-port=23822\n","reason":"Error","startedAt":"2024-07-05T02:39:07Z"}}}],"phase":"Failed","podIP":null,"podIPs":null}}
2024-07-05 10:39:29.454 INFO framework/pod.go:751 Pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)" is terminated and all resources are reclaimed
2024-07-05 10:39:29.457 INFO controller/endpoints.go:189 Updating endpoint bob/job-psi-3-partner-0-spu/454081
2024-07-05 10:39:29.501 INFO status/status_manager.go:653 Pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)" fully terminated and removed from etcd
2024-07-05 10:39:29.503 INFO controller/endpoints.go:189 Updating endpoint bob/job-psi-3-partner-0-global/454083
2024-07-05 10:39:29.503 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[bob/job-psi-3-partner-0-spu] (9.762µs)
2024-07-05 10:39:29.507 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[bob/job-psi-3-partner-0-global] (23.75µs)
2024-07-05 10:39:29.507 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[bob/job-psi-3-partner-0-fed] (4.107µs)
2024-07-05 10:39:29.507 INFO source/apiserver.go:65 Receive pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)" update event from apiserver
2024-07-05 10:39:29.507 INFO source/apiserver.go:72 Receive pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)" delete event from apiserver
2024-07-05 10:39:29.507 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[bob/job-psi-3-partner-0-spu] (3.322µs)
2024-07-05 10:39:29.507 INFO source/config.go:100 Pod change merged, source=api, adds=[], updates=[], deletes=[job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)], removes=[], reconciles=[]
2024-07-05 10:39:29.507 INFO source/config.go:100 Pod change merged, source=api, adds=[], updates=[], deletes=[], removes=[job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)], reconciles=[]
2024-07-05 10:39:29.507 INFO framework/pods_controller.go:279 SyncLoop DELETE, source=api, pods=[job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)]
2024-07-05 10:39:29.507 INFO framework/pods_controller.go:273 SyncLoop REMOVE, source=api, pods=[job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)]
2024-07-05 10:39:29.507 INFO framework/pods_controller.go:347 Pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)" has been deleted and must be killed
2024-07-05 10:39:29.507 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[bob/job-psi-3-partner-0-fed] (5.778µs)
2024-07-05 10:39:29.507 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[bob/job-psi-3-partner-0-global] (2.723µs)
2024-07-05 10:39:29.507 INFO queue/queue.go:124 Finish processing item: queue id[kuscia-coredns-controller], key[bob/job-psi-3-partner-0-spu] (2.722µs)
2024-07-05 10:39:29.507 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[bob/job-psi-3-partner-0-global] (419.249µs)
2024-07-05 10:39:29.583 INFO controller/endpoints.go:189 Updating endpoint bob/job-psi-3-partner-0-fed/454084
2024-07-05 10:39:29.583 INFO controller/endpoints.go:189 Updating endpoint bob/job-psi-3-partner-0-fed/454093
2024-07-05 10:39:29.583 INFO controller/endpoints.go:189 Updating endpoint bob/job-psi-3-partner-0-global/454094
2024-07-05 10:39:29.583 INFO controller/endpoints.go:189 Updating endpoint bob/job-psi-3-partner-0-spu/454095
2024-07-05 10:39:29.583 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[bob/job-psi-3-partner-0-fed] (266.327µs)
2024-07-05 10:39:29.583 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[bob/job-psi-3-partner-0-global] (106.138µs)
2024-07-05 10:39:29.583 INFO status/status_manager.go:604 Pod "job-psi-3-partner-0_bob(a1aa6f55-1f52-45fd-96bc-525b36d74dde)" does not exist on the server
2024-07-05 10:39:29.503 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[bob/job-psi-3-partner-0-spu] (33.477µs)
2024-07-05 10:39:29.591 INFO queue/queue.go:124 Finish processing item: queue id[endpoints-queue], key[bob/job-psi-3-partner-0-spu] (7.978222ms)
Issue Type
Install/Deploy
Search for existing issues similar to yours
Yes
OS Platform and Distribution
Ubuntu 22.04
Kuscia Version
0.7.0b0
Deployment
docker
Deployment Version
26.1.4
App Running type
secretflow
App Running version
1.3.0b0
Configuration file used to run kuscia.
What happened and what you expected to happen.
Kuscia log output.