You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
'spark-test.sh' integration tests FAILED on ps: command not found in Rocky Docker containers, ps is not installed in some nvidia Rocky docker image, e.g. nvidia/cuda:12.0.0-runtime-rockylinux9
+ stop-worker.sh
/home/jenkins/agent/workspace/jenkins-rapids_it-3.5.x-SNAPSHOT-dev-github-9/jars/spark-3.5.1-SNAPSHOT-bin-hadoop3.2/bin/load-spark-env.sh: line 68: ps: command not found
/home/jenkins/agent/workspace/jenkins-rapids_it-3.5.x-SNAPSHOT-dev-github-9/jars/spark-3.5.1-SNAPSHOT-bin-hadoop3.2/bin/load-spark-env.sh: line 68: ps: command not found
/home/jenkins/agent/workspace/jenkins-rapids_it-3.5.x-SNAPSHOT-dev-github-9/jars/spark-3.5.1-SNAPSHOT-bin-hadoop3.2/sbin/spark-daemon.sh: line 214: ps: command not found
no org.apache.spark.deploy.worker.Worker to stop
+ stop-master.sh
/home/jenkins/agent/workspace/jenkins-rapids_it-3.5.x-SNAPSHOT-dev-github-9/jars/spark-3.5.1-SNAPSHOT-bin-hadoop3.2/bin/load-spark-env.sh: line 68: ps: command not found
/home/jenkins/agent/workspace/jenkins-rapids_it-3.5.x-SNAPSHOT-dev-github-9/jars/spark-3.5.1-SNAPSHOT-bin-hadoop3.2/sbin/spark-daemon.sh: line 214: ps: command not found
no org.apache.spark.deploy.master.Master to stop
Describe the bug
'spark-test.sh' integration tests FAILED on
ps: command not found
in Rocky Docker containers,ps
is not installed in some nvidia Rocky docker image, e.g. nvidia/cuda:12.0.0-runtime-rockylinux9ps
command is required for starting/stopping spark clusters: e.g. https://github.com/apache/spark/blob/v3.3.2/bin/load-spark-env.sh#L68As our integration tests support running in Rocky Docker containers, https://github.com/NVIDIA/spark-rapids/blob/branch-24.02/jenkins/Dockerfile-blossom.integration.rocky. We can install
ps
tools in Rocky Docker images to fix the issue.Failure logs
Steps/Code to reproduce bug
1, Run Rocky9 docker container using the docker file : https://github.com/NVIDIA/spark-rapids/blob/branch-24.02/jenkins/Dockerfile-blossom.integration.rocky
2, Run integration tests https://github.com/NVIDIA/spark-rapids/blob/branch-24.02/jenkins/spark-test.sh, e.g.
Expected behavior
Integration tests can PASS in the above provided Rocky8 docker container
Environment details (please complete the following information)
The text was updated successfully, but these errors were encountered: