You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Noticed failures like the following in the conda-python-tests / 11.8.0, 3.11, arm64, ubuntu20.04, a100, latest-driver, latest-deps job on 24.12 nightly tests:
___________________ test_feature_storage_wholegraph_backend ____________________
Traceback (most recent call last):
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/runner.py", line 341, in from_call
result: TResult | None = func()
^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/runner.py", line 242, in <lambda>
lambda: runtest_hook(item=item, **kwds), when=when, reraise=reraise
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_hooks.py", line 513, in __call__
return self._hookexec(self.name, self._hookimpls.copy(), kwargs, firstresult)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_manager.py", line 120, in _hookexec
return self._inner_hookexec(hook_name, methods, kwargs, firstresult)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 182, in _multicall
return outcome.get_result()
^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_result.py", line 100, in get_result
raise exc.with_traceback(exc.__traceback__)
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/threadexception.py", line 92, in pytest_runtest_call
yield from thread_exception_runtest_hook()
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/threadexception.py", line 68, in thread_exception_runtest_hook
yield
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/unraisableexception.py", line 95, in pytest_runtest_call
yield from unraisable_exception_runtest_hook()
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/unraisableexception.py", line 70, in unraisable_exception_runtest_hook
yield
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/logging.py", line 846, in pytest_runtest_call
yield from self._runtest_for(item, "call")
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/logging.py", line 829, in _runtest_for
yield
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/capture.py", line 880, in pytest_runtest_call
return (yield)
^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/skipping.py", line 257, in pytest_runtest_call
return (yield)
^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 103, in _multicall
res = hook_impl.function(*args)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/runner.py", line 174, in pytest_runtest_call
item.runtest()
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/python.py", line 1627, in runtest
self.ihook.pytest_pyfunc_call(pyfuncitem=self)
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_hooks.py", line 513, in __call__
return self._hookexec(self.name, self._hookimpls.copy(), kwargs, firstresult)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_manager.py", line 120, in _hookexec
return self._inner_hookexec(hook_name, methods, kwargs, firstresult)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 139, in _multicall
raise exception.with_traceback(exception.__traceback__)
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 103, in _multicall
res = hook_impl.function(*args)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/python.py", line 159, in pytest_pyfunc_call
result = testfunction(**testargs)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/__w/cugraph/cugraph/python/cugraph/cugraph/tests/data_store/test_gnn_feat_storage_wholegraph.py", line 82, in test_feature_storage_wholegraph_backend
assert world_size > 0
AssertionError: assert 0 > 0
----------------------------- Captured stdout call -----------------------------
gpu count: 0
__________________ test_feature_storage_wholegraph_backend_mg __________________
Traceback (most recent call last):
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/runner.py", line 341, in from_call
result: TResult | None = func()
^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/runner.py", line 242, in <lambda>
lambda: runtest_hook(item=item, **kwds), when=when, reraise=reraise
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_hooks.py", line 513, in __call__
return self._hookexec(self.name, self._hookimpls.copy(), kwargs, firstresult)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_manager.py", line 120, in _hookexec
return self._inner_hookexec(hook_name, methods, kwargs, firstresult)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 182, in _multicall
return outcome.get_result()
^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_result.py", line 100, in get_result
raise exc.with_traceback(exc.__traceback__)
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/threadexception.py", line 92, in pytest_runtest_call
yield from thread_exception_runtest_hook()
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/threadexception.py", line 68, in thread_exception_runtest_hook
yield
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/unraisableexception.py", line 95, in pytest_runtest_call
yield from unraisable_exception_runtest_hook()
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/unraisableexception.py", line 70, in unraisable_exception_runtest_hook
yield
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/logging.py", line 846, in pytest_runtest_call
yield from self._runtest_for(item, "call")
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/logging.py", line 829, in _runtest_for
yield
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/capture.py", line 880, in pytest_runtest_call
return (yield)
^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 167, in _multicall
teardown.throw(outcome._exception)
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/skipping.py", line 257, in pytest_runtest_call
return (yield)
^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 103, in _multicall
res = hook_impl.function(*args)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/runner.py", line 174, in pytest_runtest_call
item.runtest()
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/python.py", line 1627, in runtest
self.ihook.pytest_pyfunc_call(pyfuncitem=self)
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_hooks.py", line 513, in __call__
return self._hookexec(self.name, self._hookimpls.copy(), kwargs, firstresult)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_manager.py", line 120, in _hookexec
return self._inner_hookexec(hook_name, methods, kwargs, firstresult)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 139, in _multicall
raise exception.with_traceback(exception.__traceback__)
File "/opt/conda/envs/test/lib/python3.11/site-packages/pluggy/_callers.py", line 103, in _multicall
res = hook_impl.function(*args)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/envs/test/lib/python3.11/site-packages/_pytest/python.py", line 159, in pytest_pyfunc_call
result = testfunction(**testargs)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/__w/cugraph/cugraph/python/cugraph/cugraph/tests/data_store/test_gnn_feat_storage_wholegraph.py", line 100, in test_feature_storage_wholegraph_backend_mg
assert world_size > 0
AssertionError: assert 0 > 0
----------------------------- Captured stdout call -----------------------------
gpu count: 0
</details>
### Other/Misc.
@jakirkham and @alexbarghi-nv saw these exact tests fail exactly this way in https://github.com/rapidsai/cugraph/pull/4703#issuecomment-2403280592. There, the root cause was some mix of PyTorch and cupy versions (I think).
### Code of Conduct
- [x] I agree to follow cuGraph's Code of Conduct
- [x] I have searched the [open bugs](https://github.com/rapidsai/cugraph/issues?q=is%3Aopen+is%3Aissue+label%3Abug) and have found no duplicates for this bug report
The text was updated successfully, but these errors were encountered:
jameslamb
changed the title
[BUG]: wholegraph feature store tests failing on CUDA 11.8.0, Python 3.11, arm64
[BUG]: wholegraph feature store tests failing on CUDA 11.8.0, Python 3.11, arm64 (nightly tests)
Dec 9, 2024
Version
24.12
Which installation method(s) does this occur on?
Conda
Describe the bug.
Noticed failures like the following in the
conda-python-tests / 11.8.0, 3.11, arm64, ubuntu20.04, a100, latest-driver, latest-deps
job on 24.12 nightly tests:full stacktrace (click me)
(build link)
Saw them again on a manual re-run of that test job a few hours later.
Minimum reproducible example
See CI links above.
Note this is only happening for that one
conda-python-tests
job, not on any other configuration or with wheels.Relevant log output
N/A
Environment details
See CI links above.
output of the last 'conda install' before the failing tests (click me)
Package Version Build Channel Size
───────────────────────────────────────────────────────────────────────────────────────────────────────────────
Install:
───────────────────────────────────────────────────────────────────────────────────────────────────────────────
Summary:
Install: 76 packages
The text was updated successfully, but these errors were encountered: