[BUG] running the Leiden algorithm doesn't support oversubscription #171
Comments
Where did you get the error? That is, do you get this in rsc or in cugraph? Could you also please upload the full stack trace? If you can reproduce the error with cugraph alone, it would be amazing if you created an issue there too.
I get this in rapids_singlecell:
RuntimeError Traceback (most recent call last)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/rapids_singlecell/tools/_clustering.py:125, in leiden(adata, resolution, random_state, restrict_to, key_added, adjacency, n_iterations, use_weights, neighbors_key, obsp, copy)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/rapids_singlecell/tools/_clustering.py:31, in _create_graph(adjacency, use_weights)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/cugraph/structure/graph_classes.py:193, in Graph.from_cudf_edgelist(self, input_df, source, destination, edge_attr, weight, edge_id, edge_type, renumber, store_transposed, legacy_renum_only)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/cugraph/structure/graph_implementation/simpleGraph.py:262, in simpleGraphImpl.__from_edgelist(self, input_df, source, destination, edge_attr, weight, edge_id, edge_type, renumber, legacy_renum_only, store_transposed)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/cugraph/structure/symmetrize.py:281, in symmetrize(input_df, source_col_name, dest_col_name, value_col_name, multi, symmetrize, do_expensive_check)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/cugraph/structure/symmetrize.py:100, in symmetrize_df(df, src_name, dst_name, weight_name, multi, symmetrize)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/cudf/core/mixins/mixin_factory.py:11, in _partialmethod..wrapper(self, *args2, **kwargs2)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/cudf/core/groupby/groupby.py:701, in GroupBy._reduce(self, op, numeric_only, min_count, *args, **kwargs)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/nvtx/nvtx.py:116, in annotate.call..inner(*args, **kwargs)
File ~/anaconda3/envs/sc_rapids/lib/python3.10/site-packages/cudf/core/groupby/groupby.py:567, in GroupBy.agg(self, func)
File groupby.pyx:350, in cudf._lib.groupby.GroupBy.aggregate()
File groupby.pyx:252, in cudf._lib.groupby.GroupBy.aggregate_internal()
RuntimeError: CUDA error encountered at: /opt/conda/conda-bld/work/cpp/src/hash/concurrent_unordered_map.cuh:546: 101 cudaErrorInvalidDevice invalid device ordinal
OK, I can't reproduce the error. Can you open an issue on cugraph? This happens inside cugraph's graph construction, so they should know about it and might be able to fix it.
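For anyone wanting to check whether the failure reproduces with cugraph alone, a minimal sketch of the graph-construction step from the stack trace could look like the following. The edge values and the `build_graph` helper are made up for illustration, the column names are assumptions, and this is not necessarily how rapids_singlecell calls cugraph internally. The memory setup that triggered the error is not included here.

```python
# Tiny weighted edge list; the values are invented purely for illustration.
EDGES = {
    "source": [0, 1, 2, 0],
    "destination": [1, 2, 0, 2],
    "weight": [1.0, 0.5, 0.25, 0.75],
}


def build_graph(edges=EDGES):
    """Hypothetical helper: build an undirected cugraph Graph from an edge list."""
    # Imported lazily: both packages need a RAPIDS install and a CUDA GPU.
    import cudf
    import cugraph

    df = cudf.DataFrame(edges)
    # cugraph.Graph() is undirected by default, so the edge list gets
    # symmetrized during construction (the step seen in the stack trace).
    graph = cugraph.Graph()
    graph.from_cudf_edgelist(
        df, source="source", destination="destination", edge_attr="weight"
    )
    return graph
```

Calling `build_graph()` on the affected machine, with the same memory settings as in the failing session, should show whether the error occurs without rapids_singlecell in the loop.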
Describe the bug
While running the Leiden algorithm with rsc.tl.leiden(adata, key_added="leiden_res0_25", resolution=0.25), I got a "CUDA error encountered 101 cudaErrorInvalidDevice invalid device ordinal".
Just setting managed_memory to False in rmm.reinitialize resolved the issue.
Expected behavior
Just information for other people running into the same error
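The workaround described above might be sketched as follows. This is a rough example, not the exact code from the original session: `RMM_SETTINGS` and `configure_rmm` are hypothetical names, and the `pool_allocator` and `devices` values are assumptions. Only `managed_memory=False` comes from the report.

```python
# Keyword arguments for rmm.reinitialize. Only managed_memory=False is taken
# from the report; the other two values are assumed defaults for illustration.
RMM_SETTINGS = {
    "managed_memory": False,  # the change that resolved the error
    "pool_allocator": False,  # assumption
    "devices": 0,             # assumption: single GPU with ordinal 0
}


def configure_rmm(settings=RMM_SETTINGS):
    """Hypothetical helper: reinitialize RMM without managed memory."""
    # Imported lazily: rmm needs a RAPIDS install and a CUDA GPU.
    import rmm

    rmm.reinitialize(**settings)
```

Call `configure_rmm()` once at the start of the session, before running `rsc.tl.leiden(adata, key_added="leiden_res0_25", resolution=0.25)`. Note that turning off managed memory also means GPU memory can no longer be oversubscribed.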
Environment details (please complete the following information):