[FEA]: Add parameter to prevent persisted edgelists in datasets
API
#4241
Labels
feature request
New feature or request
improvement
Improvement / enhancement to an existing function
python
Milestone
Is this a new feature, an improvement, or a change to existing functionality?
Improvement
How would you describe the priority of this feature request
Low (would be nice)
Please provide a clear description of problem this feature solves
When
cugraph.datasets
objects are used to clean-up MG tests (ex. #4197), they often need to store edge-lists for SG and MG (dask_cudf
) usage. However, the current implementation of datasets requires constant calls tounload
to avoid these issues.This also happened to interfere with CI due to the fact that edge-lists were persisted between files.
Describe your ideal solution
Similar to how MG algorithms have a flag that developers use for testing/debugging (
perform_expensive_check
), perhaps thedatasets
API should also have a flag that is set when used for testing purposes in order to automatically check for preexisting edge-lists andunload
them.Describe any alternatives you have considered
Since this issue only affects tests, an alternative could be to use fixtures that perform the "check and unload" steps in each unit test.
Additional context
This is part of a general effort to improve readability of the MG tests #4187
Code of Conduct
The text was updated successfully, but these errors were encountered: