-
Notifications
You must be signed in to change notification settings - Fork 309
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
This PR replaces and is a continuation of #3857 (by @betochimas) > This PR primarily adds testing for the `Resultset` class, introduced earlier in 23.10. The tests take a similar approach to test_dataset, creating a temporary directory to test downloading all result files. To align `Resultset` and `Dataset`, the setter and getter for each download directory is moved into `DefaultDownloadDir`, so that each class shares an instance of `DefaultDownloadDir` and can be configured independently, although their default locations are still both dependent on the RAPIDS_DATASET_ROOT_DIR_PATH environment variable. The old patterns are present but commented-out, so this change would be breaking. This PR also removes the deprecated `experimental.datasets` package due to it being promoted to stable for >=1 release. Authors: - Ralph Liu (https://github.com/nv-rliu) - Dylan Chima-Sanchez (https://github.com/betochimas) - Rick Ratzel (https://github.com/rlratzel) - Brad Rees (https://github.com/BradReesWork) Approvers: - Rick Ratzel (https://github.com/rlratzel) URL: #3957
- Loading branch information
Showing
38 changed files
with
379 additions
and
882 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
name: cit-Patents | ||
file_type: .csv | ||
description: A citation graph that includes all citations made by patents granted between 1975 and 1999, totaling 16,522,438 citations. | ||
author: NBER | ||
refs: | ||
J. Leskovec, J. Kleinberg and C. Faloutsos. Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations. | ||
ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2005. | ||
delim: " " | ||
header: None | ||
col_names: | ||
- src | ||
- dst | ||
col_types: | ||
- int32 | ||
- int32 | ||
has_loop: true | ||
is_directed: true | ||
is_multigraph: false | ||
is_symmetric: false | ||
number_of_edges: 16518948 | ||
number_of_nodes: 3774768 | ||
url: https://data.rapids.ai/cugraph/datasets/cit-Patents.csv |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
name: europe_osm | ||
file_type: .csv | ||
description: A graph of OpenStreetMap data for Europe. | ||
author: M. Kobitzsh / Geofabrik GmbH | ||
refs: | ||
Rossi, Ryan. Ahmed, Nesreen. The Network Data Respoistory with Interactive Graph Analytics and Visualization. | ||
delim: " " | ||
header: None | ||
col_names: | ||
- src | ||
- dst | ||
col_types: | ||
- int32 | ||
- int32 | ||
has_loop: false | ||
is_directed: false | ||
is_multigraph: false | ||
is_symmetric: true | ||
number_of_edges: 54054660 | ||
number_of_nodes: 50912018 | ||
url: https://data.rapids.ai/cugraph/datasets/europe_osm.csv |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
name: hollywood | ||
file_type: .csv | ||
description: | ||
A graph of movie actors where vertices are actors, and two actors are | ||
joined by an edge whenever they appeared in a movie together. | ||
author: Laboratory for Web Algorithmics (LAW) | ||
refs: | ||
The WebGraph Framework I Compression Techniques, Paolo Boldi | ||
and Sebastiano Vigna, Proc. of the Thirteenth International | ||
World Wide Web Conference (WWW 2004), 2004, Manhattan, USA, | ||
pp. 595--601, ACM Press. | ||
delim: " " | ||
header: None | ||
col_names: | ||
- src | ||
- dst | ||
col_types: | ||
- int32 | ||
- int32 | ||
has_loop: false | ||
is_directed: false | ||
is_multigraph: false | ||
is_symmetric: true | ||
number_of_edges: 57515616 | ||
number_of_nodes: 1139905 | ||
url: https://data.rapids.ai/cugraph/datasets/hollywood.csv |
22 changes: 22 additions & 0 deletions
22
python/cugraph/cugraph/datasets/metadata/soc-livejournal1.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
name: soc-LiveJournal1 | ||
file_type: .csv | ||
description: A graph of the LiveJournal social network. | ||
author: L. Backstrom, D. Huttenlocher, J. Kleinberg, X. Lan | ||
refs: | ||
L. Backstrom, D. Huttenlocher, J. Kleinberg, X. Lan. Group Formation in | ||
Large Social Networks Membership, Growth, and Evolution. KDD, 2006. | ||
delim: " " | ||
header: None | ||
col_names: | ||
- src | ||
- dst | ||
col_types: | ||
- int32 | ||
- int32 | ||
has_loop: true | ||
is_directed: true | ||
is_multigraph: false | ||
is_symmetric: false | ||
number_of_edges: 68993773 | ||
number_of_nodes: 4847571 | ||
url: https://data.rapids.ai/cugraph/datasets/soc-LiveJournal1.csv |
22 changes: 22 additions & 0 deletions
22
python/cugraph/cugraph/datasets/metadata/soc-twitter-2010.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
name: soc-twitter-2010 | ||
file_type: .csv | ||
description: A network of follower relationships from a snapshot of Twitter in 2010, where an edge from i to j indicates that j is a follower of i. | ||
author: H. Kwak, C. Lee, H. Park, S. Moon | ||
refs: | ||
J. Yang, J. Leskovec. Temporal Variation in Online Media. ACM Intl. | ||
Conf. on Web Search and Data Mining (WSDM '11), 2011. | ||
delim: " " | ||
header: None | ||
col_names: | ||
- src | ||
- dst | ||
col_types: | ||
- int32 | ||
- int32 | ||
has_loop: false | ||
is_directed: false | ||
is_multigraph: false | ||
is_symmetric: false | ||
number_of_edges: 530051354 | ||
number_of_nodes: 21297772 | ||
url: https://data.rapids.ai/cugraph/datasets/soc-twitter-2010.csv |
Oops, something went wrong.