[DO NOT MERGE] Reduce parallelism in devcontainer #3925

AyodeAwe · 2023-10-10T14:57:25Z

Debugging failure in #3919

https://github.com/rapidsai/cugraph/actions/runs/6434836186/job/17535341120?pr=3919

@dongxuy04

Created based on code from @dongxuy04.

Adds support for `WholeGraph` `WholeMemory` in the cuGraph `FeatureStore` class. This enables both DGL and PyG to take advantage of distributed feature store functionality. Adds `pylibwholegraph` as a testing dependency so the feature store can be tested. Adds appropriate SG and MG tests.

Authors:
- Alex Barghi (https://github.com/alexbarghi-nv)

Approvers:
- Ray Douglass (https://github.com/raydouglass)
- Brad Rees (https://github.com/BradReesWork)
- Vibhu Jawa (https://github.com/VibhuJawa)

URL: rapidsai#3874

…MFG creation (rapidsai#3887)

Allow cugraph-dgl dataloader to consume sampled outputs from BulkSampler in CSC format.

Authors:
- Tingyu Wang (https://github.com/tingyu66)
- Seunghwa Kang (https://github.com/seunghwak)
- Alex Barghi (https://github.com/alexbarghi-nv)

Approvers:
- Seunghwa Kang (https://github.com/seunghwak)
- Alex Barghi (https://github.com/alexbarghi-nv)
- Vibhu Jawa (https://github.com/VibhuJawa)

URL: rapidsai#3887

@naimnv

This handles isolated nodes in `louvain_communities` similar to what is done in rapidsai#3886. This is expected to be a temporary fix until pylibcugraph can handle isolated nodes.

As a bonus, I added `isolates` algorithm 🎉

CC @naimnv @rlratzel

Authors:
- Erik Welch (https://github.com/eriknw)

Approvers:
- Rick Ratzel (https://github.com/rlratzel)

URL: rapidsai#3897
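For readers unfamiliar with the isolated-node problem mentioned in this commit, here is a minimal pure-Python sketch of the general idea: run community detection on the connected part of the graph, then append each isolated node as its own single-node community. The function names `find_isolates` and `add_isolates_as_singletons` are hypothetical and are not taken from the cuGraph code; this only illustrates the concept, not how rapidsai#3897 implements it.

```python
def find_isolates(nodes, edges):
    """Return the nodes that do not appear as an endpoint of any edge."""
    connected = set()
    for u, v in edges:
        connected.add(u)
        connected.add(v)
    # Preserve the input order of `nodes` for deterministic output.
    return [n for n in nodes if n not in connected]


def add_isolates_as_singletons(communities, nodes, edges):
    """Append each isolated node as its own one-node community.

    `communities` is a list of sets, assumed to come from some community
    detection routine that only saw nodes with at least one edge.
    """
    covered = set().union(*communities)
    extra = [{n} for n in find_isolates(nodes, edges) if n not in covered]
    return list(communities) + extra


if __name__ == "__main__":
    nodes = [0, 1, 2, 3, 4]
    edges = [(0, 1), (1, 2)]
    # Pretend a Louvain-style routine returned one community for the
    # connected nodes; nodes 3 and 4 have no edges.
    print(add_isolates_as_singletons([{0, 1, 2}], nodes, edges))
```

Treating each isolate as a singleton community keeps the result a full partition of the node set, which is what callers of a community detection function usually expect.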

Integrates the new CSR bulk sampler output, allowing reading of batches without having to call CSC conversion or count the numbers of vertices and edges in each batch. Should result in major performance improvements, especially for small batches.

Authors:
- Alex Barghi (https://github.com/alexbarghi-nv)
- Seunghwa Kang (https://github.com/seunghwak)
- Brad Rees (https://github.com/BradReesWork)

Approvers:
- Brad Rees (https://github.com/BradReesWork)
- Ray Douglass (https://github.com/raydouglass)
- Tingyu Wang (https://github.com/tingyu66)

URL: rapidsai#3873

This PR increases the minimum timeout when waiting for the workers to complete their tasks.

Authors:
- Joseph Nke (https://github.com/jnke2016)

Approvers:
- Brad Rees (https://github.com/BradReesWork)
- Vibhu Jawa (https://github.com/VibhuJawa)
- Rick Ratzel (https://github.com/rlratzel)
- Jake Awe (https://github.com/AyodeAwe)

URL: rapidsai#3907

@jnke2016

… `cugraph.Graph` (rapidsai#3895)

This PR attempts to fix rapidsai#3790.

Please note that I have not been able to reproduce the failure locally, so it is hard for me to know whether this actually fixes anything.

MRE being used to test locally: https://gist.github.com/VibhuJawa/4b1ec24022b6e2dd7879cd2e8d3fab67

CC: @jnke2016, @rlratzel, @rjzamora. Please let me know what I can do better here.

Authors:
- Vibhu Jawa (https://github.com/VibhuJawa)
- Brad Rees (https://github.com/BradReesWork)

Approvers:
- Rick Ratzel (https://github.com/rlratzel)
- Joseph Nke (https://github.com/jnke2016)

URL: rapidsai#3895

AyodeAwe · 2023-10-10T15:26:37Z

There's a giant memory leak happening somewhere after the C++ build is kicked off in the devcontainer (for CUDA 12.0, pip variant). Trying to figure out where.

Because of this, it looks like reducing CPU parallelism would be irrelevant here.

AyodeAwe · 2023-10-10T21:01:49Z

Issue fixed, closing PR.

alexbarghi-nv and others added 7 commits September 30, 2023 23:35

Merge branch-23.08 into branch-23.10

af5de4e

AyodeAwe added the improvement (Improvement / enhancement to an existing function) and non-breaking (Non-breaking change) labels Oct 10, 2023

AyodeAwe changed the title ~~[DO NOT MERGE] Reduce parallel~~ [DO NOT MERGE] Reduce parallelism in devcontainer Oct 10, 2023

reduce parallelism

c076e81

AyodeAwe force-pushed the reduce_parallel branch from 6e57794 to c076e81 Compare October 10, 2023 15:08

AyodeAwe closed this Oct 10, 2023
