feature: Add multirail support and optional usage of hwloc for optimal NIC assignment #1107

Merged

philipmarshall21
Collaborator

This PR adds support for using multiple NICs to increase peak RMA bandwidth. It also adds optional use of hwloc to assign each process to the best-suited NIC based on that process's CPU affinity.
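
To illustrate the idea of affinity-based NIC assignment, here is a minimal sketch. It is not the code from this PR: it assumes the NUMA node of each NIC and of the calling process is already known (in the PR that information would come from hwloc), and the names `nic_desc`, `pick_nic`, `proc_numa_node`, and `local_rank` are hypothetical. The sketch prefers NICs on the process's own NUMA node and spreads processes across them by local rank; if no NIC is local, it falls back to round-robin over all NICs.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical NIC description: only the NUMA node the NIC is attached to. */
struct nic_desc {
    int numa_node;
};

/*
 * Pick a NIC index for one process (illustrative only).
 *   - If one or more NICs share the process's NUMA node, distribute
 *     processes across those NICs by local rank.
 *   - Otherwise, round-robin across all NICs by local rank.
 */
static size_t pick_nic(const struct nic_desc *nics, size_t num_nics,
                       int proc_numa_node, size_t local_rank)
{
    assert(nics != NULL && num_nics > 0);

    /* Count the NICs that are local to the process's NUMA node. */
    size_t num_local = 0;
    for (size_t i = 0; i < num_nics; i++) {
        if (nics[i].numa_node == proc_numa_node)
            num_local++;
    }

    /* No local NIC: fall back to plain round-robin over every NIC. */
    if (num_local == 0)
        return local_rank % num_nics;

    /* Select the (local_rank % num_local)-th local NIC. */
    size_t target = local_rank % num_local;
    for (size_t i = 0; i < num_nics; i++) {
        if (nics[i].numa_node == proc_numa_node) {
            if (target == 0)
                return i;
            target--;
        }
    }

    return 0; /* Not reached: num_local > 0 guarantees a match above. */
}
```

With three NICs on NUMA nodes 0, 1, and 1, a process on node 1 would get NIC 1 or NIC 2 depending on its local rank, while a process on node 0 would always get NIC 0. A real implementation would also need to decide how to handle processes whose affinity spans several NUMA nodes, which this sketch ignores.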

@philipmarshall21 force-pushed the multirail_with_hwloc branch 3 times, most recently from 6796126 to 0b52547, on November 30, 2023 22:30
@philipmarshall21 marked this pull request as ready for review on January 9, 2024 23:15
@philipmarshall21
Collaborator Author

@wrrobin @markbrown314 @parkerha1 Just wanted to give a little reminder that this branch is ready for review when you have the cycles to give it a look :)

Collaborator

@parkerha1 parkerha1 left a comment

Looks good to me!

if (NULL == info->p_info) {
    RAISE_WARN_MSG("OFI transport, no valid fabric (prov=%s, fabric=%s, domain=%s)\n",
                   info->prov_name != NULL ? info->prov_name : "<auto>",
                   info->fabric_name != NULL ? info->fabric_name : "<auto>",
                   info->domain_name != NULL ? info->domain_name : "<auto>");
    return ret;
}
DEBUG_MSG("provider: %s\n", info->p_info->domain_attr->name);
Member

I believe this is already included in the DEBUG_MSG below. Can we remove it here?

@philipmarshall21 philipmarshall21 merged commit 4727a9e into Sandia-OpenSHMEM:main Apr 10, 2024
35 checks passed
3 participants