Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding clarification to the networking in the "b. Create a distributed ML cluster" section #243

Open
pptb-aws opened this issue May 24, 2022 · 0 comments

Comments

@pptb-aws
Copy link

In the section, "b. Create a distributed ML cluster", it has the user run these commands.

# create the cluster configuration
export IFACE=$(curl --silent http://169.254.169.254/latest/meta-data/network/interfaces/macs/)
export SUBNET_ID=$(curl --silent http://169.254.169.254/latest/meta-data/network/interfaces/macs/${IFACE}/subnet-id)
export VPC_ID=$(curl --silent http://169.254.169.254/latest/meta-data/network/interfaces/macs/${IFACE}/vpc-id)
export AZ=$(curl http://169.254.169.254/latest/meta-data/placement/availability-zone)
export REGION=${AZ::-1}

However, it does not provide an explanation regarding what these commands are for and what issues could arise based on these commands. I purposes adding the following information directly before this block.

"The following exports a number of variables that are used to set the VPC and subnet that the Head and Compute nodes will be launched into. These will use the VPC and Subnet of the instance where these commands are being run. Please review the networking requirments here for setting up Parallel Cluster in a single public subnet to make sure your subnet meets the requirments"

Additionally, there is a typo in the ml-config.yaml where, in the HeadNode.Dcv Enabled is spelled Emabled.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant