Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added training plan logic into automate script #510

Open
wants to merge 27 commits into
base: main
Choose a base branch
from

Conversation

amanshanbhag
Copy link
Collaborator

New feature (training plans) integrated into the auto create cluster script. Also updated README with better descriptions + added in gif of demo :)

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

amanshanbhag and others added 22 commits November 7, 2024 10:07
To avoid confusion, modified pre-requisites output at begining of script to clearly state what resources have been pre-provisioned by workshop studio. 

The AIM_403 branch of this script presumes a workshop studio account has been deployed and a user is running on a SageMaker studio Code instance with required permissions.
fix bug with sts get caller identity
Use sagemaker stack name by default, remove ability for users to define a custom stack name
@nghtm
Copy link
Collaborator

nghtm commented Dec 9, 2024

Can you please confirm whether this PR has been tested?

@nghtm
Copy link
Collaborator

nghtm commented Dec 9, 2024

please resolve conflicts before merging:

https://github.com/aws-samples/awsome-distributed-training/pull/510/conflicts

@amanshanbhag
Copy link
Collaborator Author

It's been tested pre-release. Waiting on account AL to be able to test once before merging.


## 🚀 Installation and Usage
Using this script is very simple. Run ```bash automate-cluster-creation.sh```
The automation script streamlines the process of setting up a distributed training environment using AWS SageMaker Hyperpod.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

redunddant with previous sentence.

Copy link
Contributor

@mhuguesaws mhuguesaws left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left comments

@amanshanbhag
Copy link
Collaborator Author

Thank you Maxime! i will merge changes after a final test

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants