
feat: Add sample blueprint to run stable diffusion model on inferentia2 #405

Closed

Conversation

@ratnopamc (Collaborator) commented Jan 24, 2024

What does this PR do?

🛑 Please open an issue first to discuss any significant work and flesh out details/direction - we would hate for your time to be wasted.
Consult the CONTRIBUTING guide for submitting pull-requests.

This PR showcases a sample blueprint that deploys a stable diffusion model with RayServe on the AWS Inferentia2 accelerator.
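As context, the head-group configuration discussed in the review comments below might look roughly like the following. This is a minimal, hypothetical sketch of a KubeRay-style head group spec; the `memory: "8G"` limit and the `workload: "rayhead"` node selector are taken from the diff fragments in this PR, while the surrounding structure and field names are assumptions, not the blueprint's actual file.

```
# Hypothetical head-group sketch (structure assumed, values from the diff)
headGroupSpec:
  template:
    spec:
      containers:
        - name: ray-head
          resources:
            limits:
              memory: "8G"
      nodeSelector:
        workload: "rayhead"   # pins the Ray head pod to dedicated nodes
```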

Motivation

This addresses existing issue #371.

More

  • Yes, I have tested the PR using my local account setup (Provide any test evidence report under Additional Notes)
  • Mandatory for new blueprints. Yes, I have added an example to support my blueprint PR
  • Mandatory for new blueprints. Yes, I have updated the website/docs or website/blog section for this feature - work in progress.
  • Yes, I ran pre-commit run -a with this PR. Link for installing pre-commit locally

For Moderators

  • E2E test successfully completed before merge?

Additional Notes

@ratnopamc ratnopamc changed the title Example blueprint to run stable diffusion model on inferentia2 feat:Example blueprint to run stable diffusion model on inferentia2 Jan 24, 2024
@ratnopamc ratnopamc changed the title feat:Example blueprint to run stable diffusion model on inferentia2 feat: Add sample blueprint to run stable diffusion model on inferentia2 Jan 24, 2024
@vara-bonthu (Collaborator) left a comment


Thanks @ratnopamc ! I have added a few comments. Could you also write a website doc for the end-to-end deployment?

ai-ml/trainium-inferentia/variables.tf Outdated Show resolved Hide resolved
ai-ml/trainium-inferentia/variables.tf Outdated Show resolved Hide resolved
ai-ml/trainium-inferentia/variables.tf Outdated Show resolved Hide resolved
memory: "8G"
nodeSelector:
  #provisioner: default
  workload: "rayhead"
Collaborator

Is this using cluster-autoscaler or Karpenter? You can remove the commented config, or add details about why it's commented out.


labels = {
  instance-type = "inf2-8xl"
  //provisioner = "cluster-autoscaler"
Collaborator

remove the commented line

ai-ml/trainium-inferentia/eks.tf Outdated Show resolved Hide resolved
ai-ml/trainium-inferentia/eks.tf Outdated Show resolved Hide resolved
instance-type = "inf2"
provisioner = "cluster-autoscaler"
instance-type = "inf2-24xl"
//provisioner = "cluster-autoscaler"
Collaborator

remove the comment

@ratnopamc (Collaborator, Author)

Closed. A new PR #406 is tracking this issue.
