Skip to content

v1.0.4

Compare
Choose a tag to compare
@vara-bonthu vara-bonthu released this 19 Sep 19:52
· 38 commits to main since this release
4136b09

What's Changed

  • fix: Update cleanup script by @ratnopamc in #595
  • fix: Update karpenter version for trainium-inferentia blueprint by @ratnopamc in #599
  • docs: Add video link for Deploy accelerator-agnostic inference pipelines to Amazon EKS by @TalHibner in #600
  • feat: Replaced NIM architecture diagram with self-made NIM on EKS arch diagram by @hustshawn in #612
  • feat: RayServe with vLLM using AWS Neuron on Amazon EKS by @ratnopamc in #607
  • feat: Mountpoint S3 for loading additional Spark Jars by @bainskb in #606
  • feat: Support preloading container images into Bottlerocket data volumes with Karpenter by @lindarr915 in #587
  • chore: Delete ai-ml/kubeflow directory by @askulkarni2 in #619
  • feat: Updated mountpoint-s3 for spark readme by @bainskb in #618
  • feat: Trainium blueprint upgrade and Llama3.1 405b Distributed inference example by @vara-bonthu in #622
  • feat: Neuron scheduler update for trainium-inferentia blueprints by @ratnopamc in #624
  • feat: Website Updates by @vara-bonthu in #626
  • feat: Updates to the sidebar by @vara-bonthu in #627
  • feat: Added deprecating notes; added Jark stack doc;added warnings for ML p… by @vara-bonthu in #628
  • feat: NVIDIA NIM Updates by @vara-bonthu in #631
  • feat: Udate NVIDIA NIM blueprint with grafana dashboard and docs by @ratnopamc in #633
  • feat: Add OpenWebUI for vllm-rayserve-inf2 blueprint by @ratnopamc in #635
  • feat: Updated EMR on EKS Blueprint by @vara-bonthu in #638
  • chore: Update PULL_REQUEST_TEMPLATE.md by @askulkarni2 in #643
  • chore: Add access entry for workshop by @askulkarni2 in #644
  • fix: Use eks module for access_entries for trn-inf blueprint by @askulkarni2 in #646
  • chore(deps): bump send and express in /website by @dependabot in #653
  • chore(deps): bump serve-static and express in /website by @dependabot in #652
  • feat: Updates to the llama3.1 405 model scripts by @vara-bonthu in #655
  • feat: Add cloudwatch eks add on with enhanced monitoring for neuron by @ratnopamc in #651
  • chore: Update cmd-shell image to python3.11 by @askulkarni2 in #656
  • docs: Website documentation for vllm inferencing using rayserve on AWS Inferentia by @sindhupalakodety in #637
  • feat: Adding support for AWS Batch by @delagoya in #620
  • docs: Ray vLLM Inf2 website doc updates by @ratnopamc in #657
  • feat: Observability for RayServe and vLLM GPU by @shivam-dubey-1 in #642
  • feat: Add binpacking examples by @hitsub2 in #615
  • feat: Spark K8s Operator on EKS IPv6 cluster by @ovaleanu in #499

New Contributors

Full Changelog: v1.0.3...v1.0.4