Skip to content

MAX 24.4

Compare
Choose a tag to compare
@goldiegadde goldiegadde released this 07 Jun 21:31
· 972 commits to main since this release

Release 24.4

Today, we are thrilled to announce the release of MAX 24.4, which introduces a powerful new quantization API for MAX Graphs and extends MAX’s reach to macOS. Together, these unlock a new industry standard paradigm where developers can leverage a single toolchain to build Generative AI pipelines locally and seamlessly deploy them to the cloud, all with industry-leading performance. Leveraging the Quantization API reduces the latency and memory cost of Generative AI pipelines by up to 8x on desktop architectures like macOS, by up to 7x on cloud CPU architectures like Intel and Graviton, without requiring developers to rewrite models or update any application code.

Checkout the changelog and the full release blog for additional details.