What’s New in MAX 24.4? MAX on macOS, Fast Local Llama3, Native Quantization and GGUF Support

MAX
AI
LLM
Release
Exploring the new features in MAX 24.4, including macOS support and fast local Llama 3 inference. Originally published on Modular’s blog.
Author

Ehsan M. Kermani

Published

June 25, 2024

I wrote the MAX 24.4 release post on the Modular blog. The headline feature is macOS support: MAX now runs on macOS, so you can run fast local Llama 3 inference right on your laptop. The release also adds native quantization and GGUF support, so many more models work out of the box.

The article goes through each of these with benchmarks.

Read it here: What’s New in MAX 24.4?