MLX, llama.cpp, and Candle now perform about equally on an M3 Max.
🕯️🔥[Candle](github.com/huggingface/...) is now much faster on macOS thanks to a contribution by @EricLBuehler, which brings major speed improvements to the Metal backend.🍎📈
Try it out by running some of our examples with the `--features metal` flag.
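
A rough sketch of what that looks like from a local checkout of the Candle repo. The helper name `run_candle_example` is made up for illustration, and `quantized` is only an assumed example name, so swap in whichever example you want to try:

```shell
# Hypothetical helper (not part of Candle): run one of the repo's examples
# with the Metal backend enabled. Assumes the current directory is a local
# checkout of the Candle repository on an Apple silicon Mac.
run_candle_example() {
  example="$1"
  shift
  # --release builds with optimizations; --features metal turns on the Metal backend.
  # Anything after the example name is forwarded to the example itself.
  cargo run --example "$example" --release --features metal -- "$@"
}

# Usage (assumed example name; uncomment inside a Candle checkout):
# run_candle_example quantized
```

Dropping `--features metal` from the same command should give you a CPU run to compare against.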
#Candle #RustLang #macOS #Metal #HuggingFace