MobileLLM-R1-950M meets Apple Silicon
From Stub to Coherent MLX Model, a story-shaped release note Getting Meta's efficient 950M parameter model running natively on Apple Silicon for fast, local inference I’ve done a tiny bit of contributions to the llama.cpp community, including adding ...
Sep 15, 20256 min read77