Podcast Guide

Prince Canuma

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Multimodal AI Models on Apple Silicon with MLX with Prince Canuma

Published
August 26, 2025
Duration
1h 10m
Last updated
April 21, 2026

Discusses multimodal AI and inference.

Show notes

Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince shares his journey to becoming one of the most prolific contributors to Apple’s MLX ecosystem, having published over 1,000 models and libraries that make open, multimodal AI accessible and performant on Apple devices. We explore his workflow for adapting new models in MLX, the trade-offs between the GPU and Neural Engine, and how optimization methods

Themes

  • multimodal
  • inference