New Apple model combines vision understanding and image generation with impressive results

Published: (January 14, 2026 at 03:44 PM EST)
1 min read
Source: 9to5Mac

Source: 9to5Mac

Overview

Apple researchers have published a study about Manzano, a multimodal model that combines visual understanding and text-to-image generation, while significantly reducing performance and quality trade‑offs of current implementations. Here are the details.

Back to Blog

Related posts

Read more »