Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge

Published: 1 month ago (March 9, 2026 at 02:36 PM EDT)

2 min read

Source: Hugging Face Blog

Overview

We’re excited to share Granite 4.0 1B Speech, the latest addition to IBM’s Granite Speech collection. Designed for enterprise applications on resource‑constrained devices, Granite 4.0 1B Speech is a compact speech‑language model built for multilingual automatic speech recognition (ASR) and bidirectional speech translation (AST).

Key highlights

Size: Half the parameters of its predecessor, granite‑speech‑3.3‑2b.
Languages: English, French, German, Spanish, Portuguese, and Japanese (new Japanese ASR support).
Features: Keyword‑list biasing for better recognition of names and acronyms.
Performance: Higher English transcription accuracy and faster inference via speculative decoding.
Recognition: Ranked #1 on the OpenASR leaderboard.

Performance

Despite its small size, Granite 4.0 1B Speech achieves highly competitive results on standard English ASR benchmarks. Performance is measured using Word Error Rate (WER)—the percentage of words transcribed incorrectly—where lower scores indicate better accuracy.

Benchmark Results

englishasr
Chart 1: Granite 4.0 1B Speech delivers competitively low WER across many benchmarks while being a small model.

Licensing and Usage

License: Apache 2.0.
Framework support: Native integration with Transformers and vLLM.
Evaluation: The model has been evaluated across a range of standard ASR and AST benchmarks—spanning English, multilingual, and translation tasks—and performs as well as or better than larger models.

For full evaluation results, architecture details, training data, and usage examples, see the model card.

Production Recommendations

We recommend pairing Granite 4.0 1B Speech with Granite Guardian for deployments that require additional risk detection.

Give it a try today and let us know what you think!

Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge

Overview

Performance

Benchmark Results

Licensing and Usage

Production Recommendations

Related posts

Building a RAG System from Scratch: Turning Aviation Disruption Data into an AI-Powered Q&A App

Week in AI (Mar 8): Local-First AI Is Winning

[Paper] DEBISS: a Corpus of Individual, Semi-structured and Spoken Debates

Edge AI's Silent Killer: The Observability Gap in Full-Duplex Fidelity