Show HN: I trained a 9M speech model to fix my Mandarin tones

Published: (January 30, 2026 at 07:51 PM EST)
Source: Hacker News

Model Overview

I built this because tones are killing my spoken Mandarin and I can't reliably hear my own mistakes.
It's a 9M-parameter Conformer-CTC model trained on ~300 h of speech (AISHELL + Primewords), quantized to INT8 (11 MB), that runs 100% in-browser via ONNX Runtime Web.
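As a rough sanity check on those numbers, here is the weight-storage arithmetic, assuming "9M" means roughly 9 million parameters. The helper name is made up for illustration, and the post does not break down what the gap between the raw INT8 weights and the 11 MB file consists of.

```python
def weight_megabytes(num_params: int, bytes_per_param: int) -> float:
    """Raw weight storage in MB (10**6 bytes), ignoring any file overhead."""
    return num_params * bytes_per_param / 1e6


# Assumption: "9M" in the post is the parameter count.
PARAMS = 9_000_000

fp32_mb = weight_megabytes(PARAMS, 4)  # 32-bit floats: 4 bytes per weight
int8_mb = weight_megabytes(PARAMS, 1)  # INT8: 1 byte per weight

print(f"FP32 weights: {fp32_mb:.0f} MB")
print(f"INT8 weights: {int8_mb:.0f} MB (reported file size: 11 MB)")
```

So INT8 cuts the raw weights to about a quarter of their FP32 size, which is what makes the download small enough to ship to a browser.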

The model grades per-syllable pronunciation and tones using Viterbi forced alignment.
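The post doesn't show the alignment code, so the following is only a minimal sketch of how Viterbi forced alignment over CTC outputs can work, not the author's implementation. It assumes the model emits per-frame log-probabilities over a vocabulary with a blank at index 0 and that the expected syllable+tone tokens are already known as integer IDs. The function names and the mean log-probability score are illustrative assumptions.

```python
import math
from typing import List

NEG_INF = float("-inf")


def ctc_forced_align(log_probs: List[List[float]], targets: List[int],
                     blank: int = 0) -> List[int]:
    """Viterbi path through the blank-interleaved target sequence.

    Returns one state index per frame; odd index s maps to targets[s // 2],
    even indices are blanks.
    """
    if not log_probs:
        raise ValueError("no frames to align")
    # Extended sequence: blank, t1, blank, t2, ..., blank
    ext = [blank]
    for tok in targets:
        ext += [tok, blank]
    T, S = len(log_probs), len(ext)
    score = [[NEG_INF] * S for _ in range(T)]
    back = [[0] * S for _ in range(T)]
    # A path may start in the leading blank or in the first target token.
    score[0][0] = log_probs[0][ext[0]]
    if S > 1:
        score[0][1] = log_probs[0][ext[1]]
    for t in range(1, T):
        for s in range(S):
            # Predecessors: stay, advance by one, or skip the blank
            # between two different non-blank tokens.
            cands = [s]
            if s >= 1:
                cands.append(s - 1)
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                cands.append(s - 2)
            best = max(cands, key=lambda p: score[t - 1][p])
            if score[t - 1][best] == NEG_INF:
                continue  # state not reachable at this frame
            score[t][s] = score[t - 1][best] + log_probs[t][ext[s]]
            back[t][s] = best
    # A valid path ends in the last token or the trailing blank.
    s = S - 1
    if S > 1 and score[T - 1][S - 2] > score[T - 1][S - 1]:
        s = S - 2
    if score[T - 1][s] == NEG_INF:
        raise ValueError("too few frames for the target sequence")
    states = []
    for t in range(T - 1, -1, -1):
        states.append(s)
        s = back[t][s]
    return states[::-1]


def score_targets(log_probs: List[List[float]], targets: List[int],
                  blank: int = 0) -> List[float]:
    """Mean log-probability of each target token over its aligned frames.

    This scoring rule is an assumption for illustration only.
    """
    states = ctc_forced_align(log_probs, targets, blank)
    sums = [0.0] * len(targets)
    counts = [0] * len(targets)
    for t, s in enumerate(states):
        if s % 2 == 1:  # odd states are target tokens
            k = s // 2
            sums[k] += log_probs[t][targets[k]]
            counts[k] += 1
    return [sums[k] / counts[k] for k in range(len(targets))]
```

Because the path is forced through the expected tokens, a low score for one token points at the specific syllable (or tone) where the audio disagrees with the target, which is what makes per-syllable grading possible.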
