Hacker News logo

Show HN: I trained a 9M speech model to fix my Mandarin tones

461 points
by simedw
3 days ago
148 comments
Built this because tones are killing my spoken Mandarin and I can't reliably hear my own mistakes.

It's a 9M Conformer-CTC model trained on ~300h (AISHELL + Primewords), quantized to INT8 (11 MB), runs 100% in-browser via ONNX Runtime Web.

Grades per-syllable pronunciation + tones with Viterbi forced alignment.

Try it here: https://simedw.com/projects/ear/


148 comments

Loading...

Almost there! We're setting everything up for you.

Built by Troy Ciesco
Hacker News API