Comparison of Codictate's curated speech models (Small q5_1, Large V3 Turbo q5_0, Large V3 q5_0, Parakeet) to determine the best-performing default model.

May 8, 2026 · Apple M4 Max / 36 GB / macOS 26.4.1 · 4 models · 200 samples per dataset

Accuracy (%) per language. Speed is in milliseconds of processing per second of audio (lower is faster).
Model	Samples	Disk	RAM	Speed	Avg overall	Avg EN	Avg multi	EN	EN noisy	ES	DA	HU
Large V3 q5_0	200	1.1 GB	2.0 GB	147 ms	92.0%	95.2%	89.9%	96.5%	94.0%	97.0%	87.1%	85.5%
Large V3 Turbo q5_0	200	574 MB	801 MB	105 ms	91.2%	95.0%	88.6%	96.2%	93.8%	96.8%	85.4%	83.5%
Parakeet TDT v3	200	500 MB	80 MB	19 ms	89.2%	94.1%	85.9%	95.6%	92.6%	95.5%	80.6%	81.6%
Small q5_1	200	181 MB	477 MB	58 ms	80.8%	93.0%	72.6%	94.9%	91.1%	94.1%	64.3%	59.4%
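
To make the Speed column easier to read, the sketch below converts "ms per second of audio" into a real-time multiplier and an estimated processing time for a clip. The speed values are copied from the table above. The function names are illustrative, and the assumption that processing time scales linearly with clip length is mine, not something the benchmark states.

```python
# Speed values copied from the table above:
# milliseconds of processing per second of audio.
SPEED_MS_PER_AUDIO_SECOND = {
    "Large V3 q5_0": 147,
    "Large V3 Turbo q5_0": 105,
    "Parakeet TDT v3": 19,
    "Small q5_1": 58,
}


def realtime_speedup(ms_per_audio_second: float) -> float:
    """Seconds of audio processed per second of compute."""
    return 1000.0 / ms_per_audio_second


def processing_seconds(ms_per_audio_second: float, audio_seconds: float) -> float:
    """Estimated compute time for a clip.

    Assumption: cost scales linearly with clip length.
    """
    return ms_per_audio_second * audio_seconds / 1000.0


if __name__ == "__main__":
    for model, ms in SPEED_MS_PER_AUDIO_SECOND.items():
        print(
            f"{model}: {realtime_speedup(ms):.1f}x real time, "
            f"{processing_seconds(ms, 60):.1f} s per minute of audio"
        )
```

Under these assumptions, 19 ms per second of audio works out to roughly 53x real time, while 147 ms per second of audio is roughly 7x real time.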

At a glance

Ratings are computed from the benchmark data and scaled from 1 to 10. Accuracy is based on word error rate (WER) and does not yet account for punctuation.

Name	Lang	Translate	Speed	Accuracy
Large V3 Turbo q5_0	all	✘	7	9
Large V3 q5_0	all	✔	6	9
Parakeet TDT v3	25	✘	10	9
Small q5_1	all	✔	9	7
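
Since accuracy is based on WER, a minimal sketch of a word-level WER calculation may help. It computes the word edit distance (substitutions, insertions, deletions) divided by the number of reference words. Two things here are assumptions rather than Codictate's documented method: the normalization (lowercasing and dropping punctuation, which matches punctuation not being scored) and the conversion to accuracy as 1 minus WER. All function names are hypothetical.

```python
import re


def normalize(text: str) -> list[str]:
    # Assumption: lowercase and replace punctuation (except apostrophes)
    # with spaces, so punctuation differences are not scored.
    return re.sub(r"[^\w\s']", " ", text.lower()).split()


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = normalize(reference), normalize(hypothesis)
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def accuracy_percent(reference: str, hypothesis: str) -> float:
    # Assumption: accuracy = 1 - WER, clamped at 0 because WER can exceed 1.
    return max(0.0, 1.0 - word_error_rate(reference, hypothesis)) * 100.0
```

For example, scoring the hypothesis "the quick brown box" against the reference "the quick brown fox" gives one substitution in four words, so a WER of 0.25 and an accuracy of 75%.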