Comparison of Codictate's curated speech models (Small q5_1, Large V3 Turbo q5_0, Large V3 q5_0, Parakeet) to determine the best-performing default model.

May 8, 2026Apple M4 Max / 36 GB / macOS 26.4.14 models200 samples per dataset
Accuracy (%) per language. Speed in ms per second of audio.
Model Samples Disk RAM Speed Avg overall Avg EN Avg multi EN EN noisy ES DA HU
Large V3 q5_02001.1 GB2.0 GB147 ms92.0%95.2%89.9%96.5%94.0%97.0%87.1%85.5%
Large V3 Turbo q5_0200574 MB801 MB105 ms91.2%95.0%88.6%96.2%93.8%96.8%85.4%83.5%
Parakeet TDT v3200500 MB80 MB19 ms89.2%94.1%85.9%95.6%92.6%95.5%80.6%81.6%
Small q5_1200181 MB477 MB58 ms80.8%93.0%72.6%94.9%91.1%94.1%64.3%59.4%

At a glance

Ratings computed from benchmark data, scaled 1 to 10. Accuracy is based on Word Error Rate (WER) and does not include punctuation yet.

Name Lang Translate Speed Accuracy
Large V3 Turbo q5_0all79
Large V3 q5_0all69
Parakeet TDT v325109
Small q5_1all97

Charts

Bar chart showing average English, multilingual, and overall accuracy per model.
Average accuracy by group
Bar chart comparing transcription speed across models and test conditions.
Speed comparison across conditions
Bar chart comparing model accuracy across English, Spanish, Danish, and Hungarian benchmark conditions.
Accuracy by model and test condition