Replies: 1 comment 3 replies
-
whisper accuracy on uzbek language is very bad (WER > 90% see paper), so u may want to fine tune custom tokenizer meaning u have to re-train whisper from scratch |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I use whiper-small tokenizer of Uzbek language.
It's working but, something wrong.
Because, it's returned non-Uzbek characters.
How good (correct) it is?
How to fix that?
Can I make custom tokenizer? (share some articles).
Beta Was this translation helpful? Give feedback.
All reactions