Replies: 2 comments
-
Note that your link gave a 404 and was probably referring to a different implementation of Whisper. The model itself was trained on monolingual audio, so you can't directly give it a list of languages and expect it to code-switch between them automatically (without direction, Whisper can sometimes transcribe code-switched speech successfully, but it is unreliable). Workarounds are outlined in #2009: first diarize the audio with pyannote, then detect the language of each segment, transcribe each segment monolingually, and stitch the results together. That's about the extent of what you can do with Whisper. Instead of pyannote, you could also try running tiny Whisper with a prompt that puts a hyphen in front of each utterance, which can nudge Whisper toward a primitive sort of speaker-change detection. Outside of Whisper, you can also look at solutions based on Meta's MMS fine-tuned for multilingual code switching.
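The diarize-then-stitch workaround above could be sketched roughly as follows. This is only an illustration of the stitching logic: the actual pyannote diarization, language detection, and Whisper transcription calls are stubbed out as a hypothetical `transcribe` callback, since the real pipelines require model downloads and auth tokens.

```python
# Sketch of: diarize -> detect language per segment -> transcribe each
# segment monolingually -> stitch the results back in time order.
# `segments` would come from a diarization/language-ID step and is a list
# of (start_sec, end_sec, language_code) tuples. `transcribe` stands in
# for a per-segment, single-language Whisper call.

def stitch_segments(segments, transcribe):
    """Transcribe each segment in its own language and join in time order."""
    parts = []
    for start, end, lang in sorted(segments, key=lambda s: s[0]):
        # One monolingual Whisper pass per segment, forced to `lang`.
        parts.append(transcribe(start, end, lang))
    return " ".join(parts)

# Example with a stub in place of a real Whisper call:
def fake_transcribe(start, end, lang):
    return f"<{lang}:{start}-{end}>"

print(stitch_segments([(5.0, 9.0, "en"), (0.0, 5.0, "zh")], fake_transcribe))
```

In a real pipeline, `transcribe` would slice the audio to `[start, end]` and run Whisper with the language explicitly set for that slice.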
-
Just to correct the above link: https://huggingface.co/blog/fine-tune-whisper
-
In the "Load WhisperTokenizer" section at https://huggingface.co/blog/fine-tune-whisper, it is mentioned that "We simply have to specify the target language and the task."
Does that mean the languages are kept separate and the output cannot mix them, for example English words mixed into Chinese text?
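For context on why the output is pinned to one language: Whisper specifies the language and task as special tokens prepended to the decoder prompt, e.g. `<|startoftranscript|><|zh|><|transcribe|>`, so a single language token governs the whole transcription. A minimal sketch of how that prompt is assembled (the token names follow Whisper's special-token scheme; the helper function itself is hypothetical, not a library API):

```python
# Illustrative only: builds the decoder prompt string Whisper conditions on.
# Exactly one language token appears, which is why the model is steered
# toward a single language for the entire output.

def decoder_prompt(language_token: str, task: str) -> str:
    """Assemble Whisper-style special tokens, e.g. for Chinese transcription."""
    return f"<|startoftranscript|><|{language_token}|><|{task}|>"

print(decoder_prompt("zh", "transcribe"))
```

In practice code-switched words can still appear in the output, but as the first comment notes, this is not something the model was trained to do reliably.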