Running the same code twice giving two different results #1085
Comments
Is this possible? Has anyone else observed this?
OK, here is what I did. I took a pretrained HF model (https://huggingface.co/vasista22/whisper-hindi-small) and fine-tuned it on my own data. Then I converted the checkpoint to faster-whisper format. If I pass "word_timestamps=True" to the transcribe function, I get extra (useless) segments in the output, and I don't know why. This does not happen if I use the whisper model directly for transcription. It happens only with my fine-tuned model.
When "word_timestamps=False", the output is as follows; when it is True, the output is like this:
Is there something obviously wrong here?
Hi, I am running faster-whisper on an audio file as follows:
The same code sometimes gives two segments and sometimes gives one segment for the same audio file. I find this weird. Is this expected? Whenever it gives two segments, the second one is always an "insertion": there is no speech in that region, but the model outputs some words anyway.
However, if I slightly modify the above call so that it does not output word timestamps, as follows:
I always get only one segment in the output with good accuracy.
Is the presence of "word_timestamps=True" messing this up?
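To put numbers on the behaviour described above, here is a minimal sketch that runs the same transcription several times and tallies how many segments each run returns. The helper name `tally_segment_counts`, the model path and the audio file name are hypothetical and not taken from my actual script. The commented-out usage assumes the usual faster-whisper pattern, where `WhisperModel.transcribe` returns a lazy segment generator together with an info object.

```python
from collections import Counter
from typing import Any, Callable, Iterable


def tally_segment_counts(transcribe: Callable[[], Iterable[Any]], runs: int = 5) -> Counter:
    """Call `transcribe` `runs` times and count how often each segment count occurs.

    `transcribe` is any zero-argument callable returning an iterable of segments.
    """
    counts: Counter = Counter()
    for _ in range(runs):
        # Materialize the iterable: the segments may be produced lazily.
        segments = list(transcribe())
        counts[len(segments)] += 1
    return counts


# Hypothetical usage with faster-whisper (paths are placeholders):
#
# from faster_whisper import WhisperModel
#
# model = WhisperModel("path/to/converted-model")
#
# def run(word_timestamps: bool):
#     segments, _info = model.transcribe("audio.wav", word_timestamps=word_timestamps)
#     return segments
#
# print("word_timestamps=True :", tally_segment_counts(lambda: run(True)))
# print("word_timestamps=False:", tally_segment_counts(lambda: run(False)))
```

If the tally for "word_timestamps=True" shows a mix of one-segment and two-segment runs while the tally for False shows only one-segment runs, that would confirm the pattern I am describing.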