Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

High Word Error Rates % in Large Whisper V3 #552

Open
zhao-lun opened this issue Mar 7, 2024 · 1 comment
Open

High Word Error Rates % in Large Whisper V3 #552

zhao-lun opened this issue Mar 7, 2024 · 1 comment

Comments

@zhao-lun
Copy link

zhao-lun commented Mar 7, 2024

model_used=large_whisperv3
Hardware: A100
Dataset: aishell

client cmd

num_task=16
python3 client.py     --server-addr localhost   \
  --model-name whisper     --num-tasks $num_task   \
 --whisper-prompt "<|startoftranscript|><|zh|><|transcribe|><|notimestamps|>"    \
 --manifest-dir /sample_dataset/aishell1_test/ 

server is built using the sample dockerfile

output:
RTF: 0.0092
total_duration: 32590.000 seconds
(9.05 hours)
processing time: 299.156 seconds (0.08 hours)

%WER = 53.34
Errors: 55 insertions, 0 deletions, 3773 substitutions, over 7176 reference words (3403 correct)

Hi,
i followed setup instructions, and it ran without any issues. However, I noticed an abnormal WER percentage. Is this normal?

@yuekaizhang
Copy link
Collaborator

@zhao-lun https://github.com/k2-fsa/icefall/blob/master/egs/aishell/ASR/whisper/decode.py#L286-L288, check this to do normalize before computing metrics.

If you have some free time, feel free to make a PR to triton-asr-client/client.py.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants