High Word Error Rates % in Large Whisper V3 #552

zhao-lun · 2024-03-07T09:48:55Z

model_used=large_whisperv3
Hardware: A100
Dataset: aishell

client cmd

num_task=16
python3 client.py     --server-addr localhost   \
  --model-name whisper     --num-tasks $num_task   \
 --whisper-prompt "<|startoftranscript|><|zh|><|transcribe|><|notimestamps|>"    \
 --manifest-dir /sample_dataset/aishell1_test/

server is built using the sample dockerfile

output:
RTF: 0.0092
total_duration: 32590.000 seconds
(9.05 hours)
processing time: 299.156 seconds (0.08 hours)

%WER = 53.34
Errors: 55 insertions, 0 deletions, 3773 substitutions, over 7176 reference words (3403 correct)

Hi,
i followed setup instructions, and it ran without any issues. However, I noticed an abnormal WER percentage. Is this normal?

The text was updated successfully, but these errors were encountered:

yuekaizhang · 2024-03-11T06:04:03Z

@zhao-lun https://github.com/k2-fsa/icefall/blob/master/egs/aishell/ASR/whisper/decode.py#L286-L288, check this to do normalize before computing metrics.

If you have some free time, feel free to make a PR to triton-asr-client/client.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

High Word Error Rates % in Large Whisper V3 #552

High Word Error Rates % in Large Whisper V3 #552

zhao-lun commented Mar 7, 2024 •

edited

Loading

yuekaizhang commented Mar 11, 2024

High Word Error Rates % in Large Whisper V3 #552

High Word Error Rates % in Large Whisper V3 #552

Comments

zhao-lun commented Mar 7, 2024 • edited Loading

yuekaizhang commented Mar 11, 2024

zhao-lun commented Mar 7, 2024 •

edited

Loading