Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[v3.x] Cannot load whisper-v3-large-turbo #989

Open
1 of 5 tasks
liuhuapiaoyuan opened this issue Oct 24, 2024 · 1 comment
Open
1 of 5 tasks

[v3.x] Cannot load whisper-v3-large-turbo #989

liuhuapiaoyuan opened this issue Oct 24, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@liuhuapiaoyuan
Copy link

System Info

3.x

Environment/Platform

  • Website/web-app
  • Browser extension
  • Server-side (e.g., Node.js, Deno, Bun)
  • Desktop app (e.g., Electron)
  • Other (e.g., VSCode extension)

Description

Can't create a session. ERROR_CODE: 1, ERROR_MESSAGE: Deserialize tensor onnx::MatMul_4069 failed.Failed to load external data file ""encoder_model.onnx_data"", error: Module.MountedFiles is not available.

Reproduction

pipeline(
'automatic-speech-recognition',
'onnx-community/whisper-large-v3-turbo',
{ progress_callback,device:"webgpu" },
)

@liuhuapiaoyuan liuhuapiaoyuan added the bug Something isn't working label Oct 24, 2024
@xenova
Copy link
Collaborator

xenova commented Oct 24, 2024

Hi there 👋 There are 2 ways to fix this:

  1. Use a dtype which is < 2GB:

    pipeline(
        'automatic-speech-recognition',
        'onnx-community/whisper-large-v3-turbo',
        {
            progress_callback,
            device:"webgpu",
            dtype: {
                encoder_model: "fp16",
                decoder_model_merged: "q4",
            }
        },
    )
  2. Use external data format for the fp32 encoder (which is > 2GB):

    pipeline(
        'automatic-speech-recognition',
        'onnx-community/whisper-large-v3-turbo',
        {
            progress_callback,
            device:"webgpu",
            use_external_data_format: {
                encoder_model: true,
                decoder_model_merged: false,
            }
        },
    )

Let me know if that helps!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants