
[webgpu] support Pad operator #23141

Open
wants to merge 11 commits into main
Conversation

Contributor
@xhcao xhcao commented Dec 18, 2024

Description

Motivation and Context

@xhcao
Contributor Author

xhcao commented Dec 18, 2024

@jchen10 @hujiajie PTAL, thanks

@xhcao xhcao marked this pull request as ready for review December 18, 2024 11:29
@guschmue guschmue added the ep:WebGPU ort-web webgpu provider label Dec 19, 2024
@guschmue
Contributor

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

@guschmue
Contributor

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

@guschmue
Contributor

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models


Azure Pipelines successfully started running 2 pipeline(s).

@guschmue
Contributor

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline


Azure Pipelines successfully started running 4 pipeline(s).


Azure Pipelines successfully started running 3 pipeline(s).


Azure Pipelines successfully started running 9 pipeline(s).

@xhcao
Contributor Author

xhcao commented Dec 20, 2024

@fs-eire @guschmue Please help trigger the bots again. The last version failed to compile on macOS but compiled correctly on Windows. I have changed the code, but have not yet verified that it works on macOS. The compile error is shown below.
/Users/runner/work/1/s/onnxruntime/core/providers/webgpu/tensor/pad.h:16:54: error: member initializer 'Program' does not name a non-static data member or base class PadProgram(const Mode mode, bool dim_value_zero) : Program{"Pad"}, mode_{mode}, dim_value_zero_{dim_value_zero} {}
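The diagnostic above is about the mem-initializer list: Apple Clang rejects `Program{"Pad"}` when that name does not resolve to the base class in the derived class's scope. One robust fix is to spell out the full base-class template-id in the initializer. Below is a minimal self-contained sketch of that pattern; the `Program`, `Mode`, and member names are simplified stand-ins, not the real onnxruntime classes.

```cpp
#include <string>
#include <utility>

// Simplified stand-in for a CRTP program base class (hypothetical shape,
// not the actual onnxruntime WebGPU Program class).
template <typename Derived>
class Program {
 public:
  explicit Program(std::string name) : name_(std::move(name)) {}
  const std::string& Name() const { return name_; }

 private:
  std::string name_;
};

enum class Mode { Constant, Reflect, Edge, Wrap };

class PadProgram final : public Program<PadProgram> {
 public:
  // Spelling out the full template-id Program<PadProgram> in the
  // mem-initializer avoids the "member initializer 'Program' does not
  // name a non-static data member or base class" diagnostic.
  PadProgram(Mode mode, bool dim_value_zero)
      : Program<PadProgram>{"Pad"},
        mode_{mode},
        dim_value_zero_{dim_value_zero} {}

  Mode mode() const { return mode_; }
  bool dim_value_zero() const { return dim_value_zero_; }

 private:
  Mode mode_;
  bool dim_value_zero_;
};
```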

@guschmue
Contributor

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

@guschmue
Contributor

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

@guschmue
Contributor

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models


Azure Pipelines successfully started running 2 pipeline(s).

@guschmue
Contributor

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline


Azure Pipelines successfully started running 4 pipeline(s).


Azure Pipelines successfully started running 3 pipeline(s).


Azure Pipelines successfully started running 9 pipeline(s).

@guschmue
Contributor

/azp run Win_TRT_Minimal_CUDA_Test_CI


Azure Pipelines successfully started running 1 pipeline(s).

guschmue previously approved these changes Jan 14, 2025
@fs-eire
Contributor

fs-eire commented Feb 11, 2025

The build break in the Web CI pipeline is caused by this change:

2025-02-08T07:34:42.7945646Z /mnt/vss/_work/1/s/onnxruntime/core/providers/webgpu/tensor/pad.cc:116:116: error: implicit conversion loses integer precision: 'int64_t' (aka 'long long') to 'size_type' (aka 'unsigned long') [-Werror,-Wshorten-64-to-32]
2025-02-08T07:34:42.7948642Z   116 |     int64_t upper_pad = (*p_pads)[static_cast<int64_t>(i) + dimension_count] + (*p_slices)[static_cast<int64_t>(i) + dimension_count];
2025-02-08T07:34:42.7950719Z       |                                                                                ~           ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~
2025-02-08T07:34:42.7951830Z /mnt/vss/_work/1/s/onnxruntime/core/providers/webgpu/tensor/pad.cc:116:59: error: implicit conversion loses integer precision: 'int64_t' (aka 'long long') to 'size_type' (aka 'unsigned long') [-Werror,-Wshorten-64-to-32]
2025-02-08T07:34:42.7989831Z   116 |     int64_t upper_pad = (*p_pads)[static_cast<int64_t>(i) + dimension_count] + (*p_slices)[static_cast<int64_t>(i) + dimension_count];
2025-02-08T07:34:42.7990546Z       |                         ~         ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~

The pipeline is using parallel building so you may need to scroll up a little bit more to find the error message.

In the WebAssembly build, size_t is 4 bytes instead of 8; this is why the warning occurred.
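With `-Werror,-Wshorten-64-to-32`, subscripting a `std::vector` with an `int64_t` expression narrows implicitly on 32-bit targets like WebAssembly, where `size_type` is 32-bit. A hypothetical sketch of the indexing fix (the function and parameter names here are illustrative, not the actual code in this PR) is to keep the subscript computation in `size_t` so no 64-to-32 narrowing happens at the index:

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Compute the upper pad for dimension i. Keeping the subscript in size_t
// avoids the implicit int64_t -> size_type narrowing that triggers
// -Wshorten-64-to-32 on 32-bit (wasm) builds.
int64_t UpperPad(const std::vector<int64_t>& pads,
                 const std::vector<int64_t>& slices,
                 size_t i, size_t dimension_count) {
  size_t idx = i + dimension_count;  // stays size_t end to end
  return pads[idx] + slices[idx];
}
```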

@xhcao
Contributor Author

xhcao commented Feb 12, 2025

@fs-eire take a look again, thanks.

@guschmue
Contributor

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

@guschmue
Contributor

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline


Azure Pipelines successfully started running 2 pipeline(s).

@guschmue
Contributor

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

@guschmue
Contributor

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI


Azure Pipelines successfully started running 4 pipeline(s).



Azure Pipelines successfully started running 9 pipeline(s).

guschmue previously approved these changes Feb 12, 2025
@guschmue
Contributor

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

@guschmue
Contributor

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

@guschmue
Contributor

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models


Azure Pipelines successfully started running 2 pipeline(s).

@guschmue
Contributor

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI


Azure Pipelines successfully started running 4 pipeline(s).



Azure Pipelines successfully started running 9 pipeline(s).

@fs-eire
Contributor

fs-eire commented Feb 14, 2025

It seems that onnx_backend_test_series fails with SIGSEGV on macOS, and the error reproduces consistently on retry.

It may be worth verifying whether this is reproducible locally on a macOS device.

@guschmue
Contributor

guschmue commented Feb 14, 2025

The macOS pipeline is failing with:
'onnx_backend_test_series.py']' died with <Signals.SIGSEGV: 11>.

It seems to be OK on main as far as I can tell, so something in this PR may be triggering it.

@xhcao
Contributor Author

xhcao commented Feb 25, 2025

@fs-eire @guschmue The bots' error logs have been removed; could you help trigger the bots again? I want to reproduce the issue on a local Mac, but I need to know the options used when running 'onnx_backend_test_series.py'. Thanks.

Labels
ep:WebGPU ort-web webgpu provider
4 participants