[GPU] Optimized operations in the blas kernels with the latest buffer changes.

    Updated the pipeline for both fp32 and fp16.
    SwiGLU, RmsNorm and Concat ops updated.

        **Self evaluation:**
        1. Build test:   [X]Passed [ ]Failed [ ]Skipped
        2. Run test:     [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Niket Agarwal <[email protected]>
niket-agarwal authored and jijoongmoon committed Feb 24, 2025
1 parent f154d7c commit 95bd480
Showing 3 changed files with 204 additions and 183 deletions.