Actions: ggml-org/llama.cpp

Server

10,693 workflow runs

vulkan: fix assertion when qy_needs_dequant (#12068)
Server #11203: Commit a82c9e7 pushed by 0cc4m
February 25, 2025 15:30 9m 28s master
Add GGML_HIP_ROCWMMA_FATTN to enable rocWMMA for FlashAttention
Server #11202: Pull request #12032 synchronize by hjc4869
February 25, 2025 15:12 Action required hjc4869:pr
llama : add xcframework build script
Server #11201: Pull request #11996 synchronize by danbev
February 25, 2025 15:06 8m 10s danbev:xcframework-build-10747
Add GGML_HIP_ROCWMMA_FATTN to enable rocWMMA for FlashAttention
Server #11200: Pull request #12032 synchronize by hjc4869
February 25, 2025 15:00 Action required hjc4869:pr
llama : add xcframework build script
Server #11199: Pull request #11996 synchronize by danbev
February 25, 2025 14:42 25m 1s danbev:xcframework-build-10747
tool-call: add support for tool-calls using Model Context Protocol
Server #11198: Pull request #11556 synchronize by bandoti
February 25, 2025 14:41 19m 39s bandoti:llamacli-tools
tool-call: add support for tool-calls using Model Context Protocol
Server #11197: Pull request #11556 synchronize by bandoti
February 25, 2025 14:15 26m 46s bandoti:llamacli-tools
vulkan: fix assertion when qy_needs_dequant
Server #11196: Pull request #12068 opened by jeffbolznv
February 25, 2025 14:14 33m 5s jeffbolznv:qy_dequant_assert
llama : refactor llama_kv_cache, llama_context and llm_build_context
Server #11194: Pull request #11213 synchronize by ggerganov
February 25, 2025 14:11 8m 33s gg/llama-kv-cache
ggml: aarch64: implement SVE kernels for q2_k_q8_k vector dot
Server #11192: Pull request #12064 synchronize by Vithulep
February 25, 2025 13:33 7m 47s Vithulep:Q2_k_SVE_Kernel
Cache based tokenization for the server input prompts
Server #11191: Pull request #12067 opened by vnicolici
February 25, 2025 13:08 Action required vnicolici:cache-based-tokenization
llama : add xcframework build script
Server #11190: Pull request #11996 synchronize by danbev
February 25, 2025 12:55 27m 58s danbev:xcframework-build-10747
ggml: aarch64: implement SVE kernels for q2_k_q8_k vector dot
Server #11187: Pull request #12064 synchronize by Vithulep
February 25, 2025 11:55 1h 3m 17s Vithulep:Q2_k_SVE_Kernel
server: handle echo=false on /v1/completions (#12060)
Server #11186: Commit 401af80 pushed by ngxson
February 25, 2025 11:52 54m 37s master
tool-call: add support for tool-calls using Model Context Protocol
Server #11183: Pull request #11556 synchronize by bandoti
February 25, 2025 11:41 50m 8s bandoti:llamacli-tools
add OP sigmoid (#12056)
Server #11182: Commit c132239 pushed by 0cc4m
February 25, 2025 11:32 42m 54s master
tool-call: add support for tool-calls using Model Context Protocol
Server #11181: Pull request #11556 synchronize by bandoti
February 25, 2025 11:31 9m 50s bandoti:llamacli-tools
ggml-cpu: Fix build with sve (#12059)
Server #11180: Commit 393fca6 pushed by MollySophia
February 25, 2025 11:28 23m 3s master
vulkan: implement more backpropagation operators (#11914)
Server #11179: Commit 61d4f39 pushed by 0cc4m
February 25, 2025 11:04 29m 27s master