Change the repository type filter
All
Repositories list
16 repositories
UMbreLLa
PublicLLM Inference on consumer devicesAPE
PublicRULER
Public- scalable and robust tree-based speculative decoding algorithm
S2FT
PublicS2FT-Page
PublicMagicPIG
Public[ICLR2025] MagicPIG: LSH Sampling for Efficient LLM GenerationMagicDec
PublicFactor
PublicMagicDec-part1
PublicSirius
PublicMagicDec-part2
PublicTriForce
Public[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative DecodingSequoia-Page
Public