What's new in 1.1.0 (2024-12-13)
These are the changes in inference v1.1.0.
New features
- FEAT: Support F5 TTS by @codingl2k1 in #2626
- FEAT: [UI] Add a hint for model running. by @yiboyasss in #2657
- FEAT: support VL models for MLX by @qinxuye in #2638
- FEAT: Add support for CLIP model by @Second222None in #2637
- FEAT: support llama-3.3-instruct by @qinxuye in #2661
Enhancements
- ENH: Optimize error message when user parameters are passed incorrectly by @namecd in #2623
- ENH: bypass the sampling parameter skip_special_tokens to vLLM backend by @zjuyzj in #2655
- ENH: unify prompt_text as cosyvoice for fish speech by @qinxuye in #2658
- ENH: Update glm4 chat model to new weights by @codingl2k1 in #2660
- ENH: upgrade sglang in Docker by @amumu96 in #2668
Bug fixes
- BUG: Cleanup Isolation tasks by @codingl2k1 in #2603
- BUG: fix qwq gguf download hub for modelscope by @redreamality in #2647
- BUG: fix ImportError when optional dependency FlagEmbedding is not installed by @zjuyzj in #2649
- BUG: use stream_generate in MLX by @qinxuye in #2635
- BUG:
stop
parameter leads to failure withtransformers
backend by @ChengjieLi28 in #2663 - BUG: fix FishSpeech Negative code found by @themanforfree in #2667
Documentation
- DOC: update new models by @qinxuye in #2632
- DOC: add doc about offline usage for SenseVoiceSmall by @qinxuye in #2654
Others
- FIX: fix launching bge-m3 with hybrid mode by @pengjunfeng11 in #2641
New Contributors
- @namecd made their first contribution in #2623
- @redreamality made their first contribution in #2647
- @Second222None made their first contribution in #2637
- @themanforfree made their first contribution in #2667
Full Changelog: v1.0.1...v1.1.0