v1.0.0
What's new in 1.0.0 (2024-11-15)
These are the changes in inference v1.0.0.
New features
- FEAT: Basic cancel support for image model by @codingl2k1 in #2528
- FEAT: Add qwen2.5-coder 0.5B 1.5B 3B 14B 32B by @frostyplanet in #2543
- FEAT: support kvcache in multi-round chat for MLX by @qinxuye in #2534
Enhancements
- ENH: add normalize to rerank model by @hustyichi in #2509
- ENH: Update fish audio by @codingl2k1 in #2555
Bug fixes
Documentation
- DOC: Add paper citation by @luweizheng in #2533
Full Changelog: v0.16.3...v1.0.0