Official Release for h2oGPT 0.2.1
What's Changed
- Update Mac One click installer Mar 08, 2024 by @Mathanraj-Sharma in #1456
- Update gradio constraint to 4.20.1 by @Mathanraj-Sharma in #1457
- General chat_template handling + Sealion by @pseudotensor in #1463
- Update linux install script by @Mathanraj-Sharma in #1462
- CohereForAI/c4ai-command-r-v01 by @pseudotensor in #1467
- Set OpenAI proxy port to 5001 for MacOS by @Mathanraj-Sharma in #1468
- Do map[0] instead of map_reduce if all fits into context by @pseudotensor in #1475
- Image change style by @pseudotensor in #1444
- Fix history with images by @pseudotensor in #1479
- Support claude-3 as vision models by @pseudotensor in #1480
- Add gpt-4-vision support as vision model by @pseudotensor in #1481
- Support gemini-vision-pro as vision model by @pseudotensor in #1482
- [DOCS] Correct typos in FAQ and improve readability by @zainhaq-h2o in #1487
- Google auth by @pseudotensor in #1486
- Handle multiple images for gpt4-vision-preview, gemini-pro-vision and claude-3's by @pseudotensor in #1490
- [DOCS] Fix typos on Links page by @zainhaq-h2o in #1488
- clean-up use of grclient by @pseudotensor in #1493
- feat: Qdrant vector store by @Anush008 in #1460
- [Snyk] Fix for 3 vulnerabilities by @smg478 in #1498
- [Snyk] Fix for 15 vulnerabilities by @smg478 in #1501
- Fix trust by @pseudotensor in #1505
- Makelist by @pseudotensor in #1511
- Pass number of prompt tokens and prompt_raw back by @pseudotensor in #1514
- Handle multiple images for llava by @pseudotensor in #1516
- fix: Ignore Qdrant scroll offset gpt_langchain.py by @Anush008 in #1524
- JSON mode by @pseudotensor in #1527
- Gradio 4.25.0 by @pseudotensor in #1510
- Fix grounded template token counting by @pseudotensor in #1533
- Fix llava token counting by @pseudotensor in #1534
- Fixdocker by @pseudotensor in #1538
- Update docker_build_script_ubuntu.sh by @achraf-mer in #1541
- Check and version by @pseudotensor in #1542
- Back to gradio 4.20.1, 4.25.0 really bad in terms of speed and overall stressed performance. Eventually hangs server too easily by @pseudotensor in #1546
- Restore gradio 4.26.0 but no heartbeat by @pseudotensor in #1562
- Repair json if required, also pass back raw response without extraction by @pseudotensor in #1568
- Faster auth access using sqlite3 instead of full json load/change every minor operation by @pseudotensor in #1569
- Stream in async for summary/extract by @pseudotensor in #1575
- Isolate JSON prompts so can change language etc. by @pseudotensor in #1581
- Ensure llama-3 or other chat template based models handled by @pseudotensor in #1588
- Clean-up stopping to avoid hard-coded things for llama-3 as it was fixed 11 days ago. by @pseudotensor in #1590
- remove vllm-check/tgi-check init-container by @robinliubin in #1605
- Together.ai support and remove old chroma migration by @pseudotensor in #1607
- [HELM] Fixes - Add Args when running h2oGPT only by @EshamAaqib in #1610
- Improve split and merge by @pseudotensor in #1612
- [DOCS] Minor FAQ improvements by @zainhaq-h2o in #1613
- At least provide rules even if no schema by @pseudotensor in #1620
- Add OpenAI Proxy TTS by @pseudotensor in #1621
- set podSecurityContext to null, so umbrella can overwrite on openshift by @robinliubin in #1618
- guided_whitespace_pattern by @pseudotensor in #1625
- OpenAI proxy STT by @pseudotensor in #1622
- Refactor gradio tools to isolate non-gradio functions. Fix audio streaming for TTS through OpenAI. WIP for direct OpenAI nochat call without gradio. by @pseudotensor in #1543
- Add support for idefics2 vision model via TGI client by @pseudotensor in #1629
- Put file lock as deep as possible to avoid over locking by @pseudotensor in #1640
- Function server by @pseudotensor in #1641
- Use gunicorn so dead workers restart unlike uvicorn by @pseudotensor in #1645
- Cogvlm2 by @pseudotensor in #1651
- Fix asyncio sglang use by @pseudotensor in #1654
- Repair json work around by @pseudotensor in #1658
- Add function calling for mistralai for better json mode by @pseudotensor in #1659
New Contributors
Full Changelog: 0.2.0...0.2.1