Starred repositories
Toolkit for linearizing PDFs for LLM datasets/training
Parse PDFs into markdown using Vision LLMs
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
🚀🚀 "Large model": train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏
Automate the process of making money online.
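A continually updated collection of high-quality, interesting, and practical open-source technical tutorials, developer tools, programming websites, and tech news from GitHub. A list of cool, interesting projects on GitHub.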
A simple screen parsing tool towards a pure vision-based GUI agent
An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms from PP-Structure, modelscope, and others.
Organizes the best currently open-source table recognition models, improves pre-processing and post-processing, and converts the models to ONNX.
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation
Retrieval and Retrieval-augmented LLMs
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
chongzicbo / transformers
Forked from huggingface/transformers. 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🧑‍🚀 Summary of the world's best LLM resources (data processing, model training, model deployment, o1 models, small language models, vision-language models).
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Collection of AWESOME vision-language models for vision tasks
Curated list of datasets and tools for post-training.
Univer is a full-stack framework for creating and editing spreadsheets, documents, and slides on both web and server.