-
Soochow University
- Suzhou
- https://spico197.github.io/
Highlights
- Pro
-
-
-
-
LLaMA-MoE-v2 Public
Forked from OpenSparseLLMs/LLaMA-MoE-v2LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
Python Apache License 2.0 UpdatedDec 7, 2024 -
i7h-decoder Public
Decoder for i7h from https://github.com/RimoChan/i7h .
-
REx Public
🎮 A toolkit for Relation Extraction and more...
-
get-an-idea Public
Geting an idea from enormous paper repositories.
-
MoE-SFT Public
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
-
TextEE Public
Forked from ej0cl6/TextEEA standardized, fair, and reproducible benchmark for evaluating event extraction approaches
Python Apache License 2.0 UpdatedJul 2, 2024 -
Humback Public
🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.
-
-
-
Spico197.github.io Public
ZHU Tong's homepage.
-
-
-
nanotron Public
Forked from huggingface/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedApr 25, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedMar 12, 2024 -
transformers Public
Forked from huggingface/transformers🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedMar 11, 2024 -
llama-moe Public
Forked from pjlab-sys4nlp/llama-moe⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
Python Apache License 2.0 UpdatedFeb 26, 2024 -
mergekit Public
Forked from arcee-ai/mergekitTools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 UpdatedJan 9, 2024 -
smoe-eval Public
For smoe models evaluation. Commit: b281b0921b636bc36ad05c0b0b0763bd6dd43463
-
FastChat Public
Forked from lm-sys/FastChatAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python Apache License 2.0 UpdatedDec 25, 2023 -
opencompass Public
Forked from open-compass/opencompassOpenCompass is an LLM evaluation platform, supporting a wide range of models (LLaMA, LLaMa2, ChatGLM2, ChatGPT, Claude, etc) over 50+ datasets.
-
Mirror Public
🪞A powerful toolkit for almost all the Information Extraction tasks.
-
-
-
-
OpenBA Public
Forked from OpenNLG/OpenBAOpenBA: An Open-Sourced 15B Bilingual Asymmetric Seq2Seq Model Pre-trained from Scratch
Python Apache License 2.0 UpdatedSep 22, 2023 -
nanoT5 Public
Forked from PiotrNawrot/nanoT5Fast & Simple repository for pre-training and fine-tuning T5-style models
Python Apache License 2.0 UpdatedSep 10, 2023 -
DocEE Public
🕹️ A toolkit for document-level event extraction, containing some SOTA model implementations.