Highlights
- Pro
Stars
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
A bibliography and survey of the papers surrounding o1
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
LaTeX files for the Deep Learning book notation
A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper, "BooookScore: A systematic exploration of book-length summ…
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A simple Python wrapper for Slurm with flexibility in mind.
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
[NeurIPS2023] BoundaryDiffusion: A learning-free method for semantic control with Diffusion Models
[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
Training and serving large-scale neural networks with auto parallelization.
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
Traditional roguelike game with pixel-art graphics and simple interface
Discord bot for chess puzzles and information from https://lichess.org