View howard-yen's full-sized avatar

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,909 5,022 Updated Feb 18, 2025

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

HTML 18 Updated Jan 22, 2025

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Python 74 9 Updated Feb 12, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,154 50 Updated Nov 16, 2024

Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"

Python 153 5 Updated Dec 16, 2024

The HELMET Benchmark

Python 115 17 Updated Feb 18, 2025

Python 39 1 Updated Aug 10, 2024

LaTeX files for the Deep Learning book notation

TeX 1,726 360 Updated May 8, 2023

A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper, "BooookScore: A systematic exploration of book-length summ…

Python 115 8 Updated Oct 1, 2024

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,018 53 Updated Jan 16, 2025

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 819 55 Updated Feb 16, 2025

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 584 49 Updated Mar 4, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 139,662 28,009 Updated Feb 18, 2025

A simple Python wrapper for Slurm with flexibility in mind.

Python 129 19 Updated Feb 13, 2025

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 293 23 Updated Sep 9, 2024

[NeurIPS2023] BoundaryDiffusion: A learning-free method for semantic control with Diffusion Models

Python 34 2 Updated Nov 1, 2023

[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).

Python 156 10 Updated Apr 5, 2023

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Python 475 44 Updated Oct 9, 2024

Training and serving large-scale neural networks with auto parallelization.

Python 3,100 361 Updated Dec 9, 2023

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Python 1,751 307 Updated Apr 6, 2023

Traditional roguelike game with pixel-art graphics and simple interface

Java 3,638 1,211 Updated Jul 23, 2019

Discord bot for chess puzzles and information from https://lichess.org

Python 15 5 Updated Dec 14, 2024