Skip to content
View ehsk's full-sized avatar

Organizations

@castorini @beir-cellar @project-miracl

Block or report ehsk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Recipes to scale inference-time compute of open models

Python 1,033 104 Updated Feb 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,477 416 Updated Mar 8, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,087 227 Updated Feb 19, 2025

Fully open reproduction of DeepSeek-R1

Python 22,369 2,005 Updated Mar 8, 2025
Python 484 15 Updated Feb 27, 2025

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

Python 32 3 Updated Dec 3, 2024

A bibliography and survey of the papers surrounding o1

TeX 1,177 50 Updated Nov 16, 2024

The MATH Dataset (NeurIPS 2021)

Python 1,049 95 Updated Aug 5, 2024

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

Python 227 21 Updated Mar 7, 2025

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 176,540 26,014 Updated Mar 4, 2025

The official Meta Llama 3 GitHub site

Python 28,462 3,307 Updated Jan 26, 2025

A blazing fast inference solution for text embeddings models

Rust 3,268 224 Updated Mar 7, 2025

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Python 165 25 Updated Feb 17, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 583 73 Updated Mar 4, 2025

Code for Contrastive Preference Learning (CPL)

Python 162 15 Updated Nov 22, 2024

Firefly III: a personal finances manager

PHP 18,322 1,621 Updated Mar 8, 2025

Home of StarCoder2!

Python 1,881 169 Updated Mar 21, 2024

Easy and Efficient Quantization for Transformers

C++ 192 15 Updated Feb 7, 2025

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 275 57 Updated Sep 16, 2024

Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!

Python 11 1 Updated Jun 12, 2023

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,405 180 Updated Dec 5, 2024
Python 34 7 Updated Mar 29, 2024

utilities for decoding deep representations (like sentence embeddings) back to text

Python 771 87 Updated Jan 24, 2025

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 407 54 Updated Mar 7, 2025

List of papers on hallucination detection in LLMs.

789 63 Updated Mar 7, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,260 410 Updated Nov 18, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,695 137 Updated Sep 19, 2023

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,287 836 Updated Jun 10, 2024

Train transformer language models with reinforcement learning.

Python 12,323 1,665 Updated Mar 7, 2025
Next
Showing results