Ehsan ehsk

21 followers · 20 following

ServiceNow Research
Canada
https://ehsk.github.io
@ehsk0

Lists (2)

xACL

12 repositories

xML

5 repositories

Stars

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python · 1,033 stars · 104 forks · Updated Feb 25, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python · 4,477 stars · 416 forks · Updated Mar 8, 2025

hkust-nlp / simpleRL-reason

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python · 3,087 stars · 227 forks · Updated Feb 19, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python · 22,369 stars · 2,005 forks · Updated Mar 8, 2025

huggingface / Math-Verify

Python · 484 stars · 15 forks · Updated Feb 27, 2025

mnoukhov / async_rlhf

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

Python · 32 stars · 3 forks · Updated Dec 3, 2024

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX · 1,177 stars · 50 forks · Updated Nov 16, 2024

hendrycks / math

The MATH Dataset (NeurIPS 2021)

Python · 1,049 stars · 95 forks · Updated Aug 5, 2024

ServiceNow / TapeAgents

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

Python · 227 stars · 21 forks · Updated Mar 7, 2025

ohmyzsh / ohmyzsh

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell · 176,540 stars · 26,014 forks · Updated Mar 4, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python · 28,462 stars · 3,307 forks · Updated Jan 26, 2025

huggingface / text-embeddings-inference

A blazing fast inference solution for text embeddings models

Rust · 3,268 stars · 224 forks · Updated Mar 7, 2025

ServiceNow / WorkArena

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Python · 165 stars · 25 forks · Updated Feb 17, 2025

ServiceNow / BrowserGym

🌎💪 BrowserGym, a Gym environment for web task automation

Python · 583 stars · 73 forks · Updated Mar 4, 2025

jhejna / cpl

Code for Contrastive Preference Learning (CPL)

Python · 162 stars · 15 forks · Updated Nov 22, 2024

firefly-iii / firefly-iii

Firefly III: a personal finances manager

PHP · 18,322 stars · 1,621 forks · Updated Mar 8, 2025

bigcode-project / starcoder2

Home of StarCoder2!

Python · 1,881 stars · 169 forks · Updated Mar 21, 2024

NetEase-FuXi / EETQ

Easy and Efficient Quantization for Transformers

C++ · 192 stars · 15 forks · Updated Feb 7, 2025

AI-secure / DecodingTrust

A Comprehensive Assessment of Trustworthiness in GPT Models

Python · 275 stars · 57 forks · Updated Sep 16, 2024

MeteSertkan / ranger

Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!

Python · 11 stars · 1 fork · Updated Jun 12, 2023

EleutherAI / pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook · 2,405 stars · 180 forks · Updated Dec 5, 2024

nouhadziri / faith-and-fate

Python · 34 stars · 7 forks · Updated Mar 29, 2024

vec2text / vec2text

Utilities for decoding deep representations (like sentence embeddings) back to text

Python · 771 stars · 87 forks · Updated Jan 24, 2025

castorini / rank_llm

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python · 407 stars · 54 forks · Updated Mar 7, 2025

EdinburghNLP / awesome-hallucination-detection

List of papers on hallucination detection in LLMs.

789 stars · 63 forks · Updated Mar 7, 2025

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python · 3,260 stars · 410 forks · Updated Nov 18, 2024

lorenzkuhn / semantic_uncertainty

Python · 157 stars · 22 forks · Updated Jun 20, 2024

anthropics / hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,695 stars · 137 forks · Updated Sep 19, 2023

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook · 10,287 stars · 836 forks · Updated Jun 10, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python · 12,323 stars · 1,665 forks · Updated Mar 7, 2025