JAX-Based Off-Policy RL Algorithms

This repository contains a JAX-based implementation of various off-policy reinforcement learning (RL) algorithms, focusing on leveraging JAX for efficiency.

Features

Efficient JAX Implementation: Optimized for speed and performance.
Clean and Simple Code: Designed for clarity and ease of understanding.
Comparison with PyTorch: Includes benchmarks comparing training speed against PyTorch implementations.

Implemented Algorithms

TD7
SALE-TQC : SALE Representation (TD7) + TQC
SIMBA

Learning Curves

With 5 seeds experiment, 95% Confidence interval

Getting Started

# Clone the repository
git clone https://github.com/seungju-k1m/jax-offpolicy-rl.git
cd jax-offpolicy-rl

# Install dependencies
rye sync

Usage

rye run python cli.py td7 --env-id Humanoid-v4 --save-path "save/TD7" --seed 1 --use-progressbar

Results

The repository includes scripts to visualize learning curves and compare training efficiency.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
doc		doc
scripts		scripts
src		src
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
cli.py		cli.py
pyproject.toml		pyproject.toml
requirements-dev.lock		requirements-dev.lock
requirements.lock		requirements.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JAX-Based Off-Policy RL Algorithms

Features

Implemented Algorithms

Learning Curves

Getting Started

Usage

Results

About

Releases

Packages

Languages

seungju-k1m/jax-offpolicy-rl

Folders and files

Latest commit

History

Repository files navigation

JAX-Based Off-Policy RL Algorithms

Features

Implemented Algorithms

Learning Curves

Getting Started

Usage

Results

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages