Haiku-Baseline

Haiku-Baseline is a set of Reinforcement Learning implementations in the style of the Baselines libraries, built on top of the JAX and Haiku libraries.

It has not been benchmarked against any Baselines implementation yet, but it runs two to three times faster than the PyTorch and TensorFlow versions we implemented previously. Using JAX's JIT (Just-In-Time) compilation, we optimized the training computations and structured them as functions. This shows how JAX's capabilities can be used effectively in various Reinforcement Learning implementations.
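As a rough illustration (not this repository's actual code), a DQN-style update in this setting can be written as a pure function and wrapped in `jax.jit`, so the loss, gradients, and optimizer step are compiled once and reused every training step. The network shape, hyperparameters, and batch layout below are assumptions made for the sketch.

```python
import jax
import jax.numpy as jnp
import haiku as hk
import optax

n_actions = 4                 # assumed action count for the sketch
optimizer = optax.adam(2e-4)  # assumed optimizer / learning rate

def q_net(obs):
    # Hypothetical Q-network; the networks in this repo may be structured differently.
    return hk.nets.MLP([512, n_actions])(obs)

q_fn = hk.without_apply_rng(hk.transform(q_net))

@jax.jit  # the whole update (loss, gradients, optimizer step) is compiled once
def train_step(params, target_params, opt_state, batch):
    def loss_fn(p):
        q = q_fn.apply(p, batch["obs"])
        q_a = jnp.take_along_axis(q, batch["action"][:, None], axis=1)[:, 0]
        q_next = q_fn.apply(target_params, batch["next_obs"]).max(axis=1)
        target = batch["reward"] + 0.995 * (1.0 - batch["done"]) * q_next
        return jnp.mean((q_a - jax.lax.stop_gradient(target)) ** 2)

    loss, grads = jax.value_and_grad(loss_fn)(params)
    updates, opt_state = optimizer.update(grads, opt_state)
    params = optax.apply_updates(params, updates)
    return params, opt_state, loss
```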

This implementation is designed to work flexibly with the commonly used Gym and Unity ML-Agents environments, so that algorithms can be tested in a variety of complex environments.

Implemented Environments

| Name                    | Q-Net based | Actor-Critic based | DDPG based |
| ----------------------- | ----------- | ------------------ | ---------- |
| Gym                     | ✔️          | ✔️                 | ✔️         |
| MultiworkerGym with Ray | ✔️          | ✔️                 | ✔️         |
| Unity-ML Environments   | ✔️          | ✔️                 | ✔️         |
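For the "MultiworkerGym with Ray" row, the usual pattern is to run each Gym environment in its own Ray actor and step them in parallel. The sketch below is a minimal, hypothetical illustration of that pattern, assuming the classic Gym `reset`/`step` API; the class and environment names are assumptions, not this repository's API.

```python
import gym
import ray

ray.init()

@ray.remote
class EnvWorker:
    """Hypothetical rollout worker holding one Gym environment."""

    def __init__(self, env_id):
        self.env = gym.make(env_id)
        self.obs = self.env.reset()

    def step(self, action):
        # classic Gym step API: (obs, reward, done, info)
        obs, reward, done, info = self.env.step(action)
        if done:
            obs = self.env.reset()
        self.obs = obs
        return obs, reward, done

# Four environments stepped in parallel; actions would normally come from the learner.
workers = [EnvWorker.remote("CartPole-v1") for _ in range(4)]
transitions = ray.get([w.step.remote(0) for w in workers])
```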

Implemented Algorithms

| Name             | Complete | Box | Discrete | PER | N-step | NoisyNet | Munchausen |
| ---------------- | -------- | --- | -------- | --- | ------ | -------- | ---------- |
| DQN              | ✔️       |     | ✔️       | ✔️  | ✔️     | ✔️       | ✔️         |
| C51              | ✔️       |     | ✔️       | ✔️  | ✔️     | ✔️       | ✔️         |
| QRDQN            | ✔️       |     | ✔️       | ✔️  | ✔️     | ✔️       | ✔️         |
| IQN              | ✔️       |     | ✔️       | ✔️  | ✔️     | ✔️       | ✔️         |
| FQF              | ✔️       |     | ✔️       | ✔️  | ✔️     | ✔️       | ✔️         |
| A2C              | ✔️       | ✔️  | ✔️       |     |        |          |            |
| TRPO             | TODO     |     |          |     |        |          |            |
| PPO              | ✔️       | ✔️  | ✔️       |     |        |          |            |
| ACER             | TODO     |     |          |     |        |          |            |
| DDPG             | ✔️       | ✔️  |          | ✔️  | ✔️     |          |            |
| SAC              | ✔️       | ✔️  |          | ✔️  | ✔️     |          |            |
| TD3              | ✔️       | ✔️  |          | ✔️  | ✔️     |          |            |
| TQC              | ✔️       | ✔️  |          | ✔️  | ✔️     |          |            |
| Ape-X-Qnets      | TODO     |     |          |     |        |          |            |
| Ape-X-DDPG based | TODO     |     |          |     |        |          |            |
| IMPALA           | TODO     |     |          |     |        |          |            |
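As an example of one of the table's extensions, the Munchausen column refers to Munchausen RL (Vieillard et al., 2020), which adds a scaled, clipped log-policy bonus to the reward and bootstraps with a soft (entropy-regularized) next-state value. Below is a hedged sketch of that target in plain JAX; the argument names and default coefficients are assumptions, not this repository's exact code.

```python
import jax.numpy as jnp
from jax.nn import log_softmax, softmax

def munchausen_target(q_t, q_tp1, action, reward, done,
                      gamma=0.99, alpha=0.9, tau=0.03, l0=-1.0):
    """Sketch of the Munchausen-DQN target for a batch of transitions.

    q_t, q_tp1: target-network Q-values at s_t and s_{t+1}, shape [batch, actions].
    """
    # Munchausen bonus: clipped, scaled log-policy of the action actually taken.
    log_pi_t = log_softmax(q_t / tau, axis=-1)
    taken = jnp.take_along_axis(log_pi_t, action[:, None], axis=-1)[:, 0]
    bonus = jnp.clip(tau * taken, l0, 0.0)

    # Soft bootstrap value: E_pi[ q(s', a') - tau * log pi(a'|s') ].
    log_pi_tp1 = log_softmax(q_tp1 / tau, axis=-1)
    pi_tp1 = softmax(q_tp1 / tau, axis=-1)
    next_v = jnp.sum(pi_tp1 * (q_tp1 - tau * log_pi_tp1), axis=-1)

    return reward + alpha * bonus + gamma * (1.0 - done) * next_v
```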

Test

To test Atari with DQN (or C51, QRDQN, IQN, FQF):

python run_qnet.py --algo DQN --env BreakoutNoFrameskip-v4 --learning_rate 0.0002 \
		--steps 5e5 --batch 32 --train_freq 1 --target_update 1000 --node 512 \
		--hidden_n 1 --final_eps 0.01 --learning_starts 20000 --gamma 0.995 --clip_rewards

About 15 minutes is enough to run 500K steps of DQN learning Atari Breakout (roughly 540 steps/sec). This performance was measured on an NVIDIA RTX 3080 and an AMD Ryzen 9 5950X in a single process.

score : 9.600, epsilon : 0.010, loss : 0.181 |: 100%|███████| 500000/500000 [15:24<00:00, 540.88it/s]
