Vectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light
Vectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light
Learn to vectorize a GridWorld environment and train 30 Q-learning agents in parallel on a CPU, at 1.8 million step per seconds!