Robust Adversarial Model-Based Offline Reinforcement Learning (RAMBO)

June 2, 2023 · View on GitHub

Official code to reproduce the experiments for RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning. This implementation builds upon the code for MOPO.

For a PyTorch implementation of RAMBO, please see OfflineRL-Kit.

Installation

Install MuJoCo 2.1.0 to ~/.mujoco/mujoco210.
Create a conda environment and install RAMBO.

cd rambo
conda create --name rambo python=3.7
conda activate rambo
pip install -e .
pip install -r requirements.txt

Usage

Configuration files can be found in examples/config/. For example, to run the hopper-medium task from the D4RL benchmark, use the following.

rambo run_example examples.development --config examples.config.rambo.mujoco.hopper_medium --seed 0 --gpus 1

By default, TensorBoard logs are generated in the "logs" directory. The code is also set up to log using Weights and Biases (WandB). To enable the use of WandB, set "log_wandb" to True in the configuration file.

Citing RAMBO

@article{rigter2022rambo,
  title={RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning},
  author={Rigter, Marc and Lacerda, Bruno and Hawes, Nick},
  journal={Advances in Neural Information Processing Systems},
  year={2022}
}

Installation

Usage

Logging

Citing RAMBO