Prompting Decision Transformer for Few-Shot Policy Generalization

Official code repository for Prompt-DT. [website][paper]

Prompt-DT Architecture:

Installation

We tested the code on Ubuntu 20.04.

We recommend using Anaconda to create a virtual environment.

conda create --name prompt-dt python=3.8.5
conda activate prompt-dt

Our experiments require MuJoCo as well as mujoco-py. Install them by following the instructions in the mujoco-py repo.
Install environments and dependencies with the following commands:

# install dependencies
pip install -r requirements.txt

# install environments
./install_envs.sh

We log experiments with wandb. Check out the wandb quickstart doc to create an account.
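
Before launching a run, it can help to confirm that the key packages are visible to the active conda environment. The snippet below is a minimal sketch, not part of the repository: the helper name `missing_modules` is hypothetical, and the import names `mujoco_py` and `wandb` are assumptions about what the steps above install. It only looks the modules up and does not import them, so it will not catch a broken MuJoCo build.

```python
import importlib.util

# Assumed import names for the packages mentioned above (not taken from the repo).
REQUIRED_MODULES = ["mujoco_py", "wandb"]


def missing_modules(names):
    """Return the top-level module names that cannot be found on the import path."""
    missing = []
    for name in names:
        # find_spec locates a module without executing it.
        if importlib.util.find_spec(name) is None:
            missing.append(name)
    return missing


if __name__ == "__main__":
    not_found = missing_modules(REQUIRED_MODULES)
    if not_found:
        print("Missing modules:", ", ".join(not_found))
    else:
        print("All required modules were found.")
```

Run it inside the `prompt-dt` environment; any module it reports as missing points back to the corresponding installation step.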

Download Datasets

We share example datasets via this Google Drive link.
Download the "data" folder manually, or fetch and unpack it from the command line:

wget -O data.zip 'https://drive.google.com/uc?export=download&id=1rZufm-XRq1Ig-56DejkQUX1si_WzCGBe&confirm=True' 
unzip data.zip
rm data.zip

Organize folders as follows.

.
├── config
├── data
│   ├── ant_dir
│   ├── cheetah_dir
│   ├── cheetah_vel
│   └── ML1-pick-place-v2
├── envs
├── prompt_dt
└── ...
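
The layout above can be checked programmatically after unzipping. The following is a small sketch under stated assumptions: the helper `missing_dataset_dirs` is hypothetical and not part of the repository, and it assumes each dataset listed in the tree is a directory directly under `data`.

```python
from pathlib import Path

# Dataset folder names copied from the tree above.
EXPECTED_DATASETS = ["ant_dir", "cheetah_dir", "cheetah_vel", "ML1-pick-place-v2"]


def missing_dataset_dirs(repo_root):
    """Return the expected dataset folders that are absent under <repo_root>/data."""
    data_dir = Path(repo_root) / "data"
    return [name for name in EXPECTED_DATASETS if not (data_dir / name).is_dir()]


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    missing = missing_dataset_dirs(".")
    if missing:
        print("Missing dataset folders:", ", ".join(missing))
    else:
        print("All dataset folders are in place.")
```

An empty result means the four dataset folders sit where the tree shows them.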

Run Experiments

# Prompt-DT
python pdt_main.py --env cheetah_dir # choices:['cheetah_dir', 'cheetah_vel', 'ant_dir', 'ML1-pick-place-v2']

# Prompt-MT-BC
python pdt_main.py --no-rtg --no-r

# MT-ORL
python pdt_main.py --no-prompt

# MT-BC-Finetune
python pdt_main.py --no-prompt --no-rtg --no-r --finetune
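
To run every method on a chosen environment, the commands above can be generated from a small table of flags. This is a sketch only: `build_command` and `VARIANT_FLAGS` are hypothetical names, and combining `--env` with the baseline flags is an assumption, since the commands above show `--env` only for Prompt-DT.

```python
import shlex

# Environment choices and flags are copied from the commands above.
ENVS = ["cheetah_dir", "cheetah_vel", "ant_dir", "ML1-pick-place-v2"]
VARIANT_FLAGS = {
    "Prompt-DT": [],
    "Prompt-MT-BC": ["--no-rtg", "--no-r"],
    "MT-ORL": ["--no-prompt"],
    "MT-BC-Finetune": ["--no-prompt", "--no-rtg", "--no-r", "--finetune"],
}


def build_command(variant, env):
    """Return the argument list for one method on one environment."""
    if env not in ENVS:
        raise ValueError(f"unknown environment: {env}")
    if variant not in VARIANT_FLAGS:
        raise ValueError(f"unknown variant: {variant}")
    # Assumption: --env may be combined with any of the baseline flags.
    return ["python", "pdt_main.py", "--env", env, *VARIANT_FLAGS[variant]]


if __name__ == "__main__":
    # Print the commands rather than executing them.
    for variant in VARIANT_FLAGS:
        print(shlex.join(build_command(variant, "cheetah_dir")))
```

The script only prints the command lines, so they can be reviewed before being pasted into a shell or passed to `subprocess.run`.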

Acknowledgements

The code for prompt-dt is based on decision-transformer. We build environments based on repos including macaw, rand_param_envs, and metaworld.

References

If you find our code helpful for your research, please consider citing the paper!

@inproceedings{xu2022prompting,
  title={Prompting Decision Transformer for Few-Shot Policy Generalization},
  author={Xu, Mengdi and Shen, Yikang and Zhang, Shun and Lu, Yuchen and Zhao, Ding and Tenenbaum, Joshua and Gan, Chuang},
  booktitle={International Conference on Machine Learning},
  pages={24631--24645},
  year={2022},
  organization={PMLR}
}

Contributions

Suggestions for enhancing and improving the code are welcome. Please email mengdixu@andrew.cmu.edu with comments and suggestions.
