RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents

June 19, 2024

This repository contains the implementation of RAP.

RAP figure

Get Started

Please refer to the README for each benchmark.

Citation

If you find RAP helpful in your research, please consider citing it.

@misc{kagaya2024rap,
      title={RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents}, 
      author={Tomoyuki Kagaya and Thong Jing Yuan and Yuxuan Lou and Jayashree Karlekar and Sugiri Pranata and Akira Kinose and Koki Oguri and Felix Wick and Yang You},
      year={2024},
      eprint={2402.03610},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

License

MIT License