Reflection-Reinforced Self-Training for Language Agents

June 20, 2024 ยท View on GitHub

Overview

  • HotpotQA code in hotpotqa
  • GQA code in gqa

Citation

@article{rerest,
  title={Reflection-Reinforced Self-Training for Language Agents},
  author={Dou, Zi-Yi and Yang, Cheng-Fu and Wu, Xueqing and Chang, Kai-Wei and Peng, Nanyun},
  journal={arXiv preprint},
  year={2024}
}