Lay Summary Optimization with RLHF
May 4, 2024 ยท View on GitHub
Lay summary optimization with biomistral using PPO on metrics
Task URL: https://biolaysumm.org/
Tech Stack: pdm, pytorch lightning, unsloth, huggingface, mistral, trl, wandb, github actions
Setup/Install
Run make
Train
Run pdm start
Running python scripts
Run pdm run <script path>, e.g. pdm run src/burrito/main.py or pdm run jupyter lab
Adding dependencies
Run pdm add <package name>, e.g. pdm add torch or pdm add "git+https://github.com/Dao-AILab/flash-attention"
Running linters/formatters
Linter: pdm run lint
Formatters: pdm run lint-format