OpenAI Grok Curve Experiments
March 16, 2024 ยท View on GitHub
Paper
This is the code for the paper Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets by Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, and Vedant Misra
Installation and Training
pip install -e .
./scripts/train.py