OpenAI Grok Curve Experiments

March 16, 2024

Paper

This is the code for the paper "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" by Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, and Vedant Misra.

Installation and Training

pip install -e .
./scripts/train.py

Contents

  1. Paper
  2. Installation and Training