Tuning the Learning Rate with Alpha

October 20, 2021 · View on GitHub

This experiment performs a grid search for the learning rate on four different problems with Cockpit tracking. Afterward, we can plot the median Alpha value of each run versus the final performance (measured as test accuracy). We can observe that the best performing learning rate is not the one with a median Alpha closest to zero. Perhaps surprisingly it is consistently a run with a median Alpha larger than zero.

Note that the run files require additional data sets (MNIST, FMNIST, SVHN, CIFAR-10) to be downloaded, which they will do automatically.

Median Alpha vs Performance

Run experiment: python run_XX.py to start the grid search on one of the four problems. This will create a results directory with four subfolders for the problems. Alternatively, extract the results.zip to use our original results: unzip results.zip.
Plot results: python plot.py.
Clean up or start over: bash clean.sh (removes results)