Extra reads

February 23, 2021 ยท View on GitHub

ML advance topics

PaperWhy
Sampling techniquesStratifield sampling is popular.
A Comparative Study of Efficient Initialization Methods for the k-means Clustering Algorithm and kmeans initialization, CourseraFundamentals about kemans, favourite topics in interviews i.e: LinkedIn
A Comprehensive Survey of Clustering AlgorithmsClustering fundamentals
A Tutorial on Spectral ClusteringSepctral clustering is intuitive and quite popular.
Partial residual plotUseful for model diagnosis.
Compare GINI index and Information GainIntuition behind Decision Tree, RandomForest
Explain tf-idfFundamentals about tf-idf.
Understanding L-BFGSAdvance about optimization, rarely asked in interview
Optimizer Quasi newton methodAdvance about optimization.

DL classic papers

PaperWhy
Understanding the Difficulty of Training Deep Feedforward Neural NetworksClassic paper (2010) about initialization, sigmoid etc
Delving Deep into Rectifiers - Surpassing Human-Level Performance on ImageNet ClassificationClassic paper (2015) about ReLU, PReLU
Batch Normalization - Accelerating Deep Network Training by Reducing Internal Covariate ShiftClassic paper about BatchNorm
Dropout - A Simple Way to Prevent Neural Networks from OverfittingClassic paper about Dropout
Deep Residual Learning for Image RecognitionClassic ResNet
On Large-Batch Training for Deep Learning - Generalization Gap and Sharp MinimaPractical technique for large batch training

DL advance topics

PaperWhy
Calibration in modern neural networkImportant topics in ML system design i.e: facebook
Attention modelFundamentals in Attention, powerful architecture in NLP
Ilya's thesisNetwork in network

NLP

PaperWhy
Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language ModelClassic paper (2008) in NLP
Natural Language Processing (Almost) from ScratchClassic paper (2011) in NLP
Word2vecThe classic paper in NLP, still popular in industry: Uber, DoorDash, Twitter etc
GloVe - Global Vectors for Word RepresentationClassic paper (2014) in NLP
Bag of Tricks for Efficient Text ClassificationCool tricks in NLP tasks
BERT - Pre-training of Deep Bidirectional Transformers for Language UnderstandingThe famous BERT
Smart Reply - Automated Response Suggestion for EmailNLP application, useful for ML system design
Enriching Word Vectors with Subword InformationSimple and fast method to train NLP task in Facebook
Neural Approaches to Conversational AIComprehensive survey (2018) about chatbots

Recent breakthrough

Notes

  • If you're interested to learn more about paid ML system design course on educative.io with more examples, click here.
  • If you find this helpful, you can Sponsor this project. It's cool if you don't.