Feature Selection for Machine Learning

October 7, 2019 ยท View on GitHub

This repository contains the code for three main methods in Machine Learning for Feature Selection i.e. Filter Methods, Wrapper Methods and Embedded Methods. All code is written in Python 3.

Status: Ongoing

Requirements

1. Python 3.5 +

2. Jupyter Notebook

3. Scikit-Learn

4. Numpy [+mkl for Windows]

5. Pandas

6. Matplotlib

7. Seaborn

8. mlxtend

Datasets

1. Santander Customer Satisfaction Dataset

2. BNP Paribas Cardif Claims Management Dataset

3. Titanic Disaster Dataset

4. Housing Prices Dataset

Filter Methods

S.No.NameAboutStatus
1.Constant Feature EliminationThis notebook explains how to remove the constant features during pre-processing step.Completed
2.Quasi-Constant Feature EliminationThis notebook explains how to get the Quasi-Constant features and remove them during pre-processing.Completed
3.Duplicate Features EliminationThis notebook explains how to find the duplicate features in a dataset and remove them.Completed
4.CorrelationThis notebook explains how to get the correlation between features and between features and target and choose the best features.Completed
5.Machine Learning PipelineThis notebook explains how to use all the above methods in a ML pipeline with performance comparison.Completed
6.Mutual InformationThis notebook explains the concept of Mutual Information using classification and Regression to find the best features from a dataset.Completed
7.Fisher Score Chi SquareThis notebook explains the concept of Fisher Score chi2 for feature selection.Completed
8.Univariate Feature SelectionThis notebook explains the concept of Univariate Feature Selection using Classification and Regression.Completed
9.Univariate ROC/AUC/MSEThis notebook explains the concept of Univariate Feature Selection using ROC AUC scoring.Completed
10.Combining all MethodsThis notebook compares the combined performance of all methods explained.Completed

Wrapper Methods

S.No.NameAboutStatus
1.Step Forward Feature SelectionThis notebook explains the concept of Step Forward Feature Selection.Completed
2.Step Backward Feature SelectionThis notebook explains the concept of Step Backward Feature Selection.Completed
3.Exhaustive Search Feature SelectionThis notebook explains the concept of Exhaustive Search Feature Selection.Completed

Embedded Methods

S.No.NameAboutStatus