Datasets

June 28, 2022 ยท View on GitHub

The datasets supported by the AI Explainability 360 toolkit are listed below. Please follow the instructions for each dataset.

  1. FICO HELOC Dataset

    Source: https://community.fico.com/s/explainable-machine-learning-challenge

    Download instructions: See download form. After download, place the file heloc_dataset.csv in the folder aix360/data/heloc_data/

  2. Medical Expenditure Panel Survey Data

    Source: https://meps.ahrq.gov/mepsweb/

    Download instructions: See meps_data/README.md

  3. NHANES CDC dataset

    Source: https://wwwn.cdc.gov/nchs/nhanes/search/datapage.aspx?Component=Questionnaire&CycleBeginYear=2013

    No download required, appropriate dataset class downloads the dataset from web.

  4. Fashion MNIST dataset

    source: https://github.com/zalandoresearch/fashion-mnist

    No download required, appropriate dataset class downloads the dataset from web.

  5. MNIST dataset

    Source: http://yann.lecun.com/exdb/mnist/

    No download required, appropriate dataset class downloads the dataset from web.

  6. CIFAR dataset

    Source: https://www.cs.toronto.edu/~kriz/cifar.html

    No download required, appropriate dataset class downloads the dataset from web.

  7. CelebA dataset

    Source: http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html

    No download required, appropriate dataset class downloads the dataset from web.

  8. Synthetic dataset to predict employee attrition

    Source: Generated aix360/data/ted_data

    No download required, appropriate python code generates the data.

  9. e-SNLI dataset

    Source: https://www.eraserbenchmark.com/zipped/esnli.tar.gz

    Follow download instructions in the [nbviewer]