Ensuring More Accurate, Generalisable, and Interpretable Machine Learning Models for Bioinformatics

November 20, 2025 · View on GitHub

Ensuring More Accurate, Generalisable, and Interpretable Machine Learning Models for Bioinformatics

This repository regroups works on the intermediate machine learning SIB course "Ensuring More Accurate, Generalisable, and Interpretable Machine Learning Models for Bioinformatics".

pre-requisites

The course is targeted to life scientists who are already familiar with the Python programming language and have a good grasp of Machine Learning concepts such as K-fold cross-validation, grid-search for hyper-parameter tuning, or tree models.

The course uses jupyter notebooks to go through examples and exercises. See the intructions on installing prerequisite libraries to help you stup you environment for the course.

We advocate the use of conda environment for tidy management of the libraries needed for the course, but the participant may use other methods as long as they are able to make them work.

course organization

The course is organized in several "chapters" where the theory is covered in slides, and jupyter notebooks interleave code demo, and exercises.

directory structure

data : contains the datasets
notebooks : contains the code demo and exercise notebooks
slides : pptx / pdf of the course theory

cite us

Feel free to re-use and adapt this material for your own purposes.

We only ask that you cite us:

Duchemin, W., Müller, M., & Tran, V. D. (2025, November 17). Course material for the SIB course: Intermediate Machine Learning - Ensuring More Accurate, Generalisable, and Interpretable Machine Learning Models for Bioinformatics. Zenodo. https://doi.org/10.5281/zenodo.17657949