Flexynesis
June 1, 2026 · View on GitHub
This repository contains CLAUDE.md and AGENTS.md — instruction files that turn a supported AI coding assistant into an interactive guide. It walks you from zero to a trained multi-omics deep learning model, without reading any documentation or writing any code.
You only need:
- mamba (or conda / pip)
- Claude Code or opencode (both have free tiers)
The AI installs flexynesis, downloads a dataset, trains the model, and produces plots — all from a conversation.
Quickstart
git clone https://github.com/BIMSBbioinfo/flexynesis-guide.git
cd flexynesis-guide
Then choose your AI assistant:
Option A — Claude Code
1. Install Claude Code
curl -fsSL https://claude.ai/install.sh | bash
2. Authorise
Run claude once and follow the browser-based login prompt.
3. Start the guided session
claude "let's get started with Flexynesis!"
Claude reads CLAUDE.md and immediately begins.
Option B — opencode
1. Install opencode
See https://opencode.ai for installation instructions.
2. Start opencode in this directory
opencode
3. Start the guided session
Once the opencode prompt is open, type:
let's get started with Flexynesis!
opencode reads AGENTS.md and begins the same guided workflow.
What the AI guide will do
| Step | What happens |
|---|---|
| 1 | Installs flexynesis via pip into a fresh mamba environment |
| 2 | Asks what biology you want to explore, then suggests a matching dataset |
| 3 | Downloads and extracts the dataset; inspects available clinical variables |
| 4 | Proposes concrete modelling tasks (regression, classification, survival, or a mix) |
| 5 | Recommends a model architecture and explains the tradeoffs |
| 6 | Runs training with hyperparameter optimisation |
| 7 | Reads the output files and generates PCA plots, feature importance charts, Kaplan-Meier curves, and a performance table |
You make the choices; the AI does the typing.
Available datasets
The AI guide can download any of these ready-to-use benchmark datasets or fetch any study directly from cBioPortal.
| Dataset | Biology | What you can predict |
|---|---|---|
| Cancer drug response (CCLE/GDSC) | Cell line pharmacogenomics | IC50/AUC for 8 compounds — regression |
| LGG + GBM (TCGA glioma) | Brain tumours | Survival, tumour type, performance score |
| METABRIC breast cancer | Breast tumours | Molecular subtype, chemotherapy, survival |
| MSI status | Microsatellite instability | MSI-H vs MSS — binary classification |
| Bone marrow single-cell RNA | Single-cell biology | Cell type labels — unsupervised / classification |
| Any cBioPortal study | Your choice | Depends on clinical metadata |
Example session
$ claude "let's get started with Flexynesis!"
# or: opencode → "let's get started with Flexynesis!"
I see you've opened this repository. Let me get you started with flexynesis.
Step 1/7 — Checking installation...
flexynesis not found. Installing into a new mamba environment...
✓ flexynesis 1.2.3 installed
Step 2/7 — Choose a dataset
What kind of biological question interests you?
a) Cancer drug response (cell lines)
b) Brain tumour survival and subtyping ← LGG + GBM, TCGA
c) Breast cancer molecular subtypes ← METABRIC
d) Microsatellite instability (MSI)
e) Single-cell unsupervised analysis
f) Fetch a custom study from cBioPortal
> b
Downloading lgggbm_tcga_pub_processed...
556 training samples · 238 test samples
Modalities: mutations (mut), copy number alterations (cna)
Step 3/7 — Clinical variables found:
HISTOLOGICAL_DIAGNOSIS categorical (4 tumour subtypes)
AGE numeric
KARNOFSKY_PERFORMANCE_SCORE numeric
OS_STATUS + OS_MONTHS survival pair ✓
SEX categorical
Suggested tasks:
Classify tumour subtype + predict survival + regress performance score
(multi-task — flexynesis handles the loss balancing automatically)
> looks good
Step 4/7 — Model: DirectPred | Fusion: intermediate
Step 5/7 — Training...
flexynesis --data_path lgggbm_tcga_pub_processed \
--model_class DirectPred \
--target_variables HISTOLOGICAL_DIAGNOSIS,KARNOFSKY_PERFORMANCE_SCORE \
--surv_event_var OS_STATUS --surv_time_var OS_MONTHS \
--data_types mut,cna --hpo_iter 20 --outdir results --prefix lgg_run
Step 6/7 — Results:
HISTOLOGICAL_DIAGNOSIS AUROC = 0.94 balanced_accuracy = 0.81
KARNOFSKY_PERFORMANCE_SCORE pearson_r = 0.43
Survival (OS) c_index = 0.71
Step 7/7 — Plots saved:
pca_embeddings.png
top_markers_HISTOLOGICAL_DIAGNOSIS.png
top_markers_OS.png
kaplan_meier.png
Top survival markers: IDH1, TP53, ATRX
(Well-established glioma drivers — the model learned real biology.)
What next?
a) Try a different model (supervised_vae, GNN, MultiTripletNetwork ...)
b) Run more HPO steps for better performance
c) Fetch a different dataset from cBioPortal
d) Run inference on new samples with this model
About flexynesis
Flexynesis is a deep learning toolkit for multi-omics data integration and clinical outcome prediction. It supports fully connected networks, variational autoencoders, graph convolutional networks, and triplet-loss models, with automated feature selection, hyperparameter optimisation, and integrated-gradient marker discovery.
- Documentation: https://bimsbstatic.mdc-berlin.de/akalin/buyar/flexynesis/site/getting_started/
- Benchmark results: https://bimsbstatic.mdc-berlin.de/akalin/buyar/flexynesis-benchmark-datasets/dashboard.html
- Source code: https://github.com/BIMSBbioinfo/flexynesis
- Paper: Uyar et al., Nature Communications 2025. https://doi.org/10.1038/s41467-025-63688-5
If you use flexynesis in your research, please cite the paper above.