Flexynesis

June 1, 2026 · View on GitHub

This repository contains CLAUDE.md and AGENTS.md — instruction files that turn a supported AI coding assistant into an interactive guide. It walks you from zero to a trained multi-omics deep learning model, without reading any documentation or writing any code.

You only need:

The AI installs flexynesis, downloads a dataset, trains the model, and produces plots — all from a conversation.


Quickstart

git clone https://github.com/BIMSBbioinfo/flexynesis-guide.git
cd flexynesis-guide

Then choose your AI assistant:


Option A — Claude Code

1. Install Claude Code

curl -fsSL https://claude.ai/install.sh | bash

2. Authorise

Run claude once and follow the browser-based login prompt.

3. Start the guided session

claude "let's get started with Flexynesis!"

Claude reads CLAUDE.md and immediately begins.


Option B — opencode

1. Install opencode

See https://opencode.ai for installation instructions.

2. Start opencode in this directory

opencode

3. Start the guided session

Once the opencode prompt is open, type:

let's get started with Flexynesis!

opencode reads AGENTS.md and begins the same guided workflow.


What the AI guide will do

StepWhat happens
1Installs flexynesis via pip into a fresh mamba environment
2Asks what biology you want to explore, then suggests a matching dataset
3Downloads and extracts the dataset; inspects available clinical variables
4Proposes concrete modelling tasks (regression, classification, survival, or a mix)
5Recommends a model architecture and explains the tradeoffs
6Runs training with hyperparameter optimisation
7Reads the output files and generates PCA plots, feature importance charts, Kaplan-Meier curves, and a performance table

You make the choices; the AI does the typing.


Available datasets

The AI guide can download any of these ready-to-use benchmark datasets or fetch any study directly from cBioPortal.

DatasetBiologyWhat you can predict
Cancer drug response (CCLE/GDSC)Cell line pharmacogenomicsIC50/AUC for 8 compounds — regression
LGG + GBM (TCGA glioma)Brain tumoursSurvival, tumour type, performance score
METABRIC breast cancerBreast tumoursMolecular subtype, chemotherapy, survival
MSI statusMicrosatellite instabilityMSI-H vs MSS — binary classification
Bone marrow single-cell RNASingle-cell biologyCell type labels — unsupervised / classification
Any cBioPortal studyYour choiceDepends on clinical metadata

Example session

$ claude "let's get started with Flexynesis!"
# or: opencode → "let's get started with Flexynesis!"

I see you've opened this repository. Let me get you started with flexynesis.

Step 1/7 — Checking installation...
  flexynesis not found. Installing into a new mamba environment...
  ✓ flexynesis 1.2.3 installed

Step 2/7 — Choose a dataset
  What kind of biological question interests you?
    a) Cancer drug response (cell lines)
    b) Brain tumour survival and subtyping     ← LGG + GBM, TCGA
    c) Breast cancer molecular subtypes        ← METABRIC
    d) Microsatellite instability (MSI)
    e) Single-cell unsupervised analysis
    f) Fetch a custom study from cBioPortal

> b

  Downloading lgggbm_tcga_pub_processed...
  556 training samples · 238 test samples
  Modalities: mutations (mut), copy number alterations (cna)

Step 3/7 — Clinical variables found:
  HISTOLOGICAL_DIAGNOSIS    categorical  (4 tumour subtypes)
  AGE                       numeric
  KARNOFSKY_PERFORMANCE_SCORE  numeric
  OS_STATUS + OS_MONTHS     survival pair  ✓
  SEX                       categorical

  Suggested tasks:
    Classify tumour subtype  +  predict survival  +  regress performance score
    (multi-task — flexynesis handles the loss balancing automatically)

> looks good

Step 4/7 — Model: DirectPred  |  Fusion: intermediate

Step 5/7 — Training...
  flexynesis --data_path lgggbm_tcga_pub_processed \
    --model_class DirectPred \
    --target_variables HISTOLOGICAL_DIAGNOSIS,KARNOFSKY_PERFORMANCE_SCORE \
    --surv_event_var OS_STATUS --surv_time_var OS_MONTHS \
    --data_types mut,cna --hpo_iter 20 --outdir results --prefix lgg_run

Step 6/7 — Results:
  HISTOLOGICAL_DIAGNOSIS     AUROC = 0.94   balanced_accuracy = 0.81
  KARNOFSKY_PERFORMANCE_SCORE  pearson_r = 0.43
  Survival (OS)              c_index = 0.71

Step 7/7 — Plots saved:
  pca_embeddings.png
  top_markers_HISTOLOGICAL_DIAGNOSIS.png
  top_markers_OS.png
  kaplan_meier.png

  Top survival markers: IDH1, TP53, ATRX
  (Well-established glioma drivers — the model learned real biology.)

What next?
  a) Try a different model (supervised_vae, GNN, MultiTripletNetwork ...)
  b) Run more HPO steps for better performance
  c) Fetch a different dataset from cBioPortal
  d) Run inference on new samples with this model

About flexynesis

Flexynesis is a deep learning toolkit for multi-omics data integration and clinical outcome prediction. It supports fully connected networks, variational autoencoders, graph convolutional networks, and triplet-loss models, with automated feature selection, hyperparameter optimisation, and integrated-gradient marker discovery.

If you use flexynesis in your research, please cite the paper above.