SPAE

October 22, 2024 ยท View on GitHub

Introduction

SPAE is a novel computational framework designed to analyze cell cycle dynamics and cell type using single-cell RNA sequencing (scRNA-seq) data. SPAE employs an autoencoder that integrates both nonlinear and piecewise linear components, it uniquely utilizes sine and cosine functions within the decoder to accurately fit the periodicity of the cell cycle, thus facilitating precise pseudotime estimation. Moreover, SPAE incorporates a piecewise linear regression model to predict cell types. We rigorously assessed SPAE's efficacy in estimating cell cycle pseudotime and determining cell stage classifications. SPAE is applied to several downstream analyses and applications to demonstrate its ability to accurately estimate the cell cycle pseudotime and stages.

Requirements

Ensure you have the following dependencies installed:

  • numpy == 1.23.5
  • pandas == 2.0.3
  • scikit-learn == 1.3.0
  • keras == 2.13.1
  • tensorflow == 2.13.0
  • matplotlib == 3.7.2

You can install these dependencies using:

pip install numpy==1.23.5 pandas==2.0.3 scikit-learn==1.3.0 keras==2.13.1 tensorflow==2.13.0 matplotlib==3.7.2

Estimate cell cycle pseudotime(example: mESCs_Quartz data)

run mESCs_Quartz.py

Cell cycle pseudotime to cell cycle stages

Rscript pseudotime_to_label.r 

Cell cycle effect removal(example: breast_cancer data)

run breast_cancer.py