Single Cell Genomics Library Structure

April 2, 2026 · View on GitHub

Collections of library structure and sequence of popular single cell genomic methods (mainly scRNA-seq).

Before you start

Make sure you understand the basic configuration of the Illumina libraries, because most single cell sequencing methods are developed to be sequenced on the Illumina platforms. If you are not familiar with the Illumina sequencing libraries, click here to check some general information about Illumina library structures and the nature of library preparation.

The HTML pages listed below contain step-by-step procedures of how the libraries are generated experimentally. For the computational preprocessing pipelines for each method, please see this accompanying ReadTheDocs documentation. For the machine-readable format of the library structure, check seqspec.

How to use?

Click the following links to view the methods. Notes:

  1. Index1 (i7) is always sequenced using the bottom strand as template, regardless of the Illumina machine in use. That is why the index sequences are reverse complementary to the primer sequences.
  2. IMPORTANT: In a dual-index library, how index2 (i5) is sequenced differs from machines to machines. According to the Index Sequencing Guide from Illumina, Miseq, Hiseq2000/2500, MiniSeq (Rapid) and NovaSeq 6000 (v1.0) use the bottom strand as template (Forward Strand Workflow), which is why the index sequences are the same as the primer sequences in those machines. iSeq 100, MiniSeq, NextSeq, HiSeq X, HiSeq 3000/4000 and NovaSeq 6000 (v1.5) use the top strand as template (Reverse Complement Workflow), which is why the index sequences are reverse-complementary to the primer sequences in those machines. All methods listed below use iSeq 100, MiniSeq (Standard), NextSeq, HiSeq X, HiSeq 3000/4000 and NovaSeq 6000 (v1.5) as examples, because this configuration is more frequently used nowadays.

scRNA-seq technical comparisons

The basic chemistry is very similar, the main differences among those scRNA-seq methods are summarised in the table below. For a detailed discussion, check the text boxes from our review: From Tissues to Cell Types and Back: Single-Cell Gene Expression Analysis of Tissue Architecture

Single cell isolation/captureWhere RT happens2nd strand synthesisFull-length cDNA synthesisBarcode additionPooling before libraryLibrary amplificationGene coverage
10x Chromium Single Cell 3'DropletIn dropletsTSOYesBarcoded RT primersYesPCR3'
10x Chromium Single Cell 5'DropletIn dropletsTSOYesBarcoded TSO primersYesPCR5'
BD RhapsodyNanowellsIn collection tubesRandom priming and primer extensionNoBarcoded RT primersYesPCR3'
CEL-seq/CEL-seq2FACSIn 96w/384w wellsRNase H and DNA pol INoBarcoded RT primersYesIn vitro transcription3'
Drop-seqDropletIn collection tubesTSOYesBarcoded RT primersYesPCR3'
Illumina Bio-Rad SureCell 3' WTADropletIn dropletsRNase H and DNA pol INoBarcoded RT primersYesPCR3'
ddSEQ Single-Cell 3' RNA-SeqDropletIn dropletsRNase H and DNA pol INoBarcoded RT primersYesPCR3'
inDropDropletIn dropletsRNase H and DNA pol INoBarcoded RT primersYesIn vitro transcription3'
MARS-seq/MARS-seq2.0FACSIn 96w/384w wellsRNase H and DNA pol INoBarcoded RT primersYesIn vitro transcription3'
Microwell-seqNanowellsIn collection tubesTSOYesBarcoded RT primersYesPCR3'
Quartz-seqFACSIn 96w/384w wellsPolyA tailing and primer ligationYes in principleLigation of barcoded Truseq adaptersNoPCR3'
Quartz-seq2FACSIn 96w/384w wellsPolyA tailing and primer ligationYes in principleBarcoded RT primersYesPCR3'
sci-RNA-seqNot neededIn situRNase H and DNA pol INoBarcoded RT primers and library PCR with barcoded primersYesPCR3'
sci-RNA-seq3Not neededIn situRNase H and DNA pol INoBarcoded RT primers and hairpin adaptersYesPCR3'
scifi-RNA-seqDroplet multiple cellsIn situTSOYesBarcoded RT primers and gel bead barcodesYesPCR3'
SCRB-seq/mcSCRB-seqFACSIn 96w/384w wellsTSOYesBarcoded RT primersYesPCR3'
Seq-WellNanowellsIn collection tubesTSOYesBarcoded RT primersYesPCR3'
Seq-Well S3NanowellsIn collection tubesRandom priming and primer extensionNoBarcoded RT primersYesPCR3'
SMART-seq/SMART-seq2/SMART-seq3FACS or Fluidigm C1In 96w/384w wellsTSOYesLibrary PCR with barcoded primersNoPCRfull-length
SPLiT-seqNot neededIn situTSOYesLigation of barcoded RT primersYesPCR3'
STRT-seqFACSIn 96w/384w wellsTSOYesBarcoded TSO primersYesPCR5'
STRT-seq-C1Fluidigm C1In microfluidic chambersTSOYesBarcoded Tn5 transposaseNoPCR5'
STRT-seq-2iFACS or dilutionIn 9600w wellsTSOYesBarcoded PCR primers and Tn5 transposaseYesPCR5'
Tang 2009FACS or manualIn 96w/384w wellsPolyA tailing and primer extensionYes in principleLigation of barcoded adaptorsNoPCRBiased to 3'

scATAC-seq technical comparisons

This is basically Table 1 from our scATAC-seq protocol: A plate-based single-cell ATAC-seq workflow for fast and robust profiling of chromatin accessibility

Tn5 and adaptorsStaring cell numberTagmentationSingle-cell/nucleus isolationLibrary amplificationBarcode additionThroughput
sci-ATAC-seq/snATAC-seqCustom-made500,000+BulkFACS or dilutionPCRTn5 + PCR barcodes10,000
scTHS-seqCustom-made500,000+BulkFACS or dilutionIVT and PCRTn5 + PCR barcodes10,000
Plate_scATAC-seq and Pi-ATAC-seqNextera5,000+BulkFACSPCRPCR barcodes1,000
Fluidigm C1Nextera4,000-20,000Single cellsMicrofluidicsPCRPCR barcodes100
Takara ICELL8Nextera16,000Single cellsMicrofluidicsPCRPCR barcodes1,000
10x Chromium Single Cell ATACNextera800-15,000BulkDropletsPCRPCR barcodes10,000
Bio-Rad dscATAC-seqNextera60,000+BulkDropletsPCRPCR barcodes10,000
Bio-Rad dsciATAC-seqCustom-made600,000+BulkDropletsPCRTn5 + PCR barcodes100,000

Motivation

I was a little bit bombarded with all the single cell methods and got completely lost. To help myself understand all of them and future troubleshooting, I start to perform an on-paper library preparation whenever I see a new single cell method.

Why bother?

Here I borrow from Feyman:

What I cannot create, I do not understand.


Citation

If you find this repository useful and would like to cite this resource, please consider citing this repo and the seqspec preprint together:

@misc{xi_chen_teichlabscg_lib_structs_2023,
	title = {Teichlab/scg\_lib\_structs: {Release} 26th {Oct} 2023},
	copyright = {Creative Commons Attribution 4.0 International},
	shorttitle = {Teichlab/scg\_lib\_structs},
	url = {https://zenodo.org/doi/10.5281/zenodo.10042390},
	abstract = {This is the first release to get a DOI so that people can cite the repo.},
	urldate = {2023-10-26},
	publisher = {Zenodo},
	author = {Xi Chen and Patrick Roelli and Darío Hereñú and Pontus Höjer and Tim Stuart},
	month = oct,
	year = {2023},
	doi = {10.5281/ZENODO.10042390},
}

@article{booeshaghi.pachter.Bioinformatics2024,
  title = {A Machine-Readable Specification for Genomics Assays},
  author = {Booeshaghi, Ali Sina and Chen, Xi and Pachter, Lior},
  editor = {Kendziorski, Christina},
  year = {2024},
  month = mar,
  journal = {Bioinformatics},
  volume = {40},
  number = {4},
  pages = {btae168},
  issn = {1367-4811},
  doi = {10.1093/bioinformatics/btae168},
  urldate = {2024-05-01},
  abstract = {Motivation: Understanding the structure of sequenced fragments from genomics libraries is essential for accurate read preprocessing. Currently, different assays and sequencing technologies require custom scripts and programs that do not leverage the common structure of sequence elements present in genomics libraries.},
  copyright = {https://creativecommons.org/licenses/by/4.0/},
  langid = {english}
}

Feedback

I would be very happy if you go through them and let me know what you think. If you spot some errors/mistakes, or I've missed some key methods. Feel free to raise an issue in the GitHub repository, or contact me directly:

Xi Chen
chenx9@sustech.edu.cn