RASE - Resistance-Associated Sequence Elements

January 2, 2020 ยท View on GitHub

This repository contains the core RASE package. Other components of the software include the RASE pipeline and the RASE DB skeleton. For more information, see the RASE paper and the RASE supplementary materials.

Content

Introduction

This repository contains the core RASE software for rapid inference of antibiotic resistance and susceptibility using genomic neighbor typing.

Installation

  1. Setup an environment

  2. Install RASE using PIP

    pip install --upgrade git+https://github.com/c2-d2/rase
    

File formats

Prediction output (timeline)

Tab-separated text file with the following columns:

columndescription
datetimedatetime of the read
readsnumber of processed reads
kbpsnumber of processed bps (thousands)
kkmersnumber of matched k-mers (thousands)
ksk-mer score (proportion of matched k-mers)
lslineage score
ls_passlineage score interpretation, 1=passing 0=failing
ln, alt_lnpredicted and alternative lineage
bm, bm_{feature}best-matching strain (nearest neighbor) and its features
{ant}_sssusceptibility score for the antibiotic {ant}
{ant}_predprediction (score interpretation): S=susceptible, R=non-susceptible, S! and R!=low confidence calls
{ant}_bmresistance information for the best match, format: cat (mic)

See an example file.

Prediction output (snapshot)

Tab-separated text file with the following columns:

columndescription
taxidtaxid of a database strain, _unassigned_ for reads without any k-mer matches with the database
lineagelineage
weightweight (cumulative "number of k-mers matches divided by the the number of best matches")
weight_normnormalized weight
lengthcumulative "read length divided by number of matches"
countcumulative "count divided by number of matches"

See an example file.

RASE DB metadata

Tab-separated text file with the following columns:

columndescription
taxidtaxid of a database strain
lineagelineage
orderorder for plotting
{feature}arbitrary features (e.g., serotype or MLST)
{ant}_micoriginal MIC string (e.g., <0.03)
{ant}_intextracted MIC interval (0-0.03)
{ant}_catresistance category (S/R/s/r)

See an example file. Metadata files are generated in dedicated repositories (see RASE DB skeleton, S. pneumoniae RASE DB, and N. gonorrhoeae RASE DB).

RASE/BAM

File following the SAM/BAM specification, containing the following information:

fielddescription
QNAMEread name (ideally with the timestamp encoded)
FLAG4 for unassigned, 0 otherwise
RNAMEstrain taxid or name of an internal node
ln:i:read length
h1:i:number of k-mer matches at this node
  • RASE supplementary. Supplementary Materials for the RASE paper, including figures, tables, experiments, and links to other related repositories.
  • RASE DB skeleton. Skeleton for creating novel RASE databases.

License

MIT.

Contact

Karel Brinda <kbrinda@hsph.harvard.edu>