Slinker: Visualising novel splicing events in RNA-Seq data

F1000Res. 2021 Dec 7:10:1255. doi: 10.12688/f1000research.74836.1. eCollection 2021.

Abstract

Visualisation of the transcriptome relative to a reference genome is fraught with sparsity. This is due to RNA sequencing (RNA-Seq) reads being predominantly mapped to exons that account for just under 3% of the human genome. Recently, we have used exon-only references, superTranscripts, to improve visualisation of aligned RNA-Seq data through the omission of supposedly unexpressed regions such as introns. However, variation within these regions can lead to novel splicing events that may drive a pathogenic phenotype. In these cases, the loss of information in only retaining annotated exons presents significant drawbacks. Here we present Slinker, a bioinformatics pipeline written in Python and Bpipe that uses a data-driven approach to assemble sample-specific superTranscripts. At its core, Slinker uses Stringtie2 to assemble transcripts with any sequence across any gene. This assembly is merged with reference transcripts, converted to a superTranscript, of which rich visualisations are made through Plotly with associated annotation and coverage information. Slinker was validated on five novel splicing events of rare disease samples from a cohort of primary muscular disorders. In addition, Slinker was shown to be effective in visualising deletion events within transcriptomes of tumour samples in the important leukemia gene, IKZF1. Slinker offers a succinct visualisation of RNA-Seq alignments across typically sparse regions and is freely available on Github.

Keywords: Novel Splicing Events; RNA-Seq; Visualisation; bioinformatics; superTranscripts.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Exons / genetics
  • Humans
  • RNA Splicing* / genetics
  • RNA*
  • RNA-Seq
  • Sequence Analysis, RNA

Substances

  • RNA

Grants and funding

This work was supported by the NHMRC under project grant APP1140626.