Slinker: Visualising novel splicing events in RNA-Seq data

Breon Schmidt; Marek Cmero; Paul Ekert; Nadia Davidson; Alicia Oshlack

doi:10.12688/f1000research.74836.1

Slinker: Visualising novel splicing events in RNA-Seq data

F1000Res. 2021 Dec 7:10:1255. doi: 10.12688/f1000research.74836.1. eCollection 2021.

Authors

Breon Schmidt^{1

2}, Marek Cmero^{1

3

4}, Paul Ekert^{1

3

5

6

7}, Nadia Davidson^{1

2

4}, Alicia Oshlack^{1

2

3}

Affiliations

¹ Peter MacCallum Cancer Centre, Melbourne, Victoria, 3000, Australia.
² School of BioScience, University of Melbourne, Melbourne, Victoria, 3000, Australia.
³ Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, Victoria, 3000, Australia.
⁴ Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia.
⁵ Murdoch Children's Research Institute, Parkville, Victoria, 3052, Australia.
⁶ Children's Cancer Institute, Lowy Cancer Centre, Kensington, New South Wales, 2033, Australia.
⁷ School of Women's and Children's Health, University of New South Wales, Sydney, Victoria, 2052, Australia.

Abstract

Visualisation of the transcriptome relative to a reference genome is fraught with sparsity. This is due to RNA sequencing (RNA-Seq) reads being predominantly mapped to exons that account for just under 3% of the human genome. Recently, we have used exon-only references, superTranscripts, to improve visualisation of aligned RNA-Seq data through the omission of supposedly unexpressed regions such as introns. However, variation within these regions can lead to novel splicing events that may drive a pathogenic phenotype. In these cases, the loss of information in only retaining annotated exons presents significant drawbacks. Here we present Slinker, a bioinformatics pipeline written in Python and Bpipe that uses a data-driven approach to assemble sample-specific superTranscripts. At its core, Slinker uses Stringtie2 to assemble transcripts with any sequence across any gene. This assembly is merged with reference transcripts, converted to a superTranscript, of which rich visualisations are made through Plotly with associated annotation and coverage information. Slinker was validated on five novel splicing events of rare disease samples from a cohort of primary muscular disorders. In addition, Slinker was shown to be effective in visualising deletion events within transcriptomes of tumour samples in the important leukemia gene, IKZF1. Slinker offers a succinct visualisation of RNA-Seq alignments across typically sparse regions and is freely available on Github.

Keywords: Novel Splicing Events; RNA-Seq; Visualisation; bioinformatics; superTranscripts.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Exons / genetics
Humans
RNA Splicing* / genetics
RNA*
RNA-Seq
Sequence Analysis, RNA

Substances

RNA

Grants and funding

This work was supported by the NHMRC under project grant APP1140626.