PacBio single-molecule long-read sequencing provides new insights into the complexity of full-length transcripts in oriental river prawn, macrobrachium nipponense

BMC Genomics. 2023 Jun 20;24(1):340. doi: 10.1186/s12864-023-09442-x.

Abstract

Background: Oriental river prawn (Macrobrachium nipponense) is one of the most dominant species in shrimp farming in China, which is a rich source of protein and contributes to a significant impact on the quality of human life. Thus, more complete and accurate annotation of gene models are important for the breeding research of oriental river prawn.

Results: A full-length transcriptome of oriental river prawn muscle was obtained using the PacBio Sequel platform. Then, 37.99 Gb of subreads were sequenced, including 584,498 circular consensus sequences, among which 512,216 were full length non-chimeric sequences. After Illumina-based correction of long PacBio reads, 6,599 error-corrected isoforms were identified. Transcriptome structural analysis revealed 2,263 and 2,555 alternative splicing (AS) events and alternative polyadenylation (APA) sites, respectively. In total, 620 novel genes (NGs), 197 putative transcription factors (TFs), and 291 novel long non-coding RNAs (lncRNAs) were identified.

Conclusions: In summary, this study offers novel insights into the transcriptome complexity and diversity of this prawn species, and provides valuable information for understanding the genomic structure and improving the draft genome annotation of oriental river prawn.

Keywords: Alternative polyadenylation; Alternative splicing; Long non-coding RNA; Novel genes; Oriental river prawn; SMRT sequencing.

MeSH terms

  • Alternative Splicing
  • Animals
  • Gene Expression Profiling
  • Humans
  • Palaemonidae* / genetics
  • Protein Isoforms / genetics
  • Transcriptome

Substances

  • Protein Isoforms