Alternative splicing enhances protein diversity in different ways, including through exonization of transposable elements (TEs). Recent transcriptomic analyses identified thousands of unannotated spliced transcripts with exonizing TEs, but their contribution to the proteome and their biological relevance remain unclear. Here, we use transcriptome assembly, ribosome profiling, and proteomics to describe a population of 1,227 unannotated TE-exonizing isoforms generated by mRNA splicing and recurrent in human populations. Despite being shorter and expressed at low levels, these isoforms are shared between individuals and efficiently translated. Functional analyses show stable expression, specific cellular localization, and, in some cases, modified functions. Exonized TEs are rich in ancient genes, whereas the involved splice sites are recent and can be evolutionarily conserved. In addition, exonized TEs contribute to the secondary structure of the emerging isoforms, supporting their functional relevance. We conclude that TE-spliced isoforms represent a diversity reservoir of functional proteins on which natural selection can act.
Keywords: cryptic splice sites; exon birth; non-canonical proteome; protein evolution; protein isoforms; protein structure; proteomics; ribosome profiling; transposable elements; unannotated splicing.
Copyright © 2024 The Authors. Published by Elsevier Inc. All rights reserved.