• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Analysis of Splicing Aberrations in Human Tissue Transcriptomes

Student: Mironov Aleksei

Supervisor: Dmitry Davidovich Pervouchine

Faculty: Faculty of Computer Science

Educational Programme: Data Analysis for Biology and Medicine (Master)

Year of Graduation: 2018

Recent advances in genomic studies and high-throughput sequencing have uncovered massive and heterogeneous repertoire of RNAs produced by alternative splicing. Although many alternative splicing isoforms are annotated and present in public databases, a growing number of unannotated isoforms is reported in each study. While some of them can represent natural variation of splicing across tissues and individuals, i.e., be splicing noise, others might be associated with disease. Alternative splicing patterns, even considered as noise, are expected to preserve reading frame in protein-coding RNA sequences and thus can be a valuable indirect source of information about translation of a transcript. In particular, micropeptides are a class of transcripts that are being extensively studied now because they were mistakenly annotated as non-coding transcripts, but in fact they contain short open reading frames. In this work we study splicing aberrations, defined as shifts of splice sites from the major splice site to a nearby unannotated splice site by not more than 30 nucleotides. We analyze a large panel of human RNA-seq data from Genotype-Tissue Expression Project (GTEx) and find distinct patterns of splicing aberrations in protein-coding and in non-coding regions. As expected, shifts by a multiple of three nucleotides preserving the coding frame are more frequent in coding regions compared to non-coding regions. Contrary to expectations, there are fundamental differences in the propensity of reading-frame-preserving aberrations among donor and acceptor splice sites, as well as different preferences in the upstream and the downstream shifts. Overall, the discovered patterns are consistent with previous observations and provide novel insights about splicing process, its output, and error rates.

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses