Alt-Splice Gene Predictor Using Multitrack-Clique Analysis: Verification of Statistical Support for Modelling in Genomes of Multicellular Eukaryotes

Journal Title: Informatics - Year 2017, Vol 4, Issue 1

Abstract

One of the main limitations of the typical hidden Markov model (HMM) implementation for gene structure identification is that a single structure is identified on a given sequence of genomic data—i.e., identification of overlapping structure is not directly possible, and certainly not possible within the confines of the optimal Viterbi path evaluation. This is a huge limitation given that we now know that significant portions of eukaryotic genomes, particularly mammalian genomes, are alternatively spliced, and, thus, have overlapping structure in the sense of the mRNA transcripts that result. Using the general meta-state HMM approach developed in prior work, however, more than one ‘track’ of annotation can be accommodated, thereby allowing a direct implementation of an alternative-splice gene-structure identifier. In this paper we examine the representation of alternative splicing annotation in the multi-track context, and show that the proliferation on states is manageable, and has sufficient statistical support on the genomes examined (human, mouse, worm, and fly) that a full alt-splice meta-state HMM gene finder can be implemented with sufficient statistical support. In the process of performing the alternative splicing analysis on alt-splice event counts we expected to see an increase in alternative splicing complexity as the organism becomes more complex, and this is seen with the percentage of genes with alt-splice variants increasing from worm to fly to the mammalian genomes (mouse and human). Of particular note is an increase in alternative splicing variants at the start and end of coding with the more complex organisms studied (mouse and human), indicating rapid new first and last exon recruitment that is possibly spliceosome mediated. This suggests that spliceosome-mediated refinements (acceleration) of gene structure variation and selection, with increasing levels of sophistication, has occurred in eukaryotes and in mammals especially.

Authors and Affiliations

Stephen Winters-Hilt and Andrew J. Lewis

Keywords

Related Articles

The Socio-Economic Evaluation of a European Project: The DIYLab Case

This paper builds on the results of a 3-year long European project, the main aim of which was to deeply and sustainably transform teaching and learning practice in primary and secondary schools and higher education, by...

Human–Information Interaction with Complex Information for Decision-Making

Human–information interaction (HII) for simple information and for complex information is different because people’s goals and information needs differ between the two cases. With complex information, comprehension com...

Health Literacy for the General Public: Making a Case for Non-Trivial Visualizations

Health literacy is concerned with the degree to which individuals can access and understand information to make health decisions. The multifaceted nature of health data presents challenges for individuals seeking to im...

Embracing First-Person Perspectives in Soma-Based Design

A set of prominent designers embarked on a research journey to explore aesthetics in movement-based design. Here we unpack one of the design sensitivities unique to our practice: a strong first person perspective—where...

Preferences of Informal Carers on Technology Packages to Support Meal Production by People Living with Dementia, Elicited from Personalised AT and ICT Product Brochures

Assistive technology (AT) can help support the continued independence of people living with dementia, supported by informal carers. Opinions and preferences of informal carers towards a range of assistive and digital i...

Download PDF file
  • EP ID EP44074
  • DOI https://doi.org/10.3390/informatics4010003
  • Views 245
  • Downloads 0

How To Cite

Stephen Winters-Hilt and Andrew J. Lewis (2017). Alt-Splice Gene Predictor Using Multitrack-Clique Analysis: Verification of Statistical Support for Modelling in Genomes of Multicellular Eukaryotes. Informatics, 4(1), -. https://europub.co.uk./articles/-A-44074