Applying Back Propagation Algorithm for classification of fragile genome sequence

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5

Abstract

Abstract : Most frequently occurring recurrent chromosomal translocation allied with all subtype of leukemia are available in Mitel Mann Data base. We have retrieved about 55 such genome sequence from TIC dB database with 100% similarity score and got noncoding sequence of chromosome 9 and 22 as positive example of fragile site. Another 55 housekeeping genome sequence is taken for classification purpose. For content based analysis we have extracted 20 features of frequency density of mono nucleotide and dinucleotide. The network is designed by determining hyper parameters like number of hidden layer, hidden neurons and input features. Firstwe took 20 input features and there after 16 for reducing number of free parameters (i.e. weight space). Network is also pruned for succeeding experiments. The training strategy was also exhaustively explored, basedon literature study and trial and error heuristic methods to achieve more and more accuracy. Regularization is also employed by cross validation and early stopping. We have achieved 95% accuracy for training data and 70% to test data in first experiment. To avoid this over fitting at last we could achieve 93% over all accuracy and outlier detection, too. We could be able to show that dinucleotide frequency density is important statistical feature for classifying genome sequence. This classifier can show the probability of fragility to occur in genome sequence at very early stage so as to deal with the diesis at prognosis phase.

Authors and Affiliations

Medha Patel , Dr. Devarshi Mehta , Dr. Patrick Patterson , Dr. Rakesh Rawal

Keywords

Related Articles

 Identifying Threats Associated With Man-In-The-Middle Attacks during Communication between a Mobile Device and the Back End Server in Mobile Banking Applications

 Mobile banking, sometimes referred to as M-Banking, Mbanking or SMS Banking, is a term used for performing balance checks, account transactions, payments, credit applications and other banking transactions throug...

 Wavelet Based Features for Defect Detection in Fabric using Genetic Algorithm

 Abstract: In this paper a new scheme is proposed for Fabric defect detection in textile industry. For this purpose, wavelet transformer is used as feature extractor of coefficients of fabric. These coefficients c...

 Route maintenance and Scalability improvement of DSR, based on Relay node identification after locating Link-failure over MANET

 Abstract: In Dynamic Source Routing, each source determines the route to be used in transmitting its packets to destination. Route Discovery determines the optimum path for a transmission between a given source and...

Prevention and Detection of Wormhole Attack in Mobile Adhoc Network Using Clustering and RTT

Abstract: A security constraint in mobile adhoc network is very critical task. Some critical security issue such as black hole attack, wormhole attack, sinkhole attack, prevention and detection of attack is major challen...

Opinion Mining Method for Sentiment Analysis

Abstract: We are living in a world full of data. Every passing second, large data is generated by Social Media, ECommerce, Stock Exchange and many other platforms. Now-a-days, microblogging sites are used for manypurpose...

Download PDF file
  • EP ID EP133682
  • DOI -
  • Views 90
  • Downloads 0

How To Cite

Medha Patel, Dr. Devarshi Mehta, Dr. Patrick Patterson, Dr. Rakesh Rawal (2016). Applying Back Propagation Algorithm for classification of fragile genome sequence. IOSR Journals (IOSR Journal of Computer Engineering), 18(5), 1-10. https://europub.co.uk./articles/-A-133682