An Effective Identification of Species from DNA Sequence: A Classification Technique by Integrating DM and ANN

Abstract

Species classification from DNA sequences remains as an open challenge in the area of bioinformatics, which deals with the collection, processing and analysis of DNA and proteomic sequence. Though incorporation of data mining can guide the process to perform well, poor definition, and heterogeneous nature of gene sequence remains as a barrier. In this paper, an effective classification technique to identify the organism from its gene sequence is proposed. The proposed integrated technique is mainly based on pattern mining and neural network-based classification. In pattern mining, the technique mines nucleotide patterns and their support from selected DNA sequence. The high dimension of the mined dataset is reduced using Multilinear Principal Component Analysis (MPCA). In classification, a well-trained neural network classifies the selected gene sequence and so the organism is identified even from a part of the sequence. The proposed technique is evaluated by performing 10-fold cross validation, a statistical validation measure, and the obtained results prove the efficacy of the technique.

Authors and Affiliations

Sathish S, Dr. N. Duraipandian

Keywords

Related Articles

The SVM Classifier Based on the Modified Particle Swarm Optimization

The problem of development of the SVM classifier based on the modified particle swarm optimization has been considered. This algorithm carries out the simultaneous search of the kernel function type, values of the kernel...

Automation of Combinatorial Interaction Test (CIT) Case Generation and Execution for Requirements based Testing (RBT) of Complex Avionics Systems

In the field of avionics, most of the software systems are either safety critical or mission critical. These systems are developed with high quality standards strictly following the relevant guidelines and procedures. Du...

Intelligent Transportation System (ITS) for Smart-Cities using Mamdani Fuzzy Inference System

It is estimated that more than half of the world population lives in cities according to (UN forecasts, 2014), so cities are vital. Cities, as we all know facing with complex challenges – for smart cities the outdated tr...

A Correlation based Approach to Differentiate between an Event and Noise in Internet of Things

Internet of Things (IoT) is considered a huge enhancement in the field of information technology. IoT is the integration of physical devices which are embedded with electronics, software, sensors, and connectivity that a...

Real-Time Digital Image Exposure Status Detection and Circuit Implementation

Auto exposure is an important part of digital image signal processing. We studied the detection of the exposure status in this paper, and fast and parallel detection method was presented. The method comprises the followi...

Download PDF file
  • EP ID EP145799
  • DOI -
  • Views 98
  • Downloads 0

How To Cite

Sathish S, Dr. N. Duraipandian (2012). An Effective Identification of Species from DNA Sequence: A Classification Technique by Integrating DM and ANN. International Journal of Advanced Computer Science & Applications, 3(8), 104-114. https://europub.co.uk./articles/-A-145799