Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 2

Abstract

The phylogenomic classification of protein sequences attempts to categorize a given protein within the evolutionary context of the entire family. It involves mainly four steps: selection of homologous sequences, multiple sequence alignment, phylogenetic tree construction and tree-based classification. This supposes that the tree used as a basis of protein classification is correct. Sequence alignment is the first step for tree construction. Thus, the accuracy of the alignment produced should affect the topology of the phylogenetic tree. This work proposes a kNN tree-based algorithm for protein classification, namely Tree-kNN, which uses a phylogenetic tree estimated from pair-wise and multiple alignment approaches. We compare the classification performance of Tree-kNN with an existing method, called TreeNN. Results show that Tree-kNN gives better results than TreeNN. Based on four datasets we show that classification performances of the two algorithms using pair-wise alignment are better than using multiple alignment

Authors and Affiliations

Khaddouja Boujenfa , Nadia Essoussi , Mohamed Limam

Keywords

Related Articles

WebParF:A Web Partitioning Framework for Parallel Crawler

With the ever proliferating size and scale of the WWW [1], efficient ways of exploring content are of increasing importance. How can we efficiently retrieve information from it through crawling? And in this “era of tera”...

Purchase Decision for ATUR Broadband Network

Along with the booming of telecommunication, Internet web has become a vital media in the current global communication and the communication quality of Internet has been emphasized. To the client user, the Internet ATUR...

Design Patterns: A Resource for Reverse Engineering

Design patterns are gaining popularity because they support odifiability and flexibility of designs. Design patterns are olutions to frequently recurring problems in design. Reverse engineering of source code primaril...

TOWARDS AN AGENT-BASED CUSTOMER KNOWLEDGE MANAGEMENT SYSTEM (ABCKMS) IN E-COMMERCE ORGANIZATIONS

Till date, e-commerce organizations still have competency challenges in Customer Knowledge Management (CKM). Organizations need to develop competencies in all aspects of CKM, from understanding who their customers really...

Context Ontology Construction For Cricket Video

Content based video retrieval systems are not complete in semantic sense. To improve the efficiency and the effectiveness of the retrieval system the content based retrieval systems must be equipped with the semantic bas...

Download PDF file
  • EP ID EP160512
  • DOI -
  • Views 151
  • Downloads 0

How To Cite

Khaddouja Boujenfa, Nadia Essoussi, Mohamed Limam (2011). Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification. International Journal on Computer Science and Engineering, 3(2), 961-968. https://europub.co.uk./articles/-A-160512