An Investigation on Topic Maps Based Document Classification with Unbalance Classes

Journal Title: Journal of Independent Studies and Research - Computing - Year 2015, Vol 13, Issue 1

Abstract

Classification of imbalanced data has become a widespread problem due to the fact that the most real world datasets are imbalanced. In a classification task, one of the challenges is to learn the feature-space of classification under class-imbalance setting. The majority classes generally have good representation of features in the learned classification function and the minority classes lack this representation; subsequently, the classification for these classes failed more often. In this paper, authors investigate the task of document classification with topic map based representation of documents under class imbalance setting. In order to measure of topic-map based representation for classification under imbalance data, authors compare three representations: Bag-ofWords, Phrases and Topic terms for three approaches (i) under-sampling, (ii) cost-adjusting, and (iii) cluster based sampling. A series of experiments are carried out and results are reported.

Authors and Affiliations

Keywords

Related Articles

Performance Comparison of NOSQL Database Cassandra and SQL Server for Large Databases.

The performance comparison of NoSQL database and a Relational Database Management Systems has been done to identify which database responds faster to specific types of requests and suitability of these databases for diff...

Extracting Key Sentences from Text

Automatic key sentence extraction from a text is a challenging task. It has numerous applications in text processing systems. The actual task of key sentence extraction consists of three main functionalities: (i) Identif...

Improving Query Response Time for Graph Data Using Materialization

Graphs are used in many disciplines, from communication networks, biological, social networks includ- ing maths and other fields of science. This is the latest and most important field of computer science today. In this...

A Review and Comparison of the Traditional Collaborative and Online Collaborative Techniques for Requirement Elicitation

Requirement elicitation is one of the major phases of the software development life cycle. As per authors knowledge, among many reviews, there is no review available on a comparison between Online Collaborative Requireme...

Implementation of Adaptive Control Algorithm to Overcome the Traffic Congestion Problems of Karachi

Traffic controlling and management is a severe issue of urban cities as well as on high ways in developing countries like South Asian countries but here particularly, in Pakistan. The traffic congestion problem is becomi...

Download PDF file
  • EP ID EP643241
  • DOI 10.31645/jisrc/(2015).13.1.0007
  • Views 149
  • Downloads 0

How To Cite

(2015). An Investigation on Topic Maps Based Document Classification with Unbalance Classes. Journal of Independent Studies and Research - Computing, 13(1), 50-56. https://europub.co.uk./articles/-A-643241