Performance Analysis of Classification Learning Methods on Large Dataset using two Data Mining Tools

Journal Title: Journal of Independent Studies and Research - Computing - Year 2015, Vol 13, Issue 2

Abstract

Data volumes are growing daily, so processing this data and selecting the right method and tool is a significant problem. Computer scientists process and analyse data with different machine learning methods, using various data mining tools, to obtain high accuracy and a minimal model-building time. Several data analysis and processing tools, such as WEKA, RapidMiner and KEEL, are available for processing, analysis and modelling, yet no single tool is perfect or preferred for data processing and analysis. In this context, the authors present a comparative, analytical study of the performance of different classification algorithms (Naïve Bayes, KNN, IBK, Random Forest, C4.5 and J48) and of two data mining tools, WEKA and RapidMiner, on a large dataset, evaluating their performance and analytical results at a low cost of error. The Adult Income dataset from the UCI data repository is used for this study. The aim of the study is to evaluate and assess the range of performance of different machine learning methods and two diverse data mining tools on dissimilar datasets. The results for each classification method and data mining tool are analysed and presented at the end.

  • DOI 10.31645/jisrc/(2015).13.2.0005

How To Cite

(2015). Performance Analysis of Classification Learning Methods on Large Dataset using two Data Mining Tools. Journal of Independent Studies and Research - Computing, 13(2), 8-14. https://europub.co.uk./articles/-A-643810