An Overview of Web Content Mining Tools
Journal Title: Bonfring International Journal of Data Mining - Year 2016, Vol 6, Issue 1
Abstract
The Web is one of the most widespread platforms for information exchange today, because it makes publishing documents easy. As the number of users and providers increases, the number of documents grows, and searching for information becomes a difficult and time-consuming process. Web mining uses various data mining techniques to discover useful knowledge from Web hyperlinks, page content, and usage log files. Mining tools scan HTML documents, images, and text, and the results are provided to search engines. This can help search engines return productive results for each search in order of relevance. In this paper, we give a brief introduction to the concepts related to data mining and web mining, followed by an overview of different Web mining tools. We conclude by presenting a comparative table of these tools based on some pertinent criteria.
Authors and Affiliations
Dr Eldhose T John, Bibu Skaria, P. X. Shajan