A FRAME WORK FOR WEB INFORMATION EXTRACTION AND ANALYSIS
Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 7, Issue 2
Abstract
Day by day the volume of information availability in the web is growing significantly. There are several data structures for information available in the web such as structured, semi-structured and unstructured. Majority of information in the web is presented in web pages. The information presented in web pages is semi-structured. But the information required for a context are scattered in different web documents. It is difficult to analyze the large volumes of semi-structured information presented in the web pages and to make decisions based on the analysis. The current research work proposed a frame work for a system that extracts information from various sources and prepares reports based on the knowledge built from the analysis. This simplifies  data extraction, data consolidation, data analysis and decision making based on the information presented in the web pages.The proposed frame work integrates web crawling, information extraction and data mining technologies for better information analysis that helps in effective decision making.  It enables people and organizations to extract information from various sourses of web and to make an effective analysis on the extracted data for effective decision making. The proposed frame work is applicable for any application domain. Manufacturing,sales,tourisum,e-learning are various application to menction few.The frame work is implemetnted and tested for the effectiveness of the proposed system and the results are promising.
Authors and Affiliations
Dr Sunitha Abburu, G. Suresh Babu
Secured Data Transmission Using Wavelet Based Steganography And Cryptography
Steganography and cryptographic methods are used together with wavelets to increase the security of the data while transmitting through networks. Another technology, the digital watermarking is the process of embedding i...
AN APPROACH TO GENERATE MST WITHOUT CHECKING CYCLE
Abstract: A minimum spanning tree of an undirected graph can be easily obtained using classical algorithms by Prim or Kruskal. MST generation is a NP hard problem. Now this paper represents an algorithm to find minimum s...
TAXONOMY FOR WSN SECURITY-A SURVEY
WSN is one of the dominant and emerging technology that shows great promise for various application in military, ecological and health related areas.WSN is highly vulnerable to attacks and inclusion of wireless communi...
Blind Signal Separation Using an Adaptive Generalized Compound Gamma Distribution
We propose an independent component analysis (ICA) algorithm which can separate mixtures of sub- and super- Gaussian source signals with self-adaptive nonlinearities. The ICA algorithm in the framework of natural Riemann...
Secure Dynamic Resource Provisioning Cost by Optimized Placement of Virtual Machines in Cloud Computing
Cloud computing provides pay-as-you-go computing resources and accessing services are offered from data centers all over the world as the cloud. Consumers may find that cloud computing allows them to reduce the cost of i...