A FRAME WORK FOR WEB INFORMATION EXTRACTION AND ANALYSIS

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 7, Issue 2

Abstract

The volume of information available on the web grows significantly day by day. Information on the web takes several forms: structured, semi-structured and unstructured. The majority of it is presented in web pages, which are semi-structured. However, the information required for a given context is scattered across different web documents. It is difficult to analyze the large volumes of semi-structured information presented in web pages and to make decisions based on that analysis. This research work proposes a framework for a system that extracts information from various sources and prepares reports based on the knowledge built from the analysis. This simplifies data extraction, data consolidation, data analysis and decision making based on the information presented in web pages. The proposed framework integrates web crawling, information extraction and data mining technologies for better information analysis that supports effective decision making. It enables people and organizations to extract information from various web sources and to analyze the extracted data effectively. The proposed framework is applicable to any application domain; manufacturing, sales, tourism and e-learning are a few examples. The framework was implemented and tested for effectiveness, and the results are promising.
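The abstract does not describe the implementation, so the following is only a minimal sketch of the extract, consolidate and analyze stages it names, not the authors' system. It assumes each page carries a two-column HTML table of (category, value) rows; the names TableRowExtractor, extract_records, consolidate and PAGES are hypothetical, and the crawling stage is replaced by in-memory strings so the example runs without network access.

```python
from collections import defaultdict
from html.parser import HTMLParser


class TableRowExtractor(HTMLParser):
    """Collects the text of every <td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []          # completed rows, each a list of cell strings
        self._row = None        # row currently being read, or None
        self._in_cell = False   # True while inside a <td>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_data(self, data):
        if self._in_cell:
            self._row[-1] += data.strip()

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            if self._row:       # skip rows without any <td> cells
                self.rows.append(self._row)
            self._row = None
            self._in_cell = False


def extract_records(html):
    """Extraction stage: turn one page into (category, value) pairs.

    Assumes every data row has exactly two cells and a numeric value.
    """
    parser = TableRowExtractor()
    parser.feed(html)
    parser.close()
    return [(name, float(value)) for name, value in parser.rows]


def consolidate(pages):
    """Consolidation and analysis stage: total the values per category."""
    totals = defaultdict(float)
    for html in pages:
        for name, value in extract_records(html):
            totals[name] += value
    return dict(totals)


# Hypothetical pages standing in for the output of a web crawler.
PAGES = [
    "<table><tr><td>sales</td><td>120</td></tr>"
    "<tr><td>tourism</td><td>30</td></tr></table>",
    "<table><tr><td>sales</td><td>80</td></tr></table>",
]

report = consolidate(PAGES)
print(report)
```

In a real deployment the PAGES list would be filled by a crawler and the per-category sum would give way to whatever data mining step the application domain calls for; the sketch only shows how the stages hand data to one another.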

Authors and Affiliations

Dr Sunitha Abburu, G. Suresh Babu

Related Articles

Fuzzy Cognitive Maps Based Election Results Prediction System

Prediction and forecast are common words in the area of election. It can also be related with word “opinion poll”. Although according to dictionary the meaning/definition of prediction is limited but practically pred...

Twig Pattern Minimization Based on XML Schema Constraints

Twig pattern is one of the core components of XQuery. Twig usually includes redundancy nodes which can be optimized. Schema feature is used to judge whether the node of Twig pattern is redundancy. In this paper, we propo...

Enhancing the Security of the GPT Cryptosystem Against Attacks

The concept of public key cryptosystems based on error correcting codes was invented by McEliece in 1978. In 1991 Gabidulin, Paramonov and Tretjakov proposed a new version of the McEliece cryptosystem (GPT) based on max...

FROM GRID COMPUTING TO CLOUD INFRASTRUCTURES

As a consequence of the economic crisis, the funds allocated for the development of the IT&C infrastructure in all domains become scarcer, whilst the need for computing services increases day by day and becomes a key...

A Modified Advanced Encryption Standard Algorithm for Image Encryption

Cryptography algorithms are becoming more necessary to ensure secure data transmission, which can be used in several applications. Increasing use of images in industrial process therefore it is essential to protect the c...

  • EP ID EP650063
  • DOI 10.24297/ijct.v7i2.3459

How To Cite

Dr Sunitha Abburu, G. Suresh Babu (2013). A FRAME WORK FOR WEB INFORMATION EXTRACTION AND ANALYSIS. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 7(2), 574-579. https://europub.co.uk./articles/-A-650063