Table Detection and Extraction from Image Document

Abstract

Tables make information easier to understand and perceive than regular text block. Now days, it  becomes popular structure for information representation. Format of tables differs and change according to need of representation of information. Various format of table makes it difficult for OCR system to recognize and just segment as an Image block. We proposed a novel approach which can detect all type of table format from single column image document. Tables are categorized in three type based of their rows and column separator.Type1 table have line as row and column separator. Type2 table have horizontal line for separating rows and space for separating column. In Type3 tables only space are used as both row and columns separator. Tables are detected from image documents based on simple projection profile and hough line detection method. We have tested this approach with 1200 image documents which contains all type of table format and get 89% accurate result.

Authors and Affiliations

Tanushree Dhiran , Rakesh Sharma

Keywords

Related Articles

Modified Ant Colony Based Routing Algorithm in Manet

In MANET, without the aid of any established infrastructure or centralized administration, a temporary network needs to be established whenever a node tries to send data to another node. Each node in MANET acts as an end...

Sharing Channel In IEEE 802.16 Using The Cooperative Model Of Slotted ALOHA

One of the main problems in WIMAX is to share the medium by multiple users who compete for access. Various random access mechanisms, such as ALOHA and its corresponding variations have been widely studied as efficient me...

Fractal Image Compression Techniques

Image compression is an essential technology in multimedia and digital communication fields. Fractal image compression is a potential image compression scheme due to its potential high compression ratio, fast decompressi...

 Ajax Complexity

 For century, This paper discuss the new era of Internet application and user experience, Ajax is a new technology and this paper address the Software system complexity and Algorithms for better feature and performa...

 An Integrated Approach to Measurement Software Defect using Software Matrices

 Software measurement is a quantified attribute of a characteristic of a software product or the software process. It is a discipline within software engineering. Measurement programs in software organizations are a...

Download PDF file
  • EP ID EP120818
  • DOI -
  • Views 111
  • Downloads 0

How To Cite

Tanushree Dhiran, Rakesh Sharma (2013). Table Detection and Extraction from Image Document. International Journal of Computer & organization Trends(IJCOT), 3(7), 275-278. https://europub.co.uk./articles/-A-120818