Text Extraction from Image Using MSER Approach

Abstract

The automated understanding of textual information in images is an important problem to solve for the Computer Vision and Document Analysis for extracting that information for processing. This needs to generate required word regions and the remaining to be filter out the nontext area. For this, we extract the connected components (CCs) in images by using the maximally stable extremal region algorithm. Whereas in the existing system the region based method is considered. These extracted CCs are partitioned into clusters so that we can generate candidate regions Instead of using heuristic rules for clustering we train an AdaBoost classifier which determines the adjacency relationship and cluster those CCs by using their pair wise relations. Then we normalize candidate word regions and determine whether each region contains text or not. Adaboost classifier is based on multilayer perceptrons and we can control recall and precision rates with a single free parameter we develop text/nontext classifier for normalized images. Finally we obtain the extracted text by matching the trained set of templates.

Authors and Affiliations

V. Kalai selvan , M. Prakash

Keywords

Related Articles

Multi-Authentication for Cloud Security: A Framework

Cloud computing is a multi-tenant computational paradigm that offers an efficient, elastic and scalable business model for organizations to adopt various information technology (IT) resources i.e. software, hardware, net...

Detection and Removal of Bad Smells instantly using a InsRefactor

Software refactoring is one of the essential techniques which are used to improve the software quality without affecting any of the external functionality of the software. There were numerous of software refactoring tool...

DIGITAL WATERMARKING: A SURVEY 

This is the era of digital information as digital image plays a vital role in every field of human lives. A digital image is both informative and flexible since it is easy to edit and redistribute. The vulnerability of d...

A Review of various Software Project Scheduling techniques

Software project scheduling is one of the most important scheduling areas faced by software project management team. For a successful project, both software engineering and software management are very necessary. To comp...

A Novel Technique for Image Compression in Hand Written Recognition using Back Propagation in Neural Network

The handwritten symbol recognition plays an important role in present communication systems. In the data communication systems, all the data have to be recorded, encoded and will be communicated with other systems. Prese...

Download PDF file
  • EP ID EP99663
  • DOI -
  • Views 132
  • Downloads 0

How To Cite

V. Kalai selvan, M. Prakash (2014). Text Extraction from Image Using MSER Approach. International Journal of Computer Science & Engineering Technology, 5(4), 345-347. https://europub.co.uk./articles/-A-99663