Comparing Tesseract results with and without Character localization for Smartphone application
Journal Title: International Journal of Research in Computer and Communication Technology - Year 2013, Vol 2, Issue 5
Abstract
Tesseract is considered the most accurate free OCR engine in existence. Android operating system based Smartphones application where images taken from camera of mobile device or browsed from gallery are preprocessed. The text of these images will be accurately localized within the device using special localization method. Localized text sub-image will be fed for text extraction to the best OCR engine called “Tesseract”. In this paper, Tesseract results with and without Character localization is compared based on computation time in milliseconds. Each image is taken 10 times and time for each is calculated. The computation time is taken as average of this 10 values. There is drastic change in time and accuracy of localized image compared to nonlocalized image. Finally we concluded the importance of localization in OCR system especially for Smartphone application where we OCR a few words and need high accuracy.
Authors and Affiliations
Snehal Charjan, R. V. Mante, Dr. P. N. Chatur
A Cryptographic Based Implementation Of Secure Hash Algorithm By Using Microblaze Processor
Hash function is simply an algorithm that takes a string of any length and reduces it to a unique fixed length string. Hash functions are used to ensure data and message integrity, password validity as well as the bas...
Secure Transmission and Minimizing Communication Over Head In Cooperative Group
A mobile ad-hoc network is a selfconfiguring infrastructure less network of mobile devices connected by wireless. Here the problem is efficiently and securely broadcasting to cooperative groups and trusted key generat...
A Review On Data Mining Process In Healthcare Department To Identify The Frequently Occurring Diseases
Data mining is a process of analyzing large volumes of data to extract the useful knowledge from it. Data mining techniques is applied on medical data to improve the service in healthcare department. Availability of...
Data Identity Provable For Multi Clouds With Freezing Safeguards
A standout amongst the most critical current examinations in the Cloud Computing provisioning is the Service Level Agreement and its application in guaranteeing the supplied distributed computing administrations. The...
An Efficient Parallel Processing Technique For Large Text Files
Now a days the data was increasing rapidly because of heterogeneous resources. The data that is useful for large organisations will required large number of computing resources for processing large data sets. there a...