Learning Approaches toward Title Word Selection on Indic Script

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 3

Abstract

A title is a compact representation of a document that distills its most important information. In this paper, we study the selection of title words using several learning approaches: the nearest neighbor approach (NN), the Naive Bayes approach with limited vocabulary (NBL), the Naive Bayes approach with full vocabulary (NBF), and a term weighting approach (tf-idf). We compare the performance of these approaches using the F1 metric, on both English and the Indic script Telugu. We draw conclusions about the influence of linguistic complexity on the process of title word selection.
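As a rough illustration of two ideas named in the abstract, the sketch below ranks the words of a document by a tf-idf weight and scores the selected words against a reference title with F1. It is not the authors' implementation: the function names, the smoothed idf formula, the top-k cutoff, and the toy corpus are all assumptions made for the example, and the abstract does not say how the paper tokenizes text or chooses the number of title words.

```python
import math
from collections import Counter


def tfidf_scores(doc_tokens, corpus):
    """Score each distinct word of doc_tokens by tf * idf.

    corpus is a list of token lists. The smoothed idf used here is an
    assumption for this sketch, not a formula taken from the paper.
    """
    n_docs = len(corpus)
    counts = Counter(doc_tokens)
    scores = {}
    for word, count in counts.items():
        # Document frequency: number of corpus documents containing the word.
        df = sum(1 for doc in corpus if word in doc)
        idf = math.log((1 + n_docs) / (1 + df)) + 1.0
        scores[word] = (count / len(doc_tokens)) * idf
    return scores


def select_title_words(doc_tokens, corpus, k=3):
    """Return the k highest-scoring words (ties broken alphabetically)."""
    scores = tfidf_scores(doc_tokens, corpus)
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:k]


def f1_score(predicted, reference):
    """F1 between the predicted title words and the reference title words."""
    pred, ref = set(predicted), set(reference)
    overlap = len(pred & ref)
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


# Toy corpus (hypothetical data, for illustration only).
corpus = [
    ["rain", "floods", "city", "the"],
    ["the", "city", "council", "meets"],
    ["the", "rain", "stops"],
]
selected = select_title_words(corpus[0], corpus, k=2)
score = f1_score(selected, ["floods", "city"])
```

Because the code operates on pre-tokenized word lists, the same functions apply unchanged to Telugu text once it has been split into words; how that splitting is done is outside the scope of this sketch.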

Authors and Affiliations

P. Vijayapal Reddy, A. Govardhan

How To Cite

P. Vijayapal Reddy, A. Govardhan (2011). Learning Approaches toward Title Word Selection on Indic Script. International Journal on Computer Science and Engineering, 3(3), 1063-1067. https://europub.co.uk./articles/-A-91913