Stemmers for Tamil Language: Performance Analysis

Abstract

Stemming is the process of extracting root word from the given inflection word and also plays significant role in numerous application of Natural Language Processing (NLP). Tamil Language raises several challenges to NLP, since it has rich morphological patterns than other languages. The rule based approach light-stemmer is proposed in this paper, to find stem word for given inflection Tamil word. The performance of proposed approach is compared to a rule based suffix removal stemmer based on correctly and incorrectly predicted. The experimental result clearly show that the proposed approach light stemmer for Tamil language perform better than suffix removal stemmer and also more effective in Information Retrieval System (IRS).

Authors and Affiliations

M. Thangarasu , Dr. R. Manavalan

Keywords

Related Articles

A Single Image Super Resolution Using Advanced Neighbor Embedding

There are lots of Super resolution methods developed recently. Each has its own pros and cons and behavior. The neighbor-embedding (NE) algorithm for single-image super-resolution reconstruction is one of them which assu...

HYBRID PERSONALIZED RECOMMENDATION APPROACH FOR IMPROVING MOBILE E-COMMERCE

In recent years, the massive influx of information onto internet has facilitated user, not only retrieving information, but also discovering facts. However, web users usually suffer from the information overload problem...

Real Time Detection Of Network Attacks Using Signature Based Approach

Network attack detection is an essential technology in business as well as dynamic research area. It is essential for security of the information. Attacks on network can cause legitimate users being strived or denied ser...

Evaluation of Classifiers to Enhance Model Selection

The various tasks like classification, clustering and association rule deriving are performed in the data-mining for the pattern extraction. The performance evaluation measures make each task distinct and meaningful. The...

A MACHINE LEARNING APPROACH TO PREDICT SOLAR RADIATION FOR SOLAR ENERGY BASED DEVICES

Solar energy is used in many applications, such as increasing water’s temperature or moving electrons in a photovoltaic cell, agriculture planning, fuel production, electricity production, transport, architecture and urb...

Download PDF file
  • EP ID EP151532
  • DOI -
  • Views 107
  • Downloads 0

How To Cite

M. Thangarasu, Dr. R. Manavalan (2013). Stemmers for Tamil Language: Performance Analysis. International Journal of Computer Science & Engineering Technology, 4(7), 902-908. https://europub.co.uk./articles/-A-151532