Stemmers for Tamil Language: Performance Analysis

Abstract

Stemming is the process of extracting root word from the given inflection word and also plays significant role in numerous application of Natural Language Processing (NLP). Tamil Language raises several challenges to NLP, since it has rich morphological patterns than other languages. The rule based approach light-stemmer is proposed in this paper, to find stem word for given inflection Tamil word. The performance of proposed approach is compared to a rule based suffix removal stemmer based on correctly and incorrectly predicted. The experimental result clearly show that the proposed approach light stemmer for Tamil language perform better than suffix removal stemmer and also more effective in Information Retrieval System (IRS).

Authors and Affiliations

M. Thangarasu , Dr. R. Manavalan

Keywords

Related Articles

AN ENHANCED PRIVACY RULE BASED MODEL FOR FILTERING UNPREFERRED MESSAGES

The online social networks (OSN) offers proficient message controls that are posted on their private space in order to avoid un-preferred content displayed to users. But, OSN provides a low supportive and flexibility to...

Hilditch’s Algorithm Based Tamil Character Recognition

Character identification plays a vital role in the contemporary world of Image processing. It can solve many composite problems and makes human’s work easier. An instance is Handwritten Character detection. Handwritten r...

Cloud Computing and its challenges: A Review

Cloud computing is today’s one of the most recent topics due to its cost-efficiency and flexibility and ubiquitous computing. This paper gives a review our early of Cloud computing, its major characteristics and some iss...

Enhanced Double Layer Security using RSA over DNA based Data Encryption System

In this paper we propose an enhanced algorithm to communicate data securely for communication and information security. The DNA cryptography is a new and promising area in cryptography. Here we propose techniques that us...

A Literature Review: Cryptography Algorithms for Wireless sensor networks

Cryptography is that the observe and study of techniques for secure communication within the presence of third parties. It additionally plays important of wireless sensor networks. The cryptography drawback has addressed...

Download PDF file
  • EP ID EP151532
  • DOI -
  • Views 83
  • Downloads 0

How To Cite

M. Thangarasu, Dr. R. Manavalan (2013). Stemmers for Tamil Language: Performance Analysis. International Journal of Computer Science & Engineering Technology, 4(7), 902-908. https://europub.co.uk./articles/-A-151532