A SIMPLIFIED APPROACH TO WORD ALIGNMENT ALGORITHM FOR ENGLISH-TAMIL TRANSLATION

Journal Title: Indian Journal of Computer Science and Engineering - Year 2011, Vol 2, Issue 1

Abstract

In this paper, a recently proposed word alignment algorithm is simplified for ease of understanding and tested on an Indian language. Word alignment is viewed as a simple assignment problem and formulated as an Integer Linear Programming problem. The newly defined objective function is tested for obtaining an optimal alignment for the English-Tamil translation pair. Such an alignment is necessary for creating a probabilistic bilingual dictionary and is also required for automatic machine translation. We used this objective function to align the words in 25 sentences of an English-Tamil parallel corpus. The formulation is solved using the open-source LP-Solver. The results obtained indicate that the methodology is applicable to all Indian languages. Because word alignment is a standard problem in computational linguistics, the implementation is also useful for pedagogical purposes. The accuracy of modern statistical machine translation depends on good word alignment. The document of the formulated model is available on request.
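To make the assignment-problem view concrete, the following is a minimal sketch in Python, not the authors' formulation. It assumes a square matrix of association scores between English and Tamil words and requires each word to be aligned with exactly one word on the other side. The scores, the transliterated Tamil words, and the function name best_alignment are hypothetical and chosen only for illustration. The abstract states that the formulation is solved with an LP solver; this sketch swaps in brute-force enumeration of permutations, which yields the same optimum for very short sentences.

```python
from itertools import permutations


def best_alignment(scores):
    """Solve a small assignment problem by exhaustive search.

    scores[i][j] is an (assumed) association score between source word i
    and target word j. Returns (total, assignment), where assignment[i]
    is the target index aligned to source word i and the summed score
    is maximal over all one-to-one alignments.
    """
    n = len(scores)
    best_total, best_perm = float("-inf"), None
    for perm in permutations(range(n)):
        total = sum(scores[i][perm[i]] for i in range(n))
        if total > best_total:
            best_total, best_perm = total, perm
    return best_total, list(best_perm)


# Illustrative sentence pair (transliterated; not taken from the paper).
english = ["he", "reads", "books"]
tamil = ["avan", "puththakangal", "padikkiraan"]

# Hypothetical association scores, made up for this example.
scores = [
    [0.9, 0.1, 0.2],  # he
    [0.1, 0.2, 0.8],  # reads
    [0.2, 0.7, 0.3],  # books
]

total, assignment = best_alignment(scores)
pairs = [(english[i], tamil[j]) for i, j in enumerate(assignment)]
print(pairs)
```

Exhaustive search grows factorially with sentence length, so it is practical only for a handful of words. An integer linear programming solver, as used in the paper, or a dedicated assignment algorithm would be the usual choice for realistic sentences.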

Authors and Affiliations

R. Harshawardhan, Mridula Sara Augustine, Dr K. P. Soman

How To Cite

R. Harshawardhan, Mridula Sara Augustine, Dr K. P. Soman (2011). A SIMPLIFIED APPROACH TO WORD ALIGNMENT ALGORITHM FOR ENGLISH-TAMIL TRANSLATION. Indian Journal of Computer Science and Engineering, 2(1), 94-100. https://europub.co.uk./articles/-A-145082