POS tagger based on second-order HMM
Journal Title: Romanian Journal of Human - Computer Interaction - Year 2012, Vol 5, Issue 3
Abstract
Part-of-speech tagging (POS tagging) is the process of grammatical labelling of each word in a sentence, phrase or paragraph with the corresponding part of speech. This process is a component of other modules of natural language processing and therefore the results should be as precise as possible. Once a part of speech has been identified, it provides supplementary information about the parts of speech that can appear in the same sentence. In the case of POS tagging, the ambiguities arise due to the fact that a word may have multiple morphological values depending on context. In this paper is performed, from an experimental perspective, an analysis of a POS Tagger based on a Second-Order Hidden Markov Model, using the Brown corpus. The tests have been conducted to obtain results according to various parameters. We will show how changes the accuracy of a POS tagger for English when become different, on the one hand, the training set size, and on the other hand, the domains of the original functions in comparison with the domain of the training set. We have identified the categories of texts from Brown corpus used for the training corpus when the accuracy of the POS tagger is higher, lower respectively.
Authors and Affiliations
Dumitru-Clementin Cercel , Stefan Trăuşan-Matu
The Generic Interaction Protocol: Increasing portability of distributed physical user interfaces
Natural user interfaces want to liberate the user from having to learn new concepts to interact with computers. They do that by taking advantage of our senses and our own knowledge about world in order to build the user...
Controlling the applications running on a Windows system by means of Android devices.
This article presents a client-server application that, enabling the user to remotely control with an Android component the applications running on the Microsoft Windows operating system. The system consists of two main...
Graphics Annotation Techniques in E-learning
This paper presents the experiments on the user interaction techniques based on the 2D and 3D graphics annotation, developed in eTrace eLearning Environment. The 2D annotation technique does not depend on the type of the...
Recognizing named entities, quotes and events in news and social media items in Romanian
At the border of natural language processing and information retrieval, named entity recognition has represented one of the most important research problems of the two domains, that has not been solved perfectly yet even...
The Influence of Perceptual Accuracy on the User Learning Experience with a Biology Application
In recent years there is a growing interest for the hedonic aspects of interacting with e-learning systems. The interaction techniques based on augmented reality (AR – Augmented Reality) provides new opportunities to inc...