News Web Portal based on Natural Language Processing
Journal Title: Romanian Journal of Human - Computer Interaction - Year 2008, Vol 1, Issue 3
Abstract
The paper presents an autonomous text classification module for a news web portal for the Romanian language. Statistical natural language processing techniques are combined in order to achieve a completely autonomous functionality of the portal. The news items are automatically collected from a large number of news sources using web syndication. Afterward, machine-learning techniques are used for achieving an automatic classification of the news stream. Firstly, the items are clustered using an agglomerative algorithm and the resulting groups correspond to the main news topics. Thus, more in-formation about each of the main topics is acquired from various news sources. Secondly, text classification algorithms are applied to automatically label each cluster of news items in a predetermined number of classes. More than a thou-sand news items were employed for both the training and the evaluation of the classifiers. The paper presents a complete comparison of the results obtained for each method.
Authors and Affiliations
Traian Rebedea, Costin-Gabriel Chiru, Ştefan Trăuşan-Matu
Interactions in Smart Environments and the Importance of Modelling
One challenge in software engineering is the development of smart environments that help users to intuitively accomplish their tasks. The ideal smart environment dynamically manages a diverse collection of devices, is ac...
Visual Communication through Infographics
Interaction techniques and visual representations allow users to view, explore and understand large amounts of information. The research made in Information Visualization area has focused on finding ways to render the ab...
Testing with Visual Impairment Users of a Local Public Administration Web Site
Accessibility and usability are two concepts which evolved together, usability being associated with ergonomics (especially cognitive ergonomics) of the user-interfaces and accessibility being associated with the not dis...
Evaluation Of Motivational Value Of An Augmented Reality System For Learning Biology
The development of augmented reality (AR) technologies creates new opportunities and challenges for e-learning systems designers. An important goal of these new platforms is increasing the learning motivation of students...
Ontology-based Collaborative Applications
his paper proposes a method to extend a collaborative application with semantic web technologies in order to provide advanced functionalities for searching and classification of messages based on the semantics of the ter...