An Intra and Inter-Topic Evaluation and Cleansing Method
Journal Title: Romanian Journal of Human - Computer Interaction - Year 2010, Vol 3, Issue 2
Abstract
Topic modeling is a growing research field, and novel ways of interpreting and evaluating its results are necessary. We propose a method, relying on WordNet data, for evaluating and improving the performance of algorithms that generate topic models. We first propose a measure of a topic model’s fitness that factors in its broadness and redundancy. Then, for each individual topic, we determine the amount of relevant information it provides, along with its most important words and related concepts, by defining a cohesion function based on the topic’s projection onto WordNet concepts. The model as a whole is improved by eliminating each topic’s outliers with respect to the ontology projection. We define an inter-topic, ontology-based distance and use it to investigate the impact of removing redundant topics from a model with regard to the overlap between topics’ ontological projections. As an alternative to pruning less relevant topics, we also try clustering similar topics into conceptually cohesive groups. Results show that evaluating and improving statistical models with WordNet is a promising research track that leads to more coherent topic models.
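The abstract does not give the formula for the inter-topic distance, so the following is only an illustrative sketch of the general idea, not the authors' definition. It assumes each topic is projected onto a set of concepts and that redundancy is judged by the Jaccard distance between those sets. The toy ontology, the function names, and the 0.5 threshold are all hypothetical stand-ins (a real setup would draw concepts from WordNet).

```python
from itertools import combinations

# Hypothetical toy ontology: word -> concepts it maps to.
# This is a stand-in for WordNet data, chosen only for illustration.
TOY_ONTOLOGY = {
    "dog": {"canine", "animal"},
    "cat": {"feline", "animal"},
    "wolf": {"canine", "animal"},
    "car": {"vehicle"},
    "truck": {"vehicle"},
    "bus": {"vehicle"},
}


def project(topic_words):
    """Union of the concepts reached by a topic's words; unknown words are skipped."""
    concepts = set()
    for word in topic_words:
        concepts |= TOY_ONTOLOGY.get(word, set())
    return concepts


def jaccard_distance(a, b):
    """1 - |a & b| / |a | b|; defined here as 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def redundant_pairs(topics, threshold):
    """Pairs of topic names whose concept projections are closer than threshold."""
    projections = {name: project(words) for name, words in topics.items()}
    return [
        (s, t)
        for s, t in combinations(sorted(projections), 2)
        if jaccard_distance(projections[s], projections[t]) < threshold
    ]


# Assumed example topics: two about animals, one about vehicles.
topics = {
    "t0": ["dog", "wolf"],
    "t1": ["dog", "cat"],
    "t2": ["car", "bus"],
}

print(redundant_pairs(topics, threshold=0.5))
```

Under these assumptions the two animal topics overlap heavily in concept space and are flagged as a redundant pair, while the vehicle topic stays separate. Flagged pairs could then be either pruned or merged, mirroring the two alternatives the abstract mentions.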
Authors and Affiliations
Claudiu Muşat, Marian-Andrei Rizoiu, Ştefan Trauşan-Matu
WebVOX – a Solution for Web Page Accessibility Improvement for Persons with Reading Deficiency
This paper presents the WebVOX system for improving Web page accessibility for persons with reading deficiency. The presented solution addresses people with dyslexia, low literacy and reading skills, learning difficu...
Integrating 3D Scene in Interactive Graphics Applications
Building and using tridimensional geometric models in computer applications imply great efforts in production software application. A model contains both complex geometry and properties such as color, lighting, texture,...
Ignoring Irrelevant Information on Displays
This paper presents empirical and modeling work aiming to contribute toward a coherent theory of how humans select relevant information and discard irrelevant information presented on displays. Two empirical studies ar...
A Study Regarding The Abstract Specification Of The User Interface By Using USIXML And UIML Languages
The paper proposes a study regarding the methods of abstract specification for a game user interface by using UsiXML and UIML descriptive languages. Certain aspects about the model-based interaction in the context of CAD...
Location-based Services Personalization Using Genetic Algorithms and Intelligent Agents
This article analyzes possible solutions for improving Location-based Services (LBS) by using genetic algorithms and intelligent agents. The proposed solution enables identifying the points of interest based on the user...