A Semantic Approach for Outlier Detection in Big Data Streams
Journal Title: Webology - Year 2019, Vol 16, Issue 1
Abstract
In recent years, the world faced a big revolution in data generation and collection technologies. The volume, velocity and veracity of data have changed drastically and led to new types of challenges related to data analysis, modeling and prediction. One of the key challenges is related to the semantic analysis of textual data especially in big data streams settings. The existing solutions focus on either topic analysis or the sentiment analysis. Moreover, the semantic outlier detection over data streams as one of the key problems in data mining and data analysis fields has less focus. In this paper, we introduce a new concept of semantic outlier through which the topic of the textual data is considered as the primary content of the data stream while the sentiment is considered as the context in which the data has been generated and affected. Also, we propose a framework for semantic outlier detection in big data streams which incorporates the contextual detection concepts. The advantage of the proposed concept is that it incorporates both topic and sentiment analysis into one single process; while at the same time the framework enables the implementation of different algorithms and approaches for semantic analysis.
Authors and Affiliations
Hussien Ahmad and Salah Dowaji
Marketing of Library and Information Services in Global Era: A Current Approach
This paper deals with the marketing of library and information services in the global era. It discusses about the marketing concept of today's library and information centers covering various topics such as management...
Systematic Literature Review on Opinion Mining of Big Data for Gov ernment Intelligence
With the advent of new technology paradigm, SMAC (Social media, Mobile, Analytics and Cloud) the information network generates an infinite ocean of data spreading faster and larger than earlier. A high quality informatio...
Search Engines and Resource Discovery on the Web: Is Dublin Core an Impact Factor?
This study evaluates the effectiveness of the Dublin Core metadata elements on the retrieval of web pages in a suite of six search engines, AlltheWeb, AltaVista, Google, Excite, Lycos, and WebCrawler. The effectiveness o...
More Effective Web Search Using Bigrams and Trigrams
This paper investigates the effectiveness of quoted bigrams and trigrams as query terms to target web search. Prior research in this area has largely focused on static corpora each containing only a few million documents...
Open access to scientific knowledge and feudalism knowledge: Is there a connection?
The role of universities and transnational corporations in the circulation of scientific knowledge is considered. If institutions generate, mostly scientific knowledge, trying to facilitate its free circulation, then tra...