Using Word Embeddings for Ontology Enrichment

Abstract

Word embeddings, distributed word representations in a reduced linear space, show a lot of promise for accomplishing Natural Language Processing (NLP) tasks in an unsupervised manner. In this study, we investigate if the success of word2vec, a Neural Networks based word embeddings algorithm, can be replicated in an aggluginative language like Turkish. Turkish is more challenging than languages like English for complex NLP tasks because of her rich morphology. We picked ontology enrichment, again a relatively harder NLP task, as our test application. Firstly, we show how ontological relations can be extracted automaticaly from Turkish Wikipedia to construct a gold standard. Then by running experiments we show that the word vector representations produced by word2vec are useful to detect ontological relations encoded in Wikipedia. We propose a simple but yet effective weakly supervised ontology enrichment algorithm where for a given word a few know ontologically related concepts coupled with similarity scores computed via word2vec models can result in discovery of other related concepts. We argue how our algorithm can be improved and augmented to make it a viable component of an ontoloy learning and population framework.

Authors and Affiliations

İzzet Pembeci*| Muğla Sıtkı Koçman University. Department of Computer Engineering

Keywords

Related Articles

Fuzzy Multicriterial Methods for the Selection of IT-Professionals

This paper presents the solution of issues related to selection based on evaluation of demand set forth to IT specialists, to develop appropriate decision support system. In this case problem is reduced to multicriterial...

BAT algorithm for Cryptanalysis of Feistel cryptosystems

Recent cryptosystems constitute an effective task for cryptanalysis algorithms due to their internal structure based on nonlinearity. This problem can be formulated as NP-Hard. It has long been subject to various attacks...

Classification of Siirt and Long Type Pistachios (Pistacia vera L.) by Artificial Neural Networks

Quality is one of the important factors in agricultural products marketing. Grading machines have great role in quality control systems. The most efficient method used in grading machines today is image processing. This...

The Usage of Artificial Neural Networks Method in the Diagnosis of Rheumatoid Arthritis

In this study, artificial neural networks (ANN) method is used for the diagnosis of rheumatoid arthritis in order to support medical diagnostics. For the diagnosis of rheumatoid arthritis, backpropagation algorithm was e...

Grade prediction improved by regular and maximal association rules

In this paper we propose a method of predicting student scholar performance using the power of regular and maximal association rules. Due to the large number of generated rules, traditional data mining algorithms can bec...

Download PDF file
  • EP ID EP799
  • DOI 10.18201/ijisae.58806
  • Views 436
  • Downloads 25

How To Cite

İzzet Pembeci* (2016). Using Word Embeddings for Ontology Enrichment. International Journal of Intelligent Systems and Applications in Engineering, 4(3), 49-56. https://europub.co.uk./articles/-A-799