Semantic Similarity Calculation of Chinese Word

Abstract

This paper puts forward a two layers computing method to calculate semantic similarity of Chinese word. Firstly, using Latent Dirichlet Allocation (LDA) subject model to generate subject spatial domain. Then mapping word into topic space and forming topic distribution which is used to calculate semantic similarity of word(the first layer computing). Finally, using semantic dictionary "HowNet" to deeply excavate semantic similarity of word (the second layer computing). This method not only overcomes the problem that it’s not specific enough merely using LDA to calculate semantic similarity of word, but also solves the problems such as new words (haven’t been added in dictionary) and without considering specific context when calculating semantic similarity based on semantic dictionary "HowNet". By experimental comparison, this thesis proves feasibility,availability and advantages of the calculation method.

Authors and Affiliations

Liqiang Pan, Pu Zhang, Anping Xiong

Keywords

Related Articles

Deep Gated Recurrent and Convolutional Network Hybrid Model for Univariate Time Series Classification

Hybrid LSTM-fully convolutional networks (LSTM-FCN) for time series classification have produced state-of-the-art classification results on univariate time series. We empirically show that replacing the LSTM with a gated...

A Multimedia System for Breath Regulation and Relaxation

In the hectic life today, detrimental stress has caused numerous illness. To adjust mental states, breath regulation plays a core role in multiple relaxation techniques. In this paper, we introduce a multimedia system su...

Sentiment Analysis Using Deep Learning Techniques: A Review

The World Wide Web such as social networks, forums, review sites and blogs generate enormous heaps of data in the form of users views, emotions, opinions and arguments about different social events, products, brands, and...

Designing Smart Sewerbot for the Identification of Sewer Defects and Blockages

Internet of thing (IoT) is a new concept where the term ‘thing’ is associated with the configurable sensors and devices no matter domestic or industrial, whereas bridging up a relationship in between these things and int...

Data Synchronization Model for Heterogeneous Mobile Databases and Server-side Database

Mobile devices, because they can be used to access corporate information anytime anywhere, have recently received considerable attention, and several research efforts have been tailored towards addressing data synchroniz...

Download PDF file
  • EP ID EP137052
  • DOI 10.14569/IJACSA.2014.050802
  • Views 99
  • Downloads 0

How To Cite

Liqiang Pan, Pu Zhang, Anping Xiong (2014). Semantic Similarity Calculation of Chinese Word. International Journal of Advanced Computer Science & Applications, 5(8), 8-12. https://europub.co.uk./articles/-A-137052