Tokenization and its challenges in Sindhi language
Journal Title: INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND EMERGING TECHNOLOGIES - Year 2017, Vol 1, Issue 1
Abstract
Natural language processing, is a branch of Artificial Intelligence (AI). This is computational techniques which are used to analysis and synthesis of NLP and its applications. Natural Language is the ability and capability to understand the spoken language. Sindhi language has polymorphic characteristics. Sindhi is an old as well as complex language in the world because of its semantic features, so the tokenization is difficult task for Sindhi language. Tokenization is also called word segmentation into words or script (numbers, alphabets). In this research issues of tokenization are discussing. In many language just like Urdu, Sindhi Arabic and so on. Most of the language have space insertion and space omission errors. So, it‟s very important to measure the different corpus with different algorithms in this research we utilize and develop J.Mahar model on corpus. When this tokenizer is tested on this data with one lac and seventy five thousand words of Sindhi text. On this corpus JM tokenizer provides 96% accuracy.
Learning Objects Tagging for an E-teacher’s Changing Roles
In this research paper we will present a robust and possible approach which will use tagging technique of learning material. We will implement a flexible designing instructional strategy for an E-teacher. We have propose...
Internet of Things (IoT) Applications: An Overview
Internet of Things (IoT) makes the physical world smarter and transform into the digital world. It makes the devices smarter, processing more artificially intelligent, and transmission more informative by merging real wo...
Biometric Recognition Techniques: An Analysis
Biometrics refers to innovations for measuring and investigating a person's physiological or behavioral qualities which are remarkable to people subsequently can be utilized to recognize a man. Biometrics is a developing...
Modeling the Noise Shaping ADC for the nodes of Internet of Things
Sensors of internet of the things (IoT) benefits from reduced power of high-resolution analog-to-digital converters (ADC). Power reduction is required for its applications, operating from energy harvesting or battery pow...
Smart university: A Case Study of Shah Abdul Latif University Khairpur
In education technologies play a vital role to empower a student. Technological innovation and student demands are the better way of learning are dramatically changing the nature of education. The smart university is the...