Greedy Algorithms to Optimize a Sentence Set Near-Uniformly Distributed on Syllable Units and Punctuation Marks

Abstract

An optimum sentence set that is near-uniformly distributed over syllable units and punctuation marks is important for developing a syllable-based automatic speech recognition (ASR) system. Such a set is usually extracted from a mother set of millions of unique sentences using the Modified Least-to-Most (LTM) Greedy algorithm. The Modified LTM Greedy algorithm can minimize the number of syllables but does not balance their frequencies. Hence, two schemes are proposed that both minimize the number of syllables and distribute their frequencies near-uniformly. Testing on a mother set of 10 million Indonesian sentences shows that both schemes perform better than the Modified LTM Greedy algorithm for two syllable units: monosyllables and bisyllables.
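The abstract does not describe the selection procedure itself. As a rough illustration of the general idea only, the sketch below shows a generic greedy coverage loop that picks sentences until every unit in the mother set is covered, preferring sentences that add little to units that are already frequent. It is not the Modified LTM Greedy algorithm, nor either of the two proposed schemes. The names `units` and `greedy_select` are hypothetical, and whitespace-separated tokens stand in for real syllable units.

```python
from collections import Counter


def units(sentence):
    """Toy extractor (assumption): whitespace tokens stand in for syllable units."""
    return sentence.split()


def greedy_select(sentences, extract=units):
    """Greedily pick sentences until every unit in the mother set is covered.

    Illustrative sketch only, not the paper's algorithm.
    Returns the chosen sentences and the unit frequencies in the chosen set.
    """
    sentence_units = [extract(s) for s in sentences]
    uncovered = {u for us in sentence_units for u in us}
    counts = Counter()          # unit frequencies in the chosen set so far
    remaining = set(range(len(sentences)))
    chosen = []

    def score(i):
        us = sentence_units[i]
        new = len(uncovered.intersection(us))   # units covered for the first time
        excess = sum(counts[u] for u in us)     # load added to already-seen units
        # Prefer more new units, then less excess, then the earlier sentence.
        return (new, -excess, -i)

    while uncovered:
        best = max(remaining, key=score)
        chosen.append(best)
        remaining.remove(best)
        counts.update(sentence_units[best])
        uncovered.difference_update(sentence_units[best])

    return [sentences[i] for i in chosen], counts


mother_set = ["ba ca", "ba ba da", "ca da"]
selected, frequencies = greedy_select(mother_set)
print(selected)
print(frequencies)
```

The tie-breaking penalty is one simple way to push frequencies toward uniformity; a real system would replace `units` with a proper syllabifier and would likely use a more careful balance criterion.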

Authors and Affiliations

Bagus Nugroho Budi Nurtomo, Suyanto Suyanto

  • EP ID EP408080
  • DOI 10.14569/IJACSA.2018.091035

How To Cite

Bagus Nugroho Budi Nurtomo, Suyanto Suyanto (2018). Greedy Algorithms to Optimize a Sentence Set Near-Uniformly Distributed on Syllable Units and Punctuation Marks. International Journal of Advanced Computer Science & Applications, 9(10), 291-296. https://europub.co.uk./articles/-A-408080