Development of the method for filtering verbal noise while search keywords for the English text
Journal Title: Технологический аудит и резервы производства - Year 2018, Vol 6, Issue 2
Abstract
<p><em>The object of research is the processing of verbal information to identify keywords in the text. The most important step in the search for key terms is the calculation of their weights in the document in question, which makes it possible to evaluate their significance relative to each other in this context. To solve this problem, there are many approaches that are conditionally divided into two groups: they require learning and do not require learning. Learning implies the need to pre-process the original body of texts in order to extract information about the frequency of occurrence of terms in the entire body. An alternative approach is using linguistic ontologies, which are more or less approximate models of the existing set of words in a given language. On the basis of both approaches, systems are created for the automatic extraction of key terms. Nevertheless, in the direction of searching for keywords, research is not stopped in order to improve the accuracy and completeness of the results, as well as to use methods of extracting information from the text to solve new problems.</em></p><p><em>Existing approaches to the definition of keywords are characterized. The best quality of text processing is achieved by linguistic methods or when their combinations are statistical. A system for automatically determining key phrases from natural language text should be developed using the morphological dictionary and syntax rules.</em></p><em>The study uses an approach to defining keywords based on finding syntactic links between word forms in sentences in English text using the instrumental capabilities of modern linguistic packages. In the framework of the general approach to reducing verbal noise in the method, it is proposed that it is achieved with the help of formalized operations: the replacement of pronouns with the corresponding nouns; removal of noise connections; removing noise words; withdrawal of stop words. The described operations can be used as additional modules that improve the results of finding keywords for both the developed method for determining keywords of English text and other algorithms for finding keywords.</em>
Authors and Affiliations
Oleg Bisikalo, Alexander Yahimovich, Yaroslav Yahimovich
The emergency simulation with the help of four-layer hidden Markov model
<p><em>The object of research is the process of selecting a synergistically determined pair for the elements of complex systems in the design, manufacture or repair. One of the most problematic places in the selection is...
Investigation of the existing methodology of value estimation and methods of discount rate estimation
<p><em>The subject of research is the current practice of determining the fair value of assets and liabilities at the present (discounted) cost. One of the most problematic places is the determination of the discount rat...
Research into energy efficiency of the underfloor heating system, assembled dry
<p><em>The object of research is the thermal parameters of operation of a fragment of the floor heating system assembled dry, under conditions of actual application set in the lab premises. </em></p><p><em>One of the mos...
Improvement of tax policy of territorial communities in the context of budget decentralization
<p><em>The object of research is the process of improving tax policy at the level of territorial communities, taking into account the specifics of fiscal decentralization. One of the most problematic places is the redist...
Research of surface properties of water-flour suspensions in the presence of hydrocolloids and protein supplements
<p><em>An important issue of improving the technology of non-yeasted gluten-free bread is the development of measures to improve the structural and mechanical properties of dough and bread. To this end, the use of polysa...