STATISTICAL METHODS OF FORMATION OF TEXT CORPORA AND LEXICOGRAPHIC RESOURCES (ON THE BASIS OF THE SPECIALTY “ACOUSTICS AND ULTRASONIC”)

Abstract

The article considers the description of the step sequence in forming the text corpora, and then frequency dictionaries on the example of Acoustics and Ultrasonic Technique (AUST) specialty, the texts of which are referred to scientific and technical discourse. The necessity of application of real text corpora compiled with the help of statistical methods in the present-day research processes is proved. Statistical method usage allows to determine such a mandatory parameter as the reliability of text corpus and lexicographic resources created on its basis – frequency dictionaries, alphabet-frequency dictionaries, etc. The example of specialty AUST demonstrates how statistically verified characteristics of the text corpus allowed to create a reliable probabilistic-statistical model (frequency dictionary) of this subject area. The statistical reliabil- ity of the dictionary manifested itself in the fact that the percentage of covering the AUST texts with the units of the base dictionary (the first 2 thousand words) is 86%, which makes it possible to understand the content of almost any text on the specialty AUST using the lexical units presented in it (the base dictionary).

Authors and Affiliations

G. F. Dyachenko, S. L. Mykhailiuk, I. F. Duvanskaya

Keywords

Related Articles

PECULIARITIES OF TRANSLATING SUBJECTIVE MODALITY IN THE BRITISH NOVEL OF THE XIX CENTURY (BASED ON J. AUSTIN’S NOVEL “PRIDE AND PREJUDICE”)

The article describes the emergence of the British novel as a literary phenomenon and its characteristic features. Based on J. Austin’s novel ‘Pride and Prejudice’, some peculiarities of translating subjective modality i...

CATEGORY OF WORD IN THE CURRENT THEORY OF INFORMATION

In the article on the material of the French-language and Ukrainian-language texts of the Gospel of John was illustrated an example of the analysis of ancient literary text with the help of modern theories of information...

STRUCTURAL-COMPOSITIONAL AND NARRATIVE ORGANIZATION OF ENGLISH SCIENTIFIC ARTICLES DISCOURSE

Peculiar features of structural-compositional organization of English scientific articles are defined in the article. Their obligatory and facultative structural-compositional elements are also determined. Peculiar featu...

THE OBJECTIVATION OF INTELLECTUAL QUALITIES OF THE PERSON BY COMPARATIVE PHRASEOLOGICAL UNITS IN THE ENGLISH LANGUAGE WORLD VIEW

The article is written in the sphere of anthropocentric linguistics and is devoted to the verbalization of intellectual qualities of the person by the means of comparative phraseological units in the English Language Wor...

LEXICAL MEANS OF EXPRESSING MODALITIES OF MOTIVATION IN MODERN UKRAINIAN

The paper outlines in-text implementations and means of expression of expression of will. In particular, intonation as a phonetic means for detecting modular values, the statement plays a decisive role in the formation o...

Download PDF file
  • EP ID EP562563
  • DOI -
  • Views 73
  • Downloads 0

How To Cite

G. F. Dyachenko, S. L. Mykhailiuk, I. F. Duvanskaya (2018). STATISTICAL METHODS OF FORMATION OF TEXT CORPORA AND LEXICOGRAPHIC RESOURCES (ON THE BASIS OF THE SPECIALTY “ACOUSTICS AND ULTRASONIC”). Закарпатські філологічні студії, 6(), 158-161. https://europub.co.uk./articles/-A-562563