STATISTICAL METHODS OF FORMATION OF TEXT CORPORA AND LEXICOGRAPHIC RESOURCES (ON THE BASIS OF THE SPECIALTY “ACOUSTICS AND ULTRASONIC”)

Abstract

The article describes the sequence of steps for forming text corpora, and then frequency dictionaries, using the example of the Acoustics and Ultrasonic Technique (AUST) specialty, whose texts belong to scientific and technical discourse. It demonstrates the need for real text corpora compiled with statistical methods in present-day research. Statistical methods make it possible to determine a mandatory parameter: the reliability of the text corpus and of the lexicographic resources built on it, such as frequency dictionaries and alphabetical frequency dictionaries. The AUST example shows how statistically verified characteristics of the text corpus made it possible to create a reliable probabilistic-statistical model (a frequency dictionary) of this subject area. The statistical reliability of the dictionary is shown by the fact that the units of the base dictionary (the first 2 thousand words) cover 86% of the AUST texts, which makes it possible to understand the content of almost any AUST text using the lexical units of the base dictionary.
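The coverage figure mentioned in the abstract can be illustrated with a minimal sketch. It assumes that coverage means the share of running words (tokens) in the corpus accounted for by the N most frequent dictionary entries; the abstract does not spell out the authors' exact procedure. The function names, the simple letter-based tokenizer, and the toy corpus are all hypothetical and are not taken from the article.

```python
from collections import Counter
import re


def tokenize(text):
    # Assumption: a token is any run of Unicode letters, lowercased.
    return re.findall(r"[^\W\d_]+", text.lower())


def frequency_dictionary(texts):
    # Count how often each word form occurs across all texts.
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return counts


def coverage(counts, base_size):
    # Share of all tokens covered by the base_size most frequent words.
    total = sum(counts.values())
    if total == 0:
        return 0.0
    covered = sum(freq for _, freq in counts.most_common(base_size))
    return covered / total


# Hypothetical toy corpus, not the AUST corpus from the article.
toy_corpus = [
    "ultrasonic waves propagate in the medium",
    "the medium attenuates ultrasonic waves",
]
counts = frequency_dictionary(toy_corpus)
toy_coverage = coverage(counts, base_size=3)
print(f"Coverage by the top 3 words: {toy_coverage:.1%}")
```

Applied to a real corpus with `base_size=2000`, the same calculation would yield a figure comparable in kind to the 86% reported above, provided the tokenization and counting rules match those used by the authors.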

Authors and Affiliations

G. F. Dyachenko, S. L. Mykhailiuk, I. F. Duvanskaya

How To Cite

G. F. Dyachenko, S. L. Mykhailiuk, I. F. Duvanskaya (2018). STATISTICAL METHODS OF FORMATION OF TEXT CORPORA AND LEXICOGRAPHIC RESOURCES (ON THE BASIS OF THE SPECIALTY “ACOUSTICS AND ULTRASONIC”). Закарпатські філологічні студії, 6(), 158-161. https://europub.co.uk./articles/-A-562563