RESOURCE VOLUME OPTIMIZATION OF LANGUAGE IDENTIFIERS FOR THE ACCOUNT OF METHODS AND IDENTIFICATION ALGORITHMS
Journal Title: Современные информационные технологии и ИТ-образование - Year 2017, Vol 13, Issue 3
Abstract
This article is a continuation of the author's series of publications on the topic of text’s language identification. Here is considered the possibility of optimizing the resource capacity of programs and systems for language identification of information blocks by modifying the identification and resultant algorithms, as well as the selection of language identification methods. This allows significantly increase their efficiency and to calculate the requirements for resources at the design stage of such software solutions, which significantly reduces the time of their development and debugging. Also given are: classification of language identification methods, their comparative table, block diagrams and gradations of the corresponding algorithms. The work will be of interest to specialists in the field of computer linguistics and developers of automated complexes for processing unstructured data, such as: global monitoring systems, information retrieval systems, literature cataloguer, automatic document abstracting systems, automated text translators, etc.
Authors and Affiliations
Sergey Kalegin
INTERNET TECHNOLOGIES FOR TRAFFIC AND PEDESTRIAN DATA COLLECTION
The article shows the technology of obtaining initial data on the state and functioning of the city transport system. The technology makes it possible to obtain data on the intensity of any movement: intensity of traffic...
CONCEPT OF THE IMPROVED ARCHITECTURE OF VIRTUAL COMPUTER LABORATORY FOR EFFECTIVE TRAINING OF SPECIALISTS SKILLED IN DISTRIBUTED INFORMATION SYSTEMS AND DESIGN TOOLS
The article discusses the advanced architecture of the virtual computer laboratory, which is used in the innovative practice of training specialists in distributed information systems, as well as software developers skil...
A NOVEL APPROACH FOR BOOSTING PERFORMANCE OF JAVASCRIPT ENGINE FOR WEB APPLICATIONS
JavaScript is the most widespread language for Web programming. And, literally, it is vital for Web 2.0. With the development of Web 2.0, JavaScript engines experience increasingly large performance-related challenges. T...
THEORETICALLY UNBREAKABLE CIPHERS AS THEY SHOULD BE UNDERSTOOD
Perfectly-secret ciphers according to the Claude Shannon's theory, which are considered as unbreakable, and more specifically random keystream ciphers, are discussed. An analysis of the sources mentioned in the reference...
PLAYING WITH A CHAIN OR PHYSICAL AND MATHEMATICAL INFORMATICS
The article describes an educational laboratory work within the framework of interdisciplinary connections at the intersection of informatics, mathematics and physics: the study of the sagging of a closed chain with diff...