UDC 519.766.48

THE QUESTION OF CHOOSING A VOCABULARY OF TRIGRAMS FOR THE AUTOMATIC IDENTIFICATION OF THE YAKUT LANGUAGE

Leontiev Nyurgun Anatolievich
M.K.Ammosov North-Eastern Federal University
PhD, Associate Professor

Abstract
This article compares the identification methods , such as by using a vocabulary method, by using bigram and by using newspaper corpus. The article describes the process of creating a dictionary for automatic language identifier text by using the trigrams on the materials of the Yakut language . The question of the adequacy of the dictionary and selection of trigrams to improve the accuracy of the language.

Category: 05.00.00 Technical sciences

Article reference:
The question of choosing a vocabulary of trigrams for the automatic identification of the Yakut language // Modern scientific researches and innovations. 2014. № 12. P. 1 [Electronic journal]. URL: https://web.snauka.ru/en/issues/2014/12/40443

View this article in Russian

Sorry, this article is only available in Русский.



Artice view count: Please wait

All articles of author «Леонтьев Ньургун Анатольевич»


© If you have found a violation of copyrights please notify us immediately by e-mail or feedback form.

Contact author (comments/reviews)

Write comment

You must authorise to write a comment.

Если Вы еще не зарегистрированы на сайте, то Вам необходимо зарегистрироваться:
  • Register