Метод классификации вокальных звуков речи на основе саундлетной байесовской нейронной сети

dc.contributor.authorФедоров, Е.Е.
dc.date.accessioned2017-01-25T19:17:17Z
dc.date.available2017-01-25T19:17:17Z
dc.date.issued2015
dc.description.abstractПредложен метод классификации вокальных звуков речи, который базируется на авторской саундлетной байесовской нейронной сети и позволяет учитывать структуру квазипериодического сигнала и сопоставлять образцы вокальных звуков речи разной длины. Разработаны методы создания образцов, формирования опорных образцов и модель их классификации.uk_UA
dc.description.abstractЗапропоновано метод класифікації вокальних звуків мовлення, який базується на авторській саундлетній байєсівській нейронній мережі та дозволяє враховувати структуру квазіперіодичного сигналу і зіставляти зразки вокальних звуків мовлення різної довжини. Розроблено методи створення зразків, формування опорних зразків і модель їх класифікації.uk_UA
dc.description.abstractThe urgent task of developing a software component is a human speech recognition, using the intelligent computer systems. The basis of this problem is the construction of the effective methods, providing the high speed of the learning pattern recognition models as well as high probability, the adequacy and speed of speech signals recognition. The existing speech recognition system images using the following approaches: logical, metric, Bayesian, artificial neural network, structural. The existing methods and models is usually based on hidden Markov models, dynamic programming algorithm DTW. The artificial neural networks have the following disadvantages: while learning a few months; the retention of the large amount of reference patterns (sounds and words), as well as weighting coefficients; big time recognition; probability of detection is less than 95 %; the presence of hundreds of thousands of training patterns. To remedy these shortcomings, this article describes a method for the classification of vocal speech sounds on reference patterns on the basis of sound let. The work improves the approach to detection of the vocal sounds, which allows to generalize the single sound patterns of different lengths and different swing amplitudes, which increases the efficiency of the classification of vocal speech sounds. The author introduces the notion of a vocal sound sample and the method of its creation. Further development of the generating the plurality of the reference patterns method, which is characterized based on the soundlet and soundlet mappings collections, which increases the efficiency of the procedure of generating the reference patterns. On the basis of the sound let and sound let mappings collections, the method of the vocal sounds classification is improved, based on sound let Bayesian neural network (SBNN). The proposed model SBNN has the following characteristics: the neurons of the input layer correspond to the components of the vector that describes the test pattern; the neurons of the first (hidden) layer correspond to the reference patterns; the neurons of the second layer correspond to the sounds; adaptation to the voice characteristics of the particular operator is carried out by adding to the model vectors of the reference patterns; each neuron of the first (hidden) layer processes information based on normalized distances between the reference sound pattern and a test pattern of the sound; the weight of connections between neurons in the first (hidden) and second (output) layer is equal to 1 or 0 for these balances do not require the procedure of training; aggregation of outputs of neurons in the first (hidden) layer is performed on the basis of the maximum; in the second (output) layer are calculated posterior probabilities by Bayes formula, which allows to determine the probability that a test pattern vocal sound. The numerical studies are conducted on the vocal sounds of the TIMIT database. Were use such artificial neural networks like MLP, RBFNN, GRNN, PNN, RMLP and author SBNN. The study allows to conclude that the author's method provides the highest probability of classification. Algorithms can be used to solve the problems associated with the speech recognition in information systems, analysis of the vibration signal in intelligent systems technical diagnostics, the speaker identification in security systems and for the phonoscope examination.uk_UA
dc.identifier.citationМетод классификации вокальных звуков речи на основе саундлетной байесовской нейронной сети / Е.Е. Федоров // Управляющие системы и машины. — 2015. — № 6. — С. 78-83. — Бібліогр.: 8 назв. — рос.uk_UA
dc.identifier.issn0130-5395
dc.identifier.udc004.934
dc.identifier.urihttps://nasplib.isofts.kiev.ua/handle/123456789/112654
dc.language.isoruuk_UA
dc.publisherМіжнародний науково-навчальний центр інформаційних технологій і систем НАН та МОН Україниuk_UA
dc.relation.ispartofУправляющие системы и машины
dc.statuspublished earlieruk_UA
dc.subjectИнформационные речевые технологииuk_UA
dc.titleМетод классификации вокальных звуков речи на основе саундлетной байесовской нейронной сетиuk_UA
dc.title.alternativeМетод класифікації вокальних звуків мови на основі саундлетної байєсівської нейронної мережіuk_UA
dc.title.alternativeThe Classification Method of Vocal Speech Sounds on the Basis of Soundlet Bayesian Neural Networkuk_UA
dc.typeArticleuk_UA

Файли

Оригінальний контейнер

Зараз показуємо 1 - 1 з 1
Завантаження...
Ескіз
Назва:
12-Fedorov.pdf
Розмір:
299.1 KB
Формат:
Adobe Portable Document Format

Контейнер ліцензії

Зараз показуємо 1 - 1 з 1
Завантаження...
Ескіз
Назва:
license.txt
Розмір:
817 B
Формат:
Item-specific license agreed upon to submission
Опис: