Use este identificador para citar ou linkar para este item: http://repositorio.ufc.br/handle/riufc/73415
Registro completo de metadados
Campo DCValorIdioma
dc.contributor.authorRibeiro, Fábio Cisne-
dc.contributor.authorCarvalho, Raphael Torres Santos-
dc.contributor.authorCortez, Paulo César-
dc.contributor.authorAlbuquerque, Victor Hugo Costa de-
dc.contributor.authorRebouças Filho, Pedro Pedrosa-
dc.date.accessioned2023-07-10T14:20:00Z-
dc.date.available2023-07-10T14:20:00Z-
dc.date.issued2018-
dc.identifier.citationRIBEIRO, Fábio Cisne; CARVALHO, Raphael Torres Santos; CORTEZ, Paulo César; ALBUQUERQUE, Victor Hugo Costa de; REBOUÇAS FILHO, Pedro Pedrosa. Binary neural networks for classification of voice commands from throat microphone. IEEE Access, [s.l.], v. 6, p. 70130 - 70144, 2018.pt_BR
dc.identifier.issn2169-3536-
dc.identifier.otherDOI: https://doi.org/10.1109/ACCESS.2018.2881199-
dc.identifier.urihttp://www.repositorio.ufc.br/handle/riufc/73415-
dc.description.abstractMulti-class pattern classification has many applications including speech recognition, and it is not easy to extend from two-class neural networks (NNs). This paper presents a study about using binary classifiers with NNs together with a perceptual linear prediction (PLP) method for feature extraction to increase the classification rate of voice commands captured using a throat microphone, comparing this method with a single NN. Because there is no other data set with voice commands captured using a throat microphone in the Brazilian Portuguese language in researched literature, we created a data set with isolated voice commands with utterances captured from 150 people (men and women). All the voice samples are captured in Brazilian Portuguese, and they are the digits “0”through “9”and the words “Ok”and “Cancel”. The results show that the throat microphone is robust in noise environment, achieving 95.4% of hit rate in our speech recognition system with multiple NNs using the one-against-all approach, better performance than a simple NN that reach 91.88%. This result is very representative, since both classifiers obtained high hit rates. But, it requires 535% more time for training the multiple NNs compared with simple NN. The best configuration on PLP extraction order is 9 or 10 for voice samples captured by the throat microphone, which was observed that poor stressed vowel and fricative-like words “3” and “7”in Portuguese confuses the classifier.pt_BR
dc.language.isoenpt_BR
dc.publisherIEEE Accesspt_BR
dc.rightsAcesso Abertopt_BR
dc.subjectMulti-class pattern recognitionpt_BR
dc.subjectSpeech recognitionpt_BR
dc.subjectNeural networkspt_BR
dc.subjectBinary classifierspt_BR
dc.subjectReconhecimento de padrão multiclassept_BR
dc.subjectReconhecimento de falapt_BR
dc.subjectRedes neuraispt_BR
dc.subjectClassificadores bináriospt_BR
dc.titleBinary neural networks for classification of voice commands from throat microphonept_BR
dc.typeArtigo de Periódicopt_BR
Aparece nas coleções:DEEL - Artigos publicados em revista científica

Arquivos associados a este item:
Arquivo Descrição TamanhoFormato 
2018_art_fcribeiro.pdf14,32 MBAdobe PDFVisualizar/Abrir


Os itens no repositório estão protegidos por copyright, com todos os direitos reservados, salvo quando é indicado o contrário.