arXiv Analytics

Sign in

arXiv:1703.08002 [cs.CL]AbstractReferencesReviewsResources

A network of deep neural networks for distant speech recognition

Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

Published 2017-03-23Version 1

Despite the remarkable progress recently made in distant speech recognition, state-of-the-art technology still suffers from a lack of robustness, especially when adverse acoustic conditions characterized by non-stationary noises and reverberation are met. A prominent limitation of current systems lies in the lack of matching and communication between the various technologies involved in the distant speech recognition process. The speech enhancement and speech recognition modules are, for instance, often trained independently. Moreover, the speech enhancement normally helps the speech recognizer, but the output of the latter is not commonly used, in turn, to improve the speech enhancement. To address both concerns, we propose a novel architecture based on a network of deep neural networks, where all the components are jointly trained and better cooperate with each other thanks to a full communication scheme between them. Experiments, conducted using different datasets, tasks and acoustic conditions, revealed that the proposed framework can overtake other competitive solutions, including recent joint training approaches.

Related articles: Most relevant | Search more
arXiv:2008.07267 [cs.CL] (Published 2020-08-17)
A Survey of Active Learning for Text Classification using Deep Neural Networks
arXiv:2010.08346 [cs.CL] (Published 2020-10-16)
From Talk to Action with Accountability: Monitoring the Public Discussion of Finnish Decision-Makers with Deep Neural Networks and Topic Modelling
arXiv:1803.08312 [cs.CL] (Published 2018-03-22, updated 2018-07-25)
Learning Eligibility in Cancer Clinical Trials using Deep Neural Networks