A Hybrid Method for Automatic Speech Recognition Performance Improvement in Real World Noisy Environment

Urmila Shrawankar; Vilas Thakare

doi:10.3844/jcssp.2013.94.104

Research Article Open Access

Urmila Shrawankar¹ and Vilas Thakare²

¹ G H Raisoni College of Engineering, India
² SGB Amravati University, India

Abstract

It is well known that speech recognition systems perform well when they are used in conditions similar to those in which the acoustic models were trained, and that mismatches between the two degrade performance. In adverse real-world environments, the category of noise is very difficult to predict in advance, which makes environmental robustness hard to achieve. A rigorous experimental study shows that no single method both cleans speech corrupted by natural (mixed) environmental noise and preserves its quality. It also shows that back-end techniques alone are not sufficient to improve the performance of a speech recognition system: improvement techniques must be applied at every step of both the front-end and the back-end of the Automatic Speech Recognition (ASR) model. Current recognition systems address the mismatch problem with a technique called adaptation. This experimental study has two aims. The first is to implement a hybrid method that cleans the speech signal as much as possible using all combinations of filters and enhancement techniques. The second is to develop a method for training on all categories of noise, so that the acoustic models can be adapted to a new environment and recognizer performance improves under mismatched real-world conditions. The experiments confirm that hybrid adaptation methods improve ASR performance on both measures, Signal-to-Noise Ratio (SNR) improvement and word recognition accuracy, in real-world noisy environments.
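The two evaluation measures named in the abstract can be illustrated with a minimal sketch. The abstract does not give the authors' exact formulas, so the code below assumes the commonly used definitions: SNR in decibels as the ratio of clean-signal power to residual-noise power, and word accuracy as one minus the word-level edit distance divided by the number of reference words. The function names `snr_db` and `word_accuracy` are hypothetical and chosen only for this illustration.

```python
import math


def snr_db(clean, noisy):
    """SNR in dB, treating (noisy - clean) as the noise component.

    Assumed definition: 10 * log10(signal power / noise power).
    """
    signal_power = sum(s * s for s in clean)
    noise_power = sum((n - s) ** 2 for s, n in zip(clean, noisy))
    if signal_power == 0:
        raise ValueError("clean signal has zero power")
    if noise_power == 0:
        return math.inf  # no residual noise at all
    return 10.0 * math.log10(signal_power / noise_power)


def word_accuracy(reference, hypothesis):
    """Word accuracy = 1 - (word-level edit distance / reference length).

    Assumed definition; substitutions, deletions and insertions each cost 1.
    """
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference transcript is empty")
    # Levenshtein distance over words, computed one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution or match
        prev = curr
    return 1.0 - prev[-1] / len(ref)
```

Under these assumptions, SNR improvement is the difference between `snr_db` computed on the enhanced signal and on the unprocessed noisy signal, each measured against the same clean reference.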

Journal of Computer Science

Volume 9 No. 1, 2013, 94-104

DOI: https://doi.org/10.3844/jcssp.2013.94.104

Submitted On: 29 September 2012
Published On: 27 February 2013

How to Cite: Shrawankar, U. & Thakare, V. (2013). A Hybrid Method for Automatic Speech Recognition Performance Improvement in Real World Noisy Environment. Journal of Computer Science, 9(1), 94-104. https://doi.org/10.3844/jcssp.2013.94.104

Copyright: © 2013 Urmila Shrawankar and Vilas Thakare. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Keywords

Real World Noisy Environment
ASR Performance
Speech Processing Hybrid Methods
Environment Adaptation
ASR Back-end Techniques
ASR Front-end Techniques
SNR Improvement
Word Recognition Accuracy Improvement