Open Access Open Access  Restricted Access Subscription or Fee Access

Perceptual Features Based Continuous Speech Recognition in Additive Noise Environment Using Various Modelling Techniques

Revathi Arunachalam, Y. Venkataraman

Abstract


 

The main objective of this paper is to discuss the effectiveness of Mel frequency perceptual features and the noise reduction technique in evaluating the performance of multi speaker independent continuous speech recognition system in additive noise environment by using various modelling techniques. The proposed perceptual features are captured and trained using clustering technique, GMM, continuous density HMM and back propagation neural networks. Speech recognition system is evaluated on clean and noisy test speeches by using minimum distance criterion, maximum log likelihood criterion and minimum mean squared error criterion for these speech modelling techniques. Performance of these features is tested on speeches randomly chosen from “TIMIT” speech corpus. This algorithm provides 98.3, 90.8, 99.3 and 97.3% as accuracy for clean speech recognition system evaluated on GMM models and continuous density HMM models, Clustering models and neural network models respectively. Evaluation is done for 300 test speeches. System is also tested on noisy test speech by considering RLS and combination of RLS & wavelets as additional pre- processing techniques and the performance is found to be better than the testing without additional pre- processing. Noises such as babble, M109, white, destroyer engine, paper rustle, F16, factory and destroyerops are considered in this work and these are added to the test speeches at various levels and performance of the system is evaluated for 300 test speeches. Among the various modelling techniques, clustering technique and continuous density HMM technique are performing better for clean speech recognition and noisy speech recognition respectively. GMM and continuous density HMM along with additional pre-processing technique are performing better for noisy speech recognition even if the noise energy is greater than the signal energy.

 

Keywords: Hidden Markov model (HMM), Neural network, Continuous density HMM (CHMM), Gaussian mixture model (GMM), Speech recognition, Vector quantization (VQ), Mel frequency perceptual linear predictive cepstrum (MFPLPC), Noise, Wavelet transform, Recursive least s

 


Keywords


Hidden Markov model (HMM), Neural network, Continuous density HMM (CHMM), Gaussian mixture model (GMM), Speech recognition, Vector quantization (VQ), Mel frequency perceptual linear predictive cepstrum (MFPLPC), Noise, Wavelet transform, Recursive least s

Full Text:

PDF

Refbacks

  • There are currently no refbacks.