Open Access Open Access  Restricted Access Subscription Access

Performance Analysis of SOFM based Reduced Complexity Feature Extraction Methods with back Propagation Neural Network for Multilingual Digit Recognition


Affiliations
1 School of Electronics Engineering Department, VIT University, Chennai - 600 127, Tamil Nadu, India
 

Background: Speech recognition is an active area of research, used to transliterate words vocalized by individuals in order to make them machine recognizable. For real time speech recognition applications the response time, size of training data and recognition accuracy are the important aspects. Methods: A Hybrid speech recognition system is proposed on the basis on Artificial Neural Network (ANN) in this research. The Self Organising Feature Map (SOFM) is used to reduce the feature vector dimensions which are extracted using the Mel-Frequency Cepstrum Coefficients (MFCC), Perceptual Linear Predictive (PLP) and Discrete Wavelet Transform (DWT) methods. The Back Propagation Network (BPN) algorithm is used for training the Artificial Neural Network for pattern classification. Findings: The proposed method is tested with TIDIGITS data. Results indicate that despite of the large reduction in the feature vector dimensions the recognition accuracy obtained using SOFM technique is same as that of the recognition accuracy of the conventional methods. The response time is also fast and the data size of the input data is reduced considerably. The proposed hybrid system is further tested using multilingual isolated digit data.

Keywords

Artificial Neural Network, Discrete Wavelet Transform, Feature Extraction, Mel Frequency Cepstrum Co-efficients, Perceptual Linear Predictive, Self-organising Feature Map, Speech Recognition
User

Abstract Views: 145

PDF Views: 0




  • Performance Analysis of SOFM based Reduced Complexity Feature Extraction Methods with back Propagation Neural Network for Multilingual Digit Recognition

Abstract Views: 145  |  PDF Views: 0

Authors

John Sahaya Rani Alex
School of Electronics Engineering Department, VIT University, Chennai - 600 127, Tamil Nadu, India
Ajinkya Sunil Mukhedkar
School of Electronics Engineering Department, VIT University, Chennai - 600 127, Tamil Nadu, India
Nithya Venkatesan
School of Electronics Engineering Department, VIT University, Chennai - 600 127, Tamil Nadu, India

Abstract


Background: Speech recognition is an active area of research, used to transliterate words vocalized by individuals in order to make them machine recognizable. For real time speech recognition applications the response time, size of training data and recognition accuracy are the important aspects. Methods: A Hybrid speech recognition system is proposed on the basis on Artificial Neural Network (ANN) in this research. The Self Organising Feature Map (SOFM) is used to reduce the feature vector dimensions which are extracted using the Mel-Frequency Cepstrum Coefficients (MFCC), Perceptual Linear Predictive (PLP) and Discrete Wavelet Transform (DWT) methods. The Back Propagation Network (BPN) algorithm is used for training the Artificial Neural Network for pattern classification. Findings: The proposed method is tested with TIDIGITS data. Results indicate that despite of the large reduction in the feature vector dimensions the recognition accuracy obtained using SOFM technique is same as that of the recognition accuracy of the conventional methods. The response time is also fast and the data size of the input data is reduced considerably. The proposed hybrid system is further tested using multilingual isolated digit data.

Keywords


Artificial Neural Network, Discrete Wavelet Transform, Feature Extraction, Mel Frequency Cepstrum Co-efficients, Perceptual Linear Predictive, Self-organising Feature Map, Speech Recognition



DOI: https://doi.org/10.17485/ijst%2F2015%2Fv8i19%2F138166