Refine your search
Collections
Co-Authors
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z All
Sahaya Rani Alex, John
- Selection of Features for Emotion Recognition from Speech
Abstract Views :149 |
PDF Views:0
Authors
Affiliations
1 School of Electronics Engineering, VIT University, Chennai - 632014, Tamil Nadu, IN
1 School of Electronics Engineering, VIT University, Chennai - 632014, Tamil Nadu, IN
Source
Indian Journal of Science and Technology, Vol 9, No 39 (2016), Pagination:Abstract
Background/Objective: Speech is one of the modes for Human Computer Interface (HCI). Speech contains message to convey as well as the speaker characteristics such as speaker identity and emotional state of the speaker. Recently, researchers are taking more interest in the emotional parameters of speech signals which helps to improve the functionality of HCI. This research focus on selecting features which helps to identify the emotion of the speaker. Methods/Statistical Analysis: Mel Frequency Cepstrum Coefficient (MFCC), Linear Prediction Cepstrum Coefficient (LPCC) and Perceptual Linear Predictive (PLP) methods are used to extract the features. Each emotion is modeled as one Hidden Markov Model (HMM) using Hidden Markov Tool Kit (HTK tool kit). The Beagle Bone Black (BBB) board is chosen for the implementation because of the form factor. Findings: The results indicate that MFCC features gives 100% accuracy for surprise emotion, PLP features gives 100% accuracy for anger emotion and LPCC features give 100% accuracy for fear emotion. Conclusion/Improvement: A hybrid feature extraction method should be devised to detect all emotions with 100% accuracy.Keywords
BBB, Emotion recognition, HCI, HMM, LPCC, MFCC.- Performance Analysis of SOFM based Reduced Complexity Feature Extraction Methods with back Propagation Neural Network for Multilingual Digit Recognition
Abstract Views :149 |
PDF Views:0
Authors
Affiliations
1 School of Electronics Engineering Department, VIT University, Chennai - 600 127, Tamil Nadu, IN
1 School of Electronics Engineering Department, VIT University, Chennai - 600 127, Tamil Nadu, IN
Source
Indian Journal of Science and Technology, Vol 8, No 19 (2015), Pagination:Abstract
Background: Speech recognition is an active area of research, used to transliterate words vocalized by individuals in order to make them machine recognizable. For real time speech recognition applications the response time, size of training data and recognition accuracy are the important aspects. Methods: A Hybrid speech recognition system is proposed on the basis on Artificial Neural Network (ANN) in this research. The Self Organising Feature Map (SOFM) is used to reduce the feature vector dimensions which are extracted using the Mel-Frequency Cepstrum Coefficients (MFCC), Perceptual Linear Predictive (PLP) and Discrete Wavelet Transform (DWT) methods. The Back Propagation Network (BPN) algorithm is used for training the Artificial Neural Network for pattern classification. Findings: The proposed method is tested with TIDIGITS data. Results indicate that despite of the large reduction in the feature vector dimensions the recognition accuracy obtained using SOFM technique is same as that of the recognition accuracy of the conventional methods. The response time is also fast and the data size of the input data is reduced considerably. The proposed hybrid system is further tested using multilingual isolated digit data.Keywords
Artificial Neural Network, Discrete Wavelet Transform, Feature Extraction, Mel Frequency Cepstrum Co-efficients, Perceptual Linear Predictive, Self-organising Feature Map, Speech Recognition- Low Complexity DWT Architecture Implementation for Feature Extraction using Different Multipliers
Abstract Views :137 |
PDF Views:0
Authors
Affiliations
1 School of Electronics Engineering Department, VIT University, Chennai Campus, Chennai - 600127, Tamil Nadu, IN
1 School of Electronics Engineering Department, VIT University, Chennai Campus, Chennai - 600127, Tamil Nadu, IN