Open Access Open Access  Restricted Access Subscription Access

A Voice Identification System using Hidden Markov Model


Affiliations
1 SITE, VIT University, Vellore - 632014, Tamil Nadu, India
2 Department of Computer Science, Yarmouk University, Irbid – 21163, Jordan
 

Background/Objectives: Voice Identification System refers to a system which comprises of hardware, software and it is used to identify voice for several applications. The aim of the research is to develop a small scale system that incorporate both speaker recognition and speech recognition and can show specific visual information to a user. Methods: To this end, we have developed a system based on the technique of Hidden Markov Model. The Hidden Markov Model is a stochastic approach which models the algorithm as a double stochastic process in which the observed data is thought to be the result of having passed a hidden process through second process. Both processes are characterized only through one that is observed. A database of voice information is created. To extract features from voice signals, Mel-Frequency Cepstral Coefficients (MFCC) technique has been applied producing a set of feature vectors. Subsequently, the system uses The Vector Quantization (VQ) for features training and classification. Findings: The designed system has been tested with multiple speakers as reference. Speech recognition based on Hidden Markov Model is achieved successfully for the conversion of speech to text. In this proposed research, speech recognition is achieved with accuracy about 90%. Applications: The system has potential to be used in music industry, crime investigation, personal assistant and in hi-tech devices.

Keywords

Hidden Markov Model, Mel-Frequency Cepstrum Coefficients (MFCC), Speech Recognition, Vector Quantization, Voice Identification
User

Abstract Views: 192

PDF Views: 0




  • A Voice Identification System using Hidden Markov Model

Abstract Views: 192  |  PDF Views: 0

Authors

T. K. Das
SITE, VIT University, Vellore - 632014, Tamil Nadu, India
Khalid M. O. Nahar
Department of Computer Science, Yarmouk University, Irbid – 21163, Jordan

Abstract


Background/Objectives: Voice Identification System refers to a system which comprises of hardware, software and it is used to identify voice for several applications. The aim of the research is to develop a small scale system that incorporate both speaker recognition and speech recognition and can show specific visual information to a user. Methods: To this end, we have developed a system based on the technique of Hidden Markov Model. The Hidden Markov Model is a stochastic approach which models the algorithm as a double stochastic process in which the observed data is thought to be the result of having passed a hidden process through second process. Both processes are characterized only through one that is observed. A database of voice information is created. To extract features from voice signals, Mel-Frequency Cepstral Coefficients (MFCC) technique has been applied producing a set of feature vectors. Subsequently, the system uses The Vector Quantization (VQ) for features training and classification. Findings: The designed system has been tested with multiple speakers as reference. Speech recognition based on Hidden Markov Model is achieved successfully for the conversion of speech to text. In this proposed research, speech recognition is achieved with accuracy about 90%. Applications: The system has potential to be used in music industry, crime investigation, personal assistant and in hi-tech devices.

Keywords


Hidden Markov Model, Mel-Frequency Cepstrum Coefficients (MFCC), Speech Recognition, Vector Quantization, Voice Identification



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i4%2F130368