Speech Recognition of Isolated Words in Noisy Conditions Using Radon Transform and Discrete Cosine Transform Based Features Derived from Speech Spectrogram

Yogesh S. Angal; Pawan K. Ajmera; Raghunath S. Holambe; Rajan H. Chile

Speech Recognition of Isolated Words in Noisy Conditions Using Radon Transform and Discrete Cosine Transform Based Features Derived from Speech Spectrogram

Yogesh S. Angal ¹, Pawan K. Ajmera ², Raghunath S. Holambe ³, Rajan H. Chile ⁴

Affiliations
1 Department of Instrumentation Engineering, Padmashree Dr.D.Y. Patil Institute of Engineering and Technology, Pimpri, Pune-18, Maharashtra, India
2 S.G.G.S. Institute of Engineering and Technolgy, Nanded. Maharashtra State (M.S.), India
3 Department of Instrumentation Engineering at S.G.G.S. Institute of Engineering and Technology, Nanded Maharashtra State (M.S.), India
4 Department of Instrumentation Engineering at S.G.G.S. Institute of Engineering and Technology, Nanded Maharashtra State (M.S.) 431 607, India

This paper presents a new feature extraction technique for speech recognition using Radon Transform (RT) and Discrete Cosine Transform (DCT). A spectrogram is a time varying spectrum(forming an image) that shows how the spectral density of a signal varies with time. In the proposed scheme speech specific features have been extracted by applying image processing technique to the patterns available in the spectrogram. Radon transform has been used to derive the effective acoustic features from speech spectrogram. The proposed technique computes radon projections for nine orientations and captures the acoustic characteristics of the speech spectrogram. DCT applied on Radon projections yields low dimensional feature vectors. The technique is computationally efficient, speaker-independent, robust to session variations and insensitive to additive noise. Radon projections for seven orientations capture the acoustic characteristics of the spectrogram. The performance of the proposed algorithm has been evaluated in presence of additive white Gaussian noise from (30dB to -5dB SNR) on Texas Instruments-46(TI-46) speech database. The performance of the proposed technique in noisy environment is much better than existing popular algorithms.

Keywords

Speech Recognition, Spectrogram, Feature Extraction, Radon Transform, Discrete Cosine Transform.

I-Scholar

Journal Help

User

Subscription Login to verify subscription

Notifications

Journal Content
Browse

Font Size

Information

Abstract Views: 202

PDF Views: 3

Speech Recognition of Isolated Words in Noisy Conditions Using Radon Transform and Discrete Cosine Transform Based Features Derived from Speech Spectrogram

Abstract Views: 202 | PDF Views: 3

Authors

Yogesh S. Angal
Department of Instrumentation Engineering, Padmashree Dr.D.Y. Patil Institute of Engineering and Technology, Pimpri, Pune-18, Maharashtra, India

Pawan K. Ajmera
S.G.G.S. Institute of Engineering and Technolgy, Nanded. Maharashtra State (M.S.), India

Raghunath S. Holambe
Department of Instrumentation Engineering at S.G.G.S. Institute of Engineering and Technology, Nanded Maharashtra State (M.S.), India

Rajan H. Chile
Department of Instrumentation Engineering at S.G.G.S. Institute of Engineering and Technology, Nanded Maharashtra State (M.S.) 431 607, India

Abstract

Keywords

Speech Recognition, Spectrogram, Feature Extraction, Radon Transform, Discrete Cosine Transform.

Username
Password
Remember me

Username
Password
Remember me

Digital Signal Processing

Digital Signal Processing

Speech Recognition of Isolated Words in Noisy Conditions Using Radon Transform and Discrete Cosine Transform Based Features Derived from Speech Spectrogram

Subscribe/Renew Journal

Keywords

Speech Recognition of Isolated Words in Noisy Conditions Using Radon Transform and Discrete Cosine Transform Based Features Derived from Speech Spectrogram

Authors

Abstract

Keywords