Neural response based phoneme classification under noisy condition

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Citations (Scopus)

Abstract

Human listeners are capable of recognizing speech in noisy environment, while most of the traditional speech recognition methods do not perform well in the presence of noise. Unlike traditional Mel-frequency cepstral coefficient (MFCC)-based method, this study proposes a phoneme classification technique using the neural responses of a physiologically-based computational model of the auditory periphery. Neurograms were constructed from the responses of the model auditory nerve to speech phonemes. The features of neurograms were used to train the recognition system using a Gaussian Mixture Model (GMM) classification technique. Performance was evaluated for different types of phonemes such as stops, fricatives and vowels from the TIMIT database for both under quiet and noisy conditions. Although performance of the proposed method is comparable with that of MFCC-based classifier in quiet condition, the neural response-based proposed method outperforms the traditional MFCC-based method under noisy conditions even with the use of less number of features in the proposed method. The proposed method could be used in the field of speech recognition such as speech to text application, especially under noisy conditions.

Original languageEnglish
Title of host publication2014 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages175-179
Number of pages5
ISBN (Electronic)9781479961207
DOIs
Publication statusPublished - 27 Jan 2014
Externally publishedYes
Event2014 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2014 - Kuching, Sarawak, Malaysia
Duration: 1 Dec 20144 Dec 2014

Publication series

Name2014 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2014

Conference

Conference2014 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2014
Country/TerritoryMalaysia
CityKuching, Sarawak
Period1/12/144/12/14

Keywords

  • GMM
  • MFCC
  • auditory nerve model
  • neurogram
  • phoneme classification

Fingerprint

Dive into the research topics of 'Neural response based phoneme classification under noisy condition'. Together they form a unique fingerprint.

Cite this