Robust distributed speech recognition in noise and packet loss conditions

Research output: Contribution to a Journal (Peer & Non Peer)Articlepeer-review

5 Citations (Scopus)

Abstract

This paper examines the performance of a Distributed Speech Recognition (DSR) system in the presence of both background noise and packet loss. Recognition performance is examined for feature vectors extracted from speech using a physiologically-based auditory model, as an alternative to the more commonly-used Mel Frequency Cepstral Coefficient (MFCC) front-end. The feature vectors produced by the auditory model are vector quantised and combined in pairs for transmission over a statistically modelled channel that is subject to packet burst loss. In order to improve recognition performance in the presence of noise, the speech is enhanced prior to feature extraction using Wiener filtering. Packet loss mitigation to compensate for missing features is also used to further improve performance. Speech recognition results show the benefit of combining speech enhancement and packet loss mitigation to compensate for channel and environmental degradations. (C) 2010 Elsevier Inc. All rights reserved.
Original languageEnglish (Ireland)
Pages (from-to)1559-1571
Number of pages13
JournalDigital Signal Processing
Volume20
Issue number6
DOIs
Publication statusPublished - 1 Dec 2010

Keywords

  • Auditory front-end
  • Packet loss
  • Robust distributed speech recognition
  • Speech enhancement

Authors (Note for portal: view the doc link for the full list of authors)

  • Authors
  • Flynn, R,Jones, E
  • R. Flynn and E. Jones

Fingerprint

Dive into the research topics of 'Robust distributed speech recognition in noise and packet loss conditions'. Together they form a unique fingerprint.

Cite this