Abstract
This paper describes pitch extraction from telephone-bandwidth speech using a combined place-temporal approach. The method uses a nonlinear auditory model that regenerates the harmonics below 300 Hz removed from the speech by the band-limiting effect of a telephone channel. The spectral representation produced by the nonlinear auditory model is then processed with the Harmonic Product Spectrum to give an initial estimate of the fundamental frequency from place information alone. This initial estimate is refined by temporal analysis of the detailed time response of a single section of the auditory model.
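
As a rough illustration of the place-only stage, the Harmonic Product Spectrum can be sketched in Python with NumPy. This is a minimal sketch under stated assumptions, not the authors' implementation: it substitutes a plain FFT magnitude spectrum for the output of the nonlinear auditory model (so it does not regenerate missing low harmonics), and the function name `hps_pitch`, its parameters, and the default search range are hypothetical choices made for the example.

```python
import numpy as np

def hps_pitch(signal, sample_rate, num_harmonics=4, f_min=50.0, f_max=400.0):
    """Estimate the fundamental frequency (Hz) of one frame via the
    Harmonic Product Spectrum.

    Assumption: an FFT magnitude spectrum stands in for the auditory-model
    spectral representation described in the abstract.
    """
    # Window the frame and zero-pad for a finer frequency grid.
    windowed = signal * np.hanning(len(signal))
    n_fft = 4 * len(signal)
    spectrum = np.abs(np.fft.rfft(windowed, n=n_fft))

    # Only bins whose highest considered harmonic still lies in the spectrum.
    limit = len(spectrum) // num_harmonics
    hps = spectrum[:limit].copy()

    # Multiply by the spectrum compressed by factors 2..num_harmonics, so
    # that energy at f, 2f, 3f, ... reinforces the product at bin f.
    for h in range(2, num_harmonics + 1):
        hps *= spectrum[::h][:limit]

    # Restrict the peak search to a plausible pitch range (assumed values).
    freqs = np.arange(limit) * sample_rate / n_fft
    band = (freqs >= f_min) & (freqs <= f_max)
    return freqs[band][np.argmax(hps[band])]

# Usage on a synthetic harmonic tone sampled at 8 kHz.
t = np.arange(2048) / 8000.0
tone = sum(np.sin(2 * np.pi * 120.0 * k * t) / k for k in range(1, 7))
print(hps_pitch(tone, 8000.0))
```

The estimate is quantized to the FFT bin spacing, which is one reason a second, temporal refinement stage such as the one described in the abstract can be useful.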
Original language | English |
---|---|
Pages | 123-126 |
Number of pages | 4 |
Publication status | Published - 1995 |
Event | 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995 - Madrid, Spain. Duration: 18 Sep 1995 → 21 Sep 1995 |
Conference
Conference | 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995 |
---|---|
Country/Territory | Spain |
City | Madrid |
Period | 18/09/95 → 21/09/95 |