Abstract
This paper describes a frequency-domain method for extracting the fundamental frequency of voiced speech that has been band-limited to the range 300 Hz to 3.4 kHz. The method uses a linear auditory model into which a non-linearity has been introduced, and two ways of introducing the non-linearity are described. Harmonic product spectra are derived from the outputs of both the linear and the non-linear auditory models. Results show that the spectrum derived from the output of the non-linear auditory model is superior to that obtained from the output of the linear model.
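
As a rough illustration of the harmonic product spectrum mentioned in the abstract, the following minimal sketch multiplies the magnitude spectrum by copies of itself sampled at 2, 3 and 4 times each candidate frequency and picks the strongest candidate. It is not the paper's implementation: there is no auditory model here, the function names, the 1 Hz bin spacing, the four-harmonic product and the 60 to 400 Hz search range are all assumptions, and the squaring step is only a generic stand-in for a non-linearity, not one of the two methods the paper describes.

```python
import numpy as np


def harmonic_product_spectrum(signal, sample_rate, num_harmonics=4, resolution_hz=1.0):
    """Return (freqs, hps) with hps[i] = product over k=1..K of |X(k * freqs[i])|.

    Hypothetical helper: the frame is truncated or zero-padded so that the
    FFT bins are spaced resolution_hz apart.
    """
    n_fft = int(round(sample_rate / resolution_hz))
    frame = np.asarray(signal, dtype=float)[:n_fft]
    windowed = frame * np.hanning(len(frame))
    magnitude = np.abs(np.fft.rfft(windowed, n=n_fft))

    # Keep only candidates whose highest harmonic still lies inside the spectrum.
    usable = len(magnitude) // num_harmonics
    hps = np.ones(usable)
    for k in range(1, num_harmonics + 1):
        # magnitude[::k][i] is the magnitude at k times the frequency of bin i.
        hps *= magnitude[::k][:usable]

    freqs = np.arange(usable) * sample_rate / n_fft
    return freqs, hps


def estimate_f0(signal, sample_rate, f_min=60.0, f_max=400.0):
    """Pick the candidate in [f_min, f_max] with the largest product (assumed range)."""
    freqs, hps = harmonic_product_spectrum(signal, sample_rate)
    in_range = (freqs >= f_min) & (freqs <= f_max)
    return float(freqs[in_range][np.argmax(hps[in_range])])


# Synthetic test tone (an assumption, not data from the paper): harmonics of
# 150 Hz between 300 Hz and 3.3 kHz, so the fundamental itself is absent.
sample_rate = 16000
t = np.arange(sample_rate // 2) / sample_rate
tone = sum(np.cos(2 * np.pi * 150 * h * t) for h in range(2, 23))

linear_estimate = estimate_f0(tone, sample_rate)
squared_estimate = estimate_f0(tone ** 2, sample_rate)  # squaring as a toy non-linearity
print(linear_estimate, squared_estimate)
```

On this synthetic tone the product taken from the unmodified signal should peak near 300 Hz, because the factor at 150 Hz is almost zero, whereas squaring creates difference-frequency components at 150 Hz and the estimate should move to the true fundamental. This is a toy case only and does not reproduce the paper's results.
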
| Original language | English |
|---|---|
| Pages | 449-452 |
| Number of pages | 4 |
| Publication status | Published - 1991 |
| Event | 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991 - Genova, Italy. Duration: 24 Sep 1991 → 26 Sep 1991 |
Conference
| Conference | 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991 |
|---|---|
| Country/Territory | Italy |
| City | Genova |
| Period | 24/09/91 → 26/09/91 |
Keywords
- Auditory modelling
- Pitch extraction
- Speech processing