Abstract
In coping with best-effort service, many VoIP applications employ adaptive playout strategies. Objective methods of speech quality assessment such as the ITU-T Recommendation P.862 (also known as Perceptual Evaluation of Speech Quality PESQ) typically do not capture distortion due to playout adjustments as they match up short segments prior to analysis. Similarly, the ITU-T E-Model does not capture the effect of delay variation and uses an average delay figure in its calculations. In this paper we explore in some detail, the extent of playout adjustments within VoIP applications and assess the likely impact on Mean Opinion Score MOS. We review the impact of various factors such as Voice Activity Detection (VAD) settings and hangover thresholds on talkspurt/silence period distribution. In this context we examine the distribution of playout adjustments resulting from various playout algorithms and assess the likely impact on MOS. We show that our hybrid playout strategy which utilises synchronised time to implement aninformed fixed delay playout strategy wherever possible will significantly reduce playout adjustments and any consequent MOS degradation.
Original language | English |
---|---|
Publication status | Published - 2005 |
Event | Measurement of Speech and Audio Quality in Networks, MESAQIN 2005 - Prague, Czech Republic Duration: 9 Jun 2005 → 10 Jun 2005 |
Conference
Conference | Measurement of Speech and Audio Quality in Networks, MESAQIN 2005 |
---|---|
Country/Territory | Czech Republic |
City | Prague |
Period | 9/06/05 → 10/06/05 |
Keywords
- MOS
- Playout Adjustments
- Synchronised Time
- Talkspurt/silence Period Distribution