TY - GEN
T1 - The Role of ECG and Respiration in Predicting Quality of Experience
AU - Vijayakumar, Sowmya
AU - Flynn, Ronan
AU - Corcoran, Peter
AU - Murray, Niall
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The field of user quality of experience (QoE) in multimedia communications has become increasingly important due to the widespread use of digital technology in our everyday lives. The ability to accurately predict user QoE by processing physiological signals has significant applications. This study proposes a machine learning (ML) approach for predicting user QoE using physiological signals. It focuses on perceived overall quality and perceived audio quality by processing electrocardiogram (ECG) and respiration signals from the SoPMD Dataset 2. The study evaluated various ML models on individual and fused modalities while implementing dimensionality reduction, feature selection, and hyperparameter tuning. The SHapley Additive exPlanations (SHAP) method was used to interpret the ML models' outputs, which helped identify the features providing the most utility. The results show that the random forest model provided the best performance, with an F1-score of 87.91% for fusion data and 80.49% for ECG data in classifying perceived audio quality and overall quality, respectively. These results imply that physiological signals, such as ECG and respiration, hold great potential for predicting user QoE in multimedia experiences.
AB - The field of user quality of experience (QoE) in multimedia communications has become increasingly important due to the widespread use of digital technology in our everyday lives. The ability to accurately predict user QoE by processing physiological signals has significant applications. This study proposes a machine learning (ML) approach for predicting user QoE using physiological signals. It focuses on perceived overall quality and perceived audio quality by processing electrocardiogram (ECG) and respiration signals from the SoPMD Dataset 2. The study evaluated various ML models on individual and fused modalities while implementing dimensionality reduction, feature selection, and hyperparameter tuning. The SHapley Additive exPlanations (SHAP) method was used to interpret the ML models' outputs, which helped identify the features providing the most utility. The results show that the random forest model provided the best performance, with an F1-score of 87.91% for fusion data and 80.49% for ECG data in classifying perceived audio quality and overall quality, respectively. These results imply that physiological signals, such as ECG and respiration, hold great potential for predicting user QoE in multimedia experiences.
KW - ECG
KW - explainable AI
KW - machine learning
KW - physiological signals
KW - QoE
KW - respiration
KW - SHAP
UR - https://www.scopus.com/pages/publications/85201056868
U2 - 10.1109/QoMEX61742.2024.10598275
DO - 10.1109/QoMEX61742.2024.10598275
M3 - Conference Publication
AN - SCOPUS:85201056868
T3 - 2024 16th International Conference on Quality of Multimedia Experience, QoMEX 2024
SP - 118
EP - 124
BT - 2024 16th International Conference on Quality of Multimedia Experience, QoMEX 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th International Conference on Quality of Multimedia Experience, QoMEX 2024
Y2 - 18 June 2024 through 20 June 2024
ER -