Stefano Matta

Facial and Speech-based Signal Processing Systems for Quality of Experience and Emotion Estimation of Multimedia Applications

Porcu, Simone; Floris, Alessandro
2025-01-01

Abstract

Assessing the Quality of Experience (QoE) of multimedia services is crucial to ensure end users are satisfied. However, traditional feedback collection may suffer from bias due to the rating scale and may neglect the dynamic subjective nature of human perception, influenced by the user’s emotional state. This paper proposes an alternative solution to unobtrusively estimate QoE relying on features related to the user’s facial expressions and speech characteristics, which naturally reflect the user’s emotional state and enable QoE estimation without explicit feedback. The presented solution includes several steps, from the data collection process to the extraction and selection of the facial and speech features used to train the resulting QoE estimation models. Both single-modal and multi-modal learning approaches based on data fusion have been considered. These models can support the management of network and application resources of traditional and immersive multimedia services.
2025
English
2025 International Conference on Visual Communications and Image Processing (VCIP), Klagenfurt, Austria
1
3
3
2025 International Conference on Visual Communications and Image Processing (VCIP)
Anonymous experts
01-04 December 2025
Klagenfurt, Austria
international
scientific
Quality of Experience
Facial Emotion Recognition
Speech Emotion Recognition
Multimedia
no
4 Contribution in Conference Proceedings (Proceeding)::4.1 Contribution in conference proceedings
Porcu, Simone; Floris, Alessandro
273
2
4.1 Contribution in conference proceedings
none
info:eu-repo/semantics/conferencePaper
Files in this product:
There are no files associated with this product.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
