SpeechQoE: A Novel Personalized QoE Assessment Model for Voice Services via Speech Sensing

Chaowei Wang, Huadi Zhu, Ming Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Quality of Experience (QoE) assessment is a long-lasting but yet-to-be-resolved task. Existing approaches, especially for conversational voice services, are restricted to leveraging network-centric parameters. However, their performances are hardly satisfactory due to the failure to consider comprehensive QoE-related factors. Moreover, they develop a one-for-all model that is uniform for all individuals and thus incapable of handling user diversity in QoE perception. This paper proposes a personalized QoE assessment model, namely SpeechQoE. It exploits speaker's speech signals to infer individual's perceived quality in voice services. SpeechQoE fundamentally addresses the drawback of conventional models. Instead of enumerating and incorporating unlimited QoE-related factors, SpeechQoE takes as input speech signals that inherently bear rich information needed for QoE assessment of the speaker. SpeechQoE employs an efficient few-shot learning framework to adapt the model to a new user quickly. We additionally design a lightweight data synthetic scheme to minimize the overhead of data collection needed for model adaption. A modular integration with a conventional parametric model is further implemented to avoid issues caused by the clean-slate data-driven approach. Our experiments show that SpeechQoE achieves an accuracy of 91.4% in QoE assessment which outperforms the state-of-the-art solutions by a clear margin. As another contribution of this work, we build a dataset that would be the first source of annotated audio tracks for QoE assessment of conversational calls.

Original languageEnglish
Title of host publicationSenSys 2022 - Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems
Pages305-319
Number of pages15
ISBN (Electronic)9781450398862
DOIs
StatePublished - 6 Nov 2022
Event20th ACM Conference on Embedded Networked Sensor Systems, SenSys 2022 - Boston, United States
Duration: 6 Nov 20229 Nov 2022

Publication series

NameSenSys 2022 - Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems

Conference

Conference20th ACM Conference on Embedded Networked Sensor Systems, SenSys 2022
Country/TerritoryUnited States
CityBoston
Period6/11/229/11/22

Fingerprint

Dive into the research topics of 'SpeechQoE: A Novel Personalized QoE Assessment Model for Voice Services via Speech Sensing'. Together they form a unique fingerprint.

Cite this