AccCall: Enhancing Real-time Phone Call Quality with Smartphone's Built-in Accelerometer

  • Lei Wang
  • Xingwei Wang
  • Xi Zhang
  • Xiaolei Ma
  • Yu Zhang
  • Fusang Zhang
  • Tao Gu
  • Haipeng Dai*
  • *Corresponding author for this work
  • Soochow University
  • Beijing University of Technology
  • Macquarie University
  • Inspur Computing Technology Pty Ltd
  • Nanjing University

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Speech enhancement can greatly improve the user experience during phone calls in low signal-to-noise ratio (SNR) scenarios. In this paper, we propose AccCall, a low-cost, energy-efficient, and environment-independent speech enhancement system that improves phone call quality using the smartphone's built-in accelerometer. However, a significant gap remains between the underlying insight and its practical application, because several critical challenges must be addressed: the efficiency of speech enhancement in cross-user scenarios, adaptive system triggering to reduce energy consumption, and lightweight deployment for real-time processing. To this end, we first design the Acc-Aided Network (AccNet), a cross-modal deep learning model inherently capable of cross-user generalization through three key components: a cross-modal fusion module, an accelerometer-aided (abbreviated as acc-aided) mask generator, and a unified loss function. Second, we adopt a machine learning-based approach instead of deep learning to distinguish call activity states with high accuracy and then trigger the system adaptively, ensuring lower energy consumption and efficient deployment on mobile platforms. Finally, we propose a knowledge-distillation-driven structured pruning framework that optimizes model efficiency while preserving performance. Extensive experiments with 20 participants were conducted under a user-independent scenario. The results show that AccCall achieves excellent and reliable adaptive triggering performance and enables substantial real-time improvements in SISDR, SISNR, STOI, PESQ, and WER, demonstrating the superiority of our system in enhancing speech quality and intelligibility for phone calls.
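
The abstract reports gains in SISDR without defining the metric. As a minimal sketch, assuming the commonly used definition of the scale-invariant signal-to-distortion ratio (this is not code from the AccCall authors, and the name `si_sdr` and the `eps` parameter are illustrative choices), the score can be computed as follows:

```python
import math


def si_sdr(estimate, reference, eps=1e-12):
    """Scale-invariant signal-to-distortion ratio in dB (common definition).

    `estimate` and `reference` are equal-length sequences of samples.
    `eps` is a hypothetical small constant that guards against division by zero.
    """
    if len(estimate) != len(reference):
        raise ValueError("signals must have the same length")
    if len(reference) == 0:
        raise ValueError("signals must not be empty")

    # Remove the mean so a DC offset does not affect the score.
    mean_e = sum(estimate) / len(estimate)
    mean_r = sum(reference) / len(reference)
    e = [x - mean_e for x in estimate]
    r = [x - mean_r for x in reference]

    # Project the estimate onto the reference to find the optimal scaling.
    dot = sum(a * b for a, b in zip(e, r))
    ref_energy = sum(b * b for b in r)
    alpha = dot / (ref_energy + eps)

    # Split the estimate into a scaled-reference part and a residual.
    target = [alpha * b for b in r]
    residual = [a - t for a, t in zip(e, target)]

    target_energy = sum(t * t for t in target)
    residual_energy = sum(n * n for n in residual)
    return 10.0 * math.log10((target_energy + eps) / (residual_energy + eps))


if __name__ == "__main__":
    # Synthetic signals for illustration only; not data from the paper.
    clean = [math.sin(0.1 * i) for i in range(1000)]
    noisy = [c + 0.1 * math.cos(0.37 * i) for i, c in enumerate(clean)]
    louder = [2.0 * x for x in noisy]
    print(si_sdr(noisy, clean), si_sdr(louder, clean))
```

Because the estimate is projected onto the reference before the ratio is taken, rescaling the enhanced signal leaves the score essentially unchanged, so the metric is not affected by a simple change in output gain.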

Original language: English
Article number: 3749463
Journal: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
Volume: 9
Issue number: 3
DOI
Publication status: Published - 3 Sep 2025

UN Sustainable Development Goals

This output contributes to the following Sustainable Development Goals:

  1. Sustainable Development Goal 7 - Affordable and Clean Energy
