TY - GEN
T1 - WowSense
T2 - 20th International Conference on Mobility, Sensing and Networking, MSN 2024
AU - Gao, Yichao
AU - Guo, Kaiwen
AU - Zhang, Chuanzi
AU - Xin, Yiyu
AU - Han, Feiyu
AU - Du, Haohua
AU - Li, Xiang Yang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Smart-devices' grip-state has shown great potential to enable various intelligent applications, including virtual keyboard and automatic UI adaption. However, smartphones nowadays often lack the capability to detect the grip-state, resulting in a poor experience of human-computer interaction. To implement an effective and efficient grip-state detection, we need to tackle a number of technical challenges such as effective features extraction and fusion from multimodal data, nonalignment of different modal data, and limited labeled data availability. In this work, to address these challenges, we design a two-stage grip-state detecting system, named WowSense, for high-accuracy, real-time detection of phone's grip-states using IMU (Inertial Measurement Unit) and CS (Capacitivc Screen) data. Our system WowSense consists of the multimodal alignment stage and the grip-state classification stage. In the first stage, we employ a novel augmentation method to capture subtle features from IMU and CS data. Additionally, we utilize contrastive learning to extract consistent information across these two modalities using a large amount of unlabeled data. In the second stage, we design an attention-based classifier to capture complementary information using only a small amount of labeled data. We implement our system in OpenHarmony and our extensive experimental results demonstrate the superiority of our system, which achieves 95 % accuracy with only 40 % of the data labeled on a self-collected dataset and 92.5
AB - Smart-devices' grip-state has shown great potential to enable various intelligent applications, including virtual keyboard and automatic UI adaption. However, smartphones nowadays often lack the capability to detect the grip-state, resulting in a poor experience of human-computer interaction. To implement an effective and efficient grip-state detection, we need to tackle a number of technical challenges such as effective features extraction and fusion from multimodal data, nonalignment of different modal data, and limited labeled data availability. In this work, to address these challenges, we design a two-stage grip-state detecting system, named WowSense, for high-accuracy, real-time detection of phone's grip-states using IMU (Inertial Measurement Unit) and CS (Capacitivc Screen) data. Our system WowSense consists of the multimodal alignment stage and the grip-state classification stage. In the first stage, we employ a novel augmentation method to capture subtle features from IMU and CS data. Additionally, we utilize contrastive learning to extract consistent information across these two modalities using a large amount of unlabeled data. In the second stage, we design an attention-based classifier to capture complementary information using only a small amount of labeled data. We implement our system in OpenHarmony and our extensive experimental results demonstrate the superiority of our system, which achieves 95 % accuracy with only 40 % of the data labeled on a self-collected dataset and 92.5
KW - Multimodal fusion
KW - Sensing
KW - Smartphone grip-state detect
UR - https://www.scopus.com/pages/publications/105010318622
U2 - 10.1109/MSN63567.2024.00130
DO - 10.1109/MSN63567.2024.00130
M3 - 会议稿件
AN - SCOPUS:105010318622
T3 - Proceedings - 2024 20th International Conference on Mobility, Sensing and Networking, MSN 2024
SP - 948
EP - 955
BT - Proceedings - 2024 20th International Conference on Mobility, Sensing and Networking, MSN 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 20 December 2024 through 22 December 2024
ER -