TY - JOUR
T1 - Secure Voice Input on Augmented Reality Headsets
AU - Shang, Jiacheng
AU - Wu, Jie
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2022/4/1
Y1 - 2022/4/1
N2 - Voice-based input is usually used as the primary input method for augmented reality (AR) headsets due to immersive AR experience and good recognition performance. However, recent researches show that attackers can inject inaudible voice commands to the devices that lack voice verification. Even if we secure voice input with voice verification techniques, attackers can record the victim's voice and replay it. To defend against voice-spoofing attacks, AR headsets should be able to determine whether the voice is from the person who is using the AR headsets. Existing voice-spoofing defense systems are designed for smartphone platforms and usually fail to work due to the special locations of microphones and loudspeakers on AR headsets. To address this challenge, in this paper, we propose a voice-spoofing defense system for AR headsets by leveraging both the internal body propagation and the air propagation of human voices. Experimental results show that our system can successfully accept normal users with average accuracy of 97 percent and defend against two basic types of attacks with average accuracy of at least 98 percent. More importantly, even if the attackers can fool our line-fitting model by manipulating special voice signals, our MCD-SVDD model can still reject them with accuracy of 100 percent.
AB - Voice-based input is usually used as the primary input method for augmented reality (AR) headsets due to immersive AR experience and good recognition performance. However, recent researches show that attackers can inject inaudible voice commands to the devices that lack voice verification. Even if we secure voice input with voice verification techniques, attackers can record the victim's voice and replay it. To defend against voice-spoofing attacks, AR headsets should be able to determine whether the voice is from the person who is using the AR headsets. Existing voice-spoofing defense systems are designed for smartphone platforms and usually fail to work due to the special locations of microphones and loudspeakers on AR headsets. To address this challenge, in this paper, we propose a voice-spoofing defense system for AR headsets by leveraging both the internal body propagation and the air propagation of human voices. Experimental results show that our system can successfully accept normal users with average accuracy of 97 percent and defend against two basic types of attacks with average accuracy of at least 98 percent. More importantly, even if the attackers can fool our line-fitting model by manipulating special voice signals, our MCD-SVDD model can still reject them with accuracy of 100 percent.
KW - AR headsets
KW - liveness detection
KW - voice spoofing attack
UR - http://www.scopus.com/inward/record.url?scp=85126286701&partnerID=8YFLogxK
U2 - 10.1109/TMC.2020.3020470
DO - 10.1109/TMC.2020.3020470
M3 - Article
AN - SCOPUS:85126286701
SN - 1536-1233
VL - 21
SP - 1420
EP - 1433
JO - IEEE Transactions on Mobile Computing
JF - IEEE Transactions on Mobile Computing
IS - 4
ER -