Wonil Chang
YOU?
Author Swipe
View article: Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation Open
We present a unified and hardware efficient architecture for two stage voice trigger detection (VTD) and false trigger mitigation (FTM) tasks. Two stage VTD systems of voice assistants can get falsely activated to audio segments acoustical…
View article: Orthogonality Constrained Multi-Head Attention For Keyword Spotting
Orthogonality Constrained Multi-Head Attention For Keyword Spotting Open
Multi-head attention mechanism is capable of learning various representations from sequential data while paying attention to different subsequences, e.g., word-pieces or syllables in a spoken word. From the subsequences, it retrieves riche…
View article: An End-to-End Text-independent Speaker Verification Framework with a Keyword Adversarial Network
An End-to-End Text-independent Speaker Verification Framework with a Keyword Adversarial Network Open
This paper presents an end-to-end text-independent speaker verification framework by jointly considering the speaker embedding (SE) network and automatic speech recognition (ASR) network. The SE network learns to output an embedding vector…