Ivan Himawan
Jointly Trained Conversion Model With LPCNet for Any-to-One Voice Conversion Using Speaker-Independent Linguistic Features
We propose a joint training scheme of an any-to-one voice conversion (VC) system with LPCNet to improve the speech naturalness, speaker similarity, and intelligibility of the converted speech. Recent advancements in neural-based vocoders, …
Speaker Adaptation of a Multilingual Acoustic Model for Cross-Language Synthesis
Several studies have shown promising results in adapting DNN-based acoustic models as a mechanism to transfer characteristics from pre-trained models. One such example is speaker adaptation using a small amount of data, where fine-tuning h…
QUT System Description to the NIST SRE 2018 Campaign
The QUT speech team participated in the 2018 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE) and made one primary and two contrastive submissions to the fixed condition. Our systems rely…
3D convolution recurrent neural networks for bird sound detection
With the increasing use of high-quality acoustic devices to monitor wildlife populations, it has become imperative to develop techniques for analyzing animals’ calls automatically. Bird sound detection is one example of a long-term monitor…
Semi-Supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control
Automatic Speech Recognition (ASR) can introduce higher levels of automation into Air Traffic Control (ATC), where spoken language is still the predominant form of communication. While ATC uses standard phraseology and a limited vocabulary…
Idiap Submission to the NIST SRE 2016 Speaker Recognition Evaluation
LIDIAP
Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages
The multi-level adaptive networks (MLAN) technique is a cross-lingual adaptation framework where a bottleneck (BN) layer in a deep neural network (DNN) trained in a source language is used for producing BN features to be exploited in a s…