Marc Ferràs
Unsupervised Cross-Domain Speech-to-Speech Conversion with Time-Frequency Consistency Open
In recent years generative adversarial network (GAN) based models have been successfully applied for unsupervised speech-to-speech conversion. The rich compact harmonic view of the magnitude spectrogram is considered a suitable choice fo…
Content Normalization for Text-Dependent Speaker Verification Open
Subspace-based techniques, such as i-vector and Joint Factor Analysis (JFA), have been shown to provide state-of-the-art performance for fixed-phrase-based text-dependent speaker verification. However, the error rates of such systems on the rando…
Intra-class covariance adaptation in PLDA back-ends for speaker verification Open
Multi-session training conditions are becoming increasingly common in recent benchmark datasets for both text-independent and text-dependent speaker verification. In the state-of-the-art i-vector framework for speaker verification, such co…
Exploiting sequence information for text-dependent Speaker Verification Open
Model-based approaches to Speaker Verification (SV), such as Joint Factor Analysis (JFA), i-vector and relevance Maximum-a-Posteriori (MAP), have been shown to provide state-of-the-art performance for text-dependent systems with fixed phrases. …
Speaker Diarization and Linking of Meeting Data Open
Finding who spoke when in a collection of recordings, with speakers being uniquely identified across the database, is a challenging task. In this scenario, reasonable computing times and acoustic variation across recordings remain two majo…
A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition Open
The state-of-the-art speaker-recognition systems suffer from significant performance loss on degraded speech conditions and acoustic mismatch between enrolment and test phases. Past international evaluation campaigns, such as the NIST spea…
Deep neural network based posteriors for text-dependent speaker verification Open
The i-vector and Joint Factor Analysis (JFA) systems for text-dependent speaker verification use sufficient statistics computed from a speech utterance to estimate speaker models. These statistics average the acoustic information over t…
System fusion and speaker linking for longitudinal diarization of TV shows Open
Performing speaker diarization while uniquely identifying the speakers in a collection of audio recordings is a challenging task. Based on our previous work on speaker diarization and linking, we developed a system for diarizing longitudin…
Idiap Submission to the NIST SRE 2016 Speaker Recognition Evaluation Open
LIDIAP
Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit Open
LIDIAP