Explanipedia

Unsupervised Cross-Domain Speech-to-Speech Conversion with\n Time-Frequency Consistency Open

Asif M. Khan, Fabien Cardinaux, Stefan Uhlich, Marc Ferràs, Asja Fischer · 2020

Computer science

In recent years generative adversarial network (GAN) based models have been\nsuccessfully applied for unsupervised speech-to-speech conversion.The rich\ncompact harmonic view of the magnitude spectrogram is considered a suitable\nchoice fo…

Content Normalization for Text-Dependent Speaker Verification Open

Subhadeep Dey, Srikanth Madikeri, Petr Motlíček, Marc Ferràs · 2017

Computer science Sociology

Subspace based techniques, such as i-vector and Joint Factor Analysis (JFA) have shown to provide state-of-the-art performance for fixed phrase based text-dependent speaker verification.However, the error rates of such systems on the rando…

Intra-class covariance adaptation in PLDA back-ends for speaker verification Open

Srikanth Madikeri, Marc Ferràs, Petr Motlíček, Subhadeep Dey · 2017

Computer science Mathematics Geography

Multi-session training conditions are becoming increasingly common in recent benchmark datasets for both text-independent and text-dependent speaker verification. In the state-of-the-art i-vector framework for speaker verification, such co…

Exploiting sequence information for text-dependent Speaker Verification Open

Subhadeep Dey, Petr Motlíček, Srikanth Madikeri, Marc Ferràs · 2017

Computer science Mathematics Biology

Model-based approaches to Speaker Verification (SV), such as Joint Factor Analysis (JFA), i-vector and relevance Maximum-a-Posteriori (MAP), have shown to provide state-of-the-art performance for text-dependent systems with fixed phrases. …

Speaker Diarization and Linking of Meeting Data Open

Marc Ferràs, Srikanth Madikeri, Hervé Bourlard · 2016

Computer science Physics Economics

Finding who spoke when in a collection of recordings, with speakers being uniquely identified across the database, is a challenging task. In this scenario, reasonable computing times and acoustic variation across recordings remain two majo…

A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition Open

Marc Ferràs, Srikanth Madikeri, Petr Motlíček, Subhadeep Dey, Hervé Bourlard · 2016

Computer science

The state-of-the-art speaker-recognition systems suffer from significant performance loss on degraded speech conditions and acoustic mismatch between enrolment and test phases. Past international evaluation campaigns, such as the NIST spea…

Deep neural network based posteriors for text-dependent speaker verification Open

Subhadeep Dey, Srikanth Madikeri, Marc Ferràs, Petr Motlíček · 2016

Computer science Mathematics Physics

The i-vector and Joint Factor Analysis (JFA) systems for text- dependent speaker verification use sufficient statistics computed from a speech utterance to estimate speaker models. These statis- tics average the acoustic information over t…

System fusion and speaker linking for longitudinal diarization of TV shows Open

Marc Ferràs, Srikanth Madikeri, Petr Motlíček, Hervé Bourlard · 2016

Computer science Philosophy

Performing speaker diarization while uniquely identifying the speakers in a collection of audio recordings is a challenging task. Based on our previous work on speaker diarization and linking, we developed a system for diarizing longitudin…

IDIAP SUBMISSION TO THE NIST SRE 2016 SPEAKER RECOGNITION EVALUATION Open

Srikanth Madikeri, Subhadeep Dey, Marc Ferràs, Petr Motlíček, Ivan Himawan · 2016

Computer science

LIDIAP

Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit Open

Srikanth Madikeri, Subhadeep Dey, Petr Motlíček, Marc Ferràs · 2016

Computer science

LIDIAP

Marc Ferràs YOU? Author Swipe