Roy Hirsch
On the Semantic Latent Space of Diffusion-Based Text-to-Speech Models Open
The incorporation of Denoising Diffusion Models (DDMs) in the Text-to-Speech (TTS) domain is rising, providing great value in synthesizing high-quality speech. Although they exhibit impressive audio quality, the extent of their semantic ca…
Weakly-Supervised Surgical Phase Recognition Open
A key element of computer-assisted surgery systems is phase recognition of surgical videos. Existing phase recognition algorithms require frame-wise annotation of a large number of videos, which is costly in both time and money. In this work we…
Efficient Discovery and Effective Evaluation of Visual Perceptual Similarity: A Benchmark and Beyond Open
Visual similarities discovery (VSD) is an important task with broad e-commerce applications. Given an image of a certain object, the goal of VSD is to retrieve images of different objects with high perceptual visual similarity. Although be…
Self-Supervised Learning for Endoscopic Video Analysis Open
Self-supervised learning (SSL) has led to important breakthroughs in computer vision by allowing learning from large amounts of unlabeled data. As such, it might have a pivotal role to play in biomedicine where annotating data requires a h…
Cold Item Integration in Deep Hybrid Recommenders via Tunable Stochastic Gates Open
A major challenge in collaborative filtering methods is how to produce recommendations for cold items (items with no ratings), or integrate cold items into an existing catalog. Over the years, a variety of hybrid recommendation models have …