Filippo Botti
YOU?
Author Swipe
SISMA: Semantic Face Image Synthesis with Mamba Open
Diffusion Models have become very popular for Semantic Image Synthesis (SIS) of human faces. Nevertheless, their training and inference is computationally expensive and their computational requirements are high due to the quadratic complex…
U-Shape Mamba: State Space Model for Faster Diffusion Open
Diffusion models have become the most popular approach for high-quality image generation, but their high computational cost still remains a significant challenge. To address this problem, we propose U-Shape Mamba (USM), a novel diffusion m…
$^R$FLAV: Rolling Flow matching for infinite Audio Video generation Open
Joint audio-video (AV) generation is still a significant challenge in generative AI, primarily due to three critical requirements: quality of the generated samples, seamless multimodal synchronization and temporal coherence, with audio tra…
Mamba-ST: State Space Model for Efficient Style Transfer Open
The goal of style transfer is, given a content image and a style source, generating a new image preserving the content but with the artistic representation of the style source. Most of the state-of-the-art architectures use transformers or…
Masked Style Transfer for Source-Coherent Image-to-Image Translation Open
The goal of image-to-image translation (I2I) is to translate images from one domain to another while maintaining the content representations. A popular method for I2I translation involves the use of a reference image to guide the transform…