Explanipedia

Diffusion Models Are Real-Time Game Engines Open

Dani Valevski, Yaniv Leviathan, Moab Arar, Shlomi Fruchter · 2024

Computer science Physics

We present GameNGen, the first game engine powered entirely by a neural model that also enables real-time interaction with a complex environment over long trajectories at high quality. When trained on the classic game DOOM, GameNGen extrac…

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Open

Omri Avrahami, Amir Hertz, Yael Vinker, Moab Arar, Shlomi Fruchter , et al. · 2024

Computer science Physics

Recent advances in text-to-image generation models have unlocked vast potential for visual creativity. However, the users that use these models struggle with the generation of consistent characters, a crucial aspect for numerous real-world…

PALP: Prompt Aligned Personalization of Text-to-Image Models Open

Moab Arar, Andrey Voynov, Amir Hertz, Omri Avrahami, Shlomi Fruchter , et al. · 2024

Computer science Sociology Physics

Content creators often aim to create personalized images using personal subjects that go beyond the capabilities of conventional text-to-image models. Additionally, they may want the resulting image to encompass a specific location, style,…

Curved Diffusion: A Generative Model With Optical Geometry Control Open

Andrey Voynov, Amir Hertz, Moab Arar, Shlomi Fruchter, Daniel Cohen‐Or · 2023

Computer science Mathematics Physics

State-of-the-art diffusion models can generate highly realistic images based on various conditioning like text, segmentation, and depth. However, an essential aspect often overlooked is the specific camera geometry used during image captur…

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Open

Omri Avrahami, Amir Hertz, Yael Vinker, Moab Arar, Shlomi Fruchter , et al. · 2023

Computer science Mathematics Physics

Recent advances in text-to-image generation models have unlocked vast potential for visual creativity. However, the users that use these models struggle with the generation of consistent characters, a crucial aspect for numerous real-world…

Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models Open

Moab Arar, Rinon Gal, Yuval Atzmon, Gal Chechik, Daniel Cohen‐Or , et al. · 2023

Computer science Mathematics Political science

Text-to-image (T2I) personalization allows users to guide the creative image generation process by combining their own visual concepts in natural language prompts. Recently, encoder-based techniques have emerged as a new effective approach…

Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models Open

Rinon Gal, Moab Arar, Yuval Atzmon, Amit H. Bermano, Gal Chechik , et al. · 2023

Computer science Philosophy Mathematics

Text-to-image personalization aims to teach a pre-trained diffusion model to reason about novel, user provided concepts, embedding them into new scenes guided by natural language prompts. However, current personalization approaches struggl…

Single Motion Diffusion Open

Sigal Raab, Inbal Leibovitch, Guy Tevet, Moab Arar, Amit H. Bermano , et al. · 2023

Computer science

Synthesizing realistic animations of humans, animals, and even imaginary creatures, has long been a goal for artists and computer graphics professionals. Compared to the imaging domain, which is rich with large available datasets, the numb…

Learned Queries for Efficient Local Attention Open

Moab Arar, Ariel Shamir, Amit H. Bermano · 2021

Computer science Art Physics

Vision Transformers (ViT) serve as powerful vision models. Unlike convolutional neural networks, which dominated vision research in previous years, vision transformers enjoy the ability to capture long-range dependencies in the data. Nonet…

InAugment: Improving Classifiers via Internal Augmentation Open

Moab Arar, Ariel Shamir, Amit H. Bermano · 2021

Computer science Mathematics Chemistry

Image augmentation techniques apply transformation functions such as rotation, shearing, or color distortion on an input image. These augmentations were proven useful in improving neural networks' generalization ability. In this paper, we …

Image resizing by reconstruction from deep features Open

Dov Danon, Moab Arar, Daniel Cohen‐Or, Ariel Shamir · 2021

Computer science Philosophy Business

Traditional image resizing methods usually work in pixel space and use various saliency measures. The challenge is to adjust the image shape while trying to preserve important content. In this paper we perform image resizing in feature spa…

Focus-and-Expand: Training Guidance Through Gradual Manipulation of Input Features Open

Moab Arar, Noa Fish, Dani Daniel, Evgeny Tenetov, Ariel Shamir , et al. · 2020

Computer science Mathematics Physics

We present a simple and intuitive Focus-and-eXpand (\fax) method to guide the training process of a neural network towards a specific solution. Optimizing a neural network is a highly non-convex problem. Typically, the space of solutions i…

Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation Open

Moab Arar, Yiftach Ginger, Dov Danon, I. Leizerson, Amit H. Bermano , et al. · 2020

Computer science Chemistry Sociology

Many applications, such as autonomous driving, heavily rely on multi-modal data where spatial alignment between the modalities is required. Most multi-modal registration methods struggle computing the spatial correspondence between the ima…

Unsupervised Multi-Modal Image Registration via Geometry Preserving\n Image-to-Image Translation Open

Moab Arar, Yiftach Ginger, Dov Danon, I. Leizerson, Amit H. Bermano , et al. · 2020

Computer science Chemistry Sociology

Many applications, such as autonomous driving, heavily rely on multi-modal\ndata where spatial alignment between the modalities is required. Most\nmulti-modal registration methods struggle computing the spatial correspondence\nbetween the …

Image Resizing by Reconstruction from Deep Features Open

Moab Arar, Dov Danon, Daniel Cohen‐Or, Ariel Shamir · 2019

Computer science Business Philosophy

Traditional image resizing methods usually work in pixel space and use various saliency measures. The challenge is to adjust the image shape while trying to preserve important content. In this paper we perform image resizing in feature spa…

Moab Arar YOU? Author Swipe