Exploring foci of:
arXiv (Cornell University)
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
December 2024 • Lunhao Duan, Shanshan Zhao, Wenjun Yan, Yinglun Li, Qing-Guo Chen, Xu Zhao, Weihua Luo, Kaifu Zhang, Mingming Gong, Gui-Song Xia
Recently, text-to-image generation models have achieved remarkable advancements, particularly with diffusion models facilitating high-quality image synthesis from textual descriptions. However, these models often struggle with achieving precise control over pixel-level layouts, object appearances, and global styles when using text prompts alone. To mitigate this issue, previous works introduce conditional images as auxiliary inputs for image generation, enhancing control but typically necessitating specialized mod…
Transformer
Computer Science
Rayon
Artificial Intelligence
Electrical Engineering
Engineering
Computer Hardware
Voltage
Materials Science