UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Exploring foci of: arXiv (Cornell University) UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation December 2024 • Lunhao Duan, Shanshan Zhao, Wenjun Yan, Yinglun Li, Qing-Guo Chen, Xu Zhao, Weihua Luo, Kaifu Zhang, Mingming Gong, Gui-Song Xia Recently, text-to-image generation models have achieved remarkable advancements, particularly with diffusion models facilitating high-quality image synthesis from textual descriptions. However, these models often struggle with achieving precise control over pixel-level layouts, object appearances, and global styles when using text prompts alone. To mitigate this issue, previous works introduce conditional images as auxiliary inputs for image generation, enhancing control but typically necessitating specialized mod… Open Article Page

Transformer Computer Science Rayon Artificial Intelligence Electrical Engineering Engineering Computer Hardware Voltage Materials Science Open Article