Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input

Exploring foci of: arXiv (Cornell University) Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input June 2024 • J. C. Ott, Zuowen Wang, Shih‐Chii Liu Event cameras are advantageous for tasks that require vision sensors with low-latency and sparse output responses. However, the development of deep network algorithms using event cameras has been slow because of the lack of large labelled event camera datasets for network training. This paper reports a method for creating new labelled event datasets by using a text-to-X model, where X is one or multiple output modalities, in the case of this work, events. Our proposed text-to-events model produces synthetic event … Open Article Page

Computer Science Artificial Intelligence Physics Quantum Mechanics Open Article