arXiv (Cornell University)
Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input
June 2024 • J. C. Ott, Zuowen Wang, Shih‐Chii Liu
Event cameras are advantageous for tasks that require vision sensors with low-latency and sparse output responses. However, the development of deep network algorithms using event cameras has been slow because of the lack of large labelled event camera datasets for network training. This paper reports a method for creating new labelled event datasets by using a text-to-X model, where X is one or multiple output modalities, in the case of this work, events. Our proposed text-to-events model produces synthetic event …