Exploring foci of:
arXiv (Cornell University)
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding
September 2023 • Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish S. Shah, Ser-Nam Lim
While most modern video understanding models operate on short-range clips, real-world videos are often several minutes long with semantically consistent segments of variable length. A common approach to process long videos is applying a short-form video model over uniformly sampled clips of fixed temporal length and aggregating the outputs. This approach neglects the underlying nature of long videos since fixed-length clips are often redundant or uninformative. In this paper, we aim to provide a generic and adapti…
Computer Science
Segmentation Fault
Artificial Intelligence
List Of Solar Eclipses In The 21st Century
Process (Computing)
Computer Vision
Mathematics
Management
Economics
Combinatorics
Database