arXiv (Cornell University)
Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization
March 2024 • Jiayun Wang, Stella X. Yu, Yubei Chen
Learning visual features from unlabeled images has proven successful for semantic categorization, often by mapping different $views$ of the same object to the same feature to achieve recognition invariance. However, visual recognition involves not only identifying $what$ an object is but also understanding $how$ it is presented. For example, seeing a car from the side versus head-on is crucial for deciding whether to stay put or jump out of the way. While unsupervised feature learning for downstream viewpoint reas…