LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model

Exploring foci of: arXiv (Cornell University) LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model June 2024 • Dongkai Wang, Shiyu Xuan, Shiliang Zhang The capacity of existing human keypoint localization models is limited by keypoint priors provided by the training data. To alleviate this restriction and pursue more general model, this work studies keypoint localization from a different perspective by reasoning locations based on keypiont clues in text descriptions. We propose LocLLM, the first Large-Language Model (LLM) based keypoint localization model that takes images and text instructions as inputs and outputs the desired keypoint coordinates. LocLLM levera… Open Article Page

Computer Science Artificial Intelligence Open Article