MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation
2025 · Open Access · DOI: https://doi.org/10.48550/arxiv.2507.16310
Existing text-to-video methods struggle to transfer motion smoothly from a reference object to a target object when the two differ significantly in appearance or structure. To address this challenge, we introduce MotionShot, a training-free framework that parses reference-target correspondences in a fine-grained manner, achieving high-fidelity motion transfer while preserving coherence in appearance. Specifically, MotionShot first performs semantic feature matching to establish high-level alignment between the reference and target objects. It then establishes low-level morphological alignment through reference-to-target shape retargeting. By encoding motion with temporal attention, MotionShot coherently transfers motion across objects even under significant appearance and structure disparities, as demonstrated by extensive experiments. The project page is available at https://motionshot.github.io/.
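
To make the idea of semantic feature matching concrete, here is a minimal sketch of one generic way to pair points on a reference object with points on a target object: nearest-neighbour search under cosine similarity. This is an illustrative assumption, not the MotionShot implementation; the abstract does not specify the feature extractor or the matching rule, and the function names and toy descriptors below are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def match_features(ref_feats, tgt_feats):
    """For each reference descriptor, return the index of the most similar target descriptor."""
    matches = []
    for r in ref_feats:
        scores = [cosine(r, t) for t in tgt_feats]
        matches.append(max(range(len(scores)), key=scores.__getitem__))
    return matches


# Toy 2-D descriptors (hypothetical): three reference points, three target points.
ref = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
tgt = [[0.1, 0.9], [0.9, 0.8], [2.0, 0.1]]
matches = match_features(ref, tgt)
print(matches)
```

In practice the descriptors would come from a pretrained vision model rather than hand-written vectors, and a real system would likely add mutual-consistency checks; both are outside what the abstract states.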
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2507.16310, https://arxiv.org/pdf/2507.16310
- OA Status: green
- OpenAlex ID: https://openalex.org/W4414417554
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4414417554 (Canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2507.16310 (Digital Object Identifier)
- Title: MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation (Work title)
- Type: preprint (OpenAlex work type)
- Language: en (Primary language)
- Publication year: 2025 (Year of publication)
- Publication date: 2025-07-22 (Full publication date if available)
- Authors: Yanchen Liu, Yanan Sun, Zhening Xing, Junyao Gao, Kai Chen, Wenjie Pei (List of authors in order)
- Landing page: https://arxiv.org/abs/2507.16310 (Publisher landing page)
- PDF URL: https://arxiv.org/pdf/2507.16310 (Direct link to full text PDF)
- Open access: Yes (Whether a free full text is available)
- OA status: green (Open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2507.16310 (Direct OA link when available)
- Cited by: 0 (Total citation count in OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4414417554 |
| doi | https://doi.org/10.48550/arxiv.2507.16310 |
| ids.doi | https://doi.org/10.48550/arxiv.2507.16310 |
| ids.openalex | https://openalex.org/W4414417554 |
| fwci | |
| type | preprint |
| title | MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T12290 |
| topics[0].field.id | https://openalex.org/fields/22 |
| topics[0].field.display_name | Engineering |
| topics[0].score | 0.9952999949455261 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/2207 |
| topics[0].subfield.display_name | Control and Systems Engineering |
| topics[0].display_name | Human Motion and Animation |
| topics[1].id | https://openalex.org/T12720 |
| topics[1].field.id | https://openalex.org/fields/33 |
| topics[1].field.display_name | Social Sciences |
| topics[1].score | 0.9697999954223633 |
| topics[1].domain.id | https://openalex.org/domains/2 |
| topics[1].domain.display_name | Social Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/3312 |
| topics[1].subfield.display_name | Sociology and Political Science |
| topics[1].display_name | Multimedia Communication and Technology |
| topics[2].id | https://openalex.org/T11439 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9693999886512756 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1707 |
| topics[2].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[2].display_name | Video Analysis and Summarization |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2507.16310 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2507.16310 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2507.16310 |
| locations[1].id | doi:10.48550/arxiv.2507.16310 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2507.16310 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5075257692 |
| authorships[0].author.orcid | https://orcid.org/0000-0003-3366-747X |
| authorships[0].author.display_name | Yanchen Liu |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Liu, Yanchen |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5091058342 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-6374-1429 |
| authorships[1].author.display_name | Yanan Sun |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Sun, Yanan |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5009538017 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-6092-0891 |
| authorships[2].author.display_name | Zhening Xing |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Xing, Zhening |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5029286892 |
| authorships[3].author.orcid | https://orcid.org/0009-0007-2270-6820 |
| authorships[3].author.display_name | Junyao Gao |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Gao, Junyao |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5048500768 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-3930-8294 |
| authorships[4].author.display_name | Kai Chen |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Chen, Kai |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5078487642 |
| authorships[5].author.orcid | https://orcid.org/0000-0001-8117-2696 |
| authorships[5].author.display_name | Wenjie Pei |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Pei, Wenjie |
| authorships[5].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2507.16310 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T12290 |
| primary_topic.field.id | https://openalex.org/fields/22 |
| primary_topic.field.display_name | Engineering |
| primary_topic.score | 0.9952999949455261 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/2207 |
| primary_topic.subfield.display_name | Control and Systems Engineering |
| primary_topic.display_name | Human Motion and Animation |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2507.16310 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2507.16310 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2507.16310 |
| primary_location.id | pmh:oai:arXiv.org:2507.16310 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2507.16310 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2507.16310 |
| publication_date | 2025-07-22 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 9, 13, 32, 41 |
| abstract_inverted_index.By | 84 |
| abstract_inverted_index.It | 73 |
| abstract_inverted_index.To | 25, 54 |
| abstract_inverted_index.be | 55 |
| abstract_inverted_index.by | 109 |
| abstract_inverted_index.in | 19, 40, 52, 99 |
| abstract_inverted_index.is | 115 |
| abstract_inverted_index.of | 36, 102 |
| abstract_inverted_index.or | 21 |
| abstract_inverted_index.to | 4, 12, 63 |
| abstract_inverted_index.we | 29 |
| abstract_inverted_index.The | 112 |
| abstract_inverted_index.and | 70, 105 |
| abstract_inverted_index.at: | 117 |
| abstract_inverted_index.can | 92 |
| abstract_inverted_index.our | 90 |
| abstract_inverted_index.the | 68, 100 |
| abstract_inverted_index.even | 98 |
| abstract_inverted_index.from | 8 |
| abstract_inverted_index.page | 114 |
| abstract_inverted_index.then | 74 |
| abstract_inverted_index.this | 27 |
| abstract_inverted_index.with | 16, 87 |
| abstract_inverted_index.first | 58 |
| abstract_inverted_index.shape | 82 |
| abstract_inverted_index.them. | 24 |
| abstract_inverted_index.while | 49 |
| abstract_inverted_index.across | 96 |
| abstract_inverted_index.ensure | 64 |
| abstract_inverted_index.motion | 6, 47, 86, 95 |
| abstract_inverted_index.object | 11, 15 |
| abstract_inverted_index.target | 14, 71 |
| abstract_inverted_index.address | 26 |
| abstract_inverted_index.between | 23, 67 |
| abstract_inverted_index.capable | 35 |
| abstract_inverted_index.feature | 61 |
| abstract_inverted_index.further | 75 |
| abstract_inverted_index.manner, | 43 |
| abstract_inverted_index.methods | 2 |
| abstract_inverted_index.parsing | 37 |
| abstract_inverted_index.project | 113 |
| abstract_inverted_index.thereby | 44 |
| abstract_inverted_index.through | 80 |
| abstract_inverted_index.Existing | 0 |
| abstract_inverted_index.encoding | 85 |
| abstract_inverted_index.matching | 62 |
| abstract_inverted_index.objects, | 97 |
| abstract_inverted_index.objects. | 72 |
| abstract_inverted_index.performs | 59 |
| abstract_inverted_index.presence | 101 |
| abstract_inverted_index.semantic | 60 |
| abstract_inverted_index.smoothly | 7 |
| abstract_inverted_index.struggle | 3 |
| abstract_inverted_index.temporal | 88 |
| abstract_inverted_index.transfer | 5, 48, 94 |
| abstract_inverted_index.achieving | 45 |
| abstract_inverted_index.available | 116 |
| abstract_inverted_index.coherence | 51 |
| abstract_inverted_index.extensive | 110 |
| abstract_inverted_index.framework | 34 |
| abstract_inverted_index.introduce | 30 |
| abstract_inverted_index.low-level | 77 |
| abstract_inverted_index.reference | 10, 69 |
| abstract_inverted_index.specific, | 56 |
| abstract_inverted_index.structure | 22, 106 |
| abstract_inverted_index.MotionShot | 57, 91 |
| abstract_inverted_index.alignments | 66, 79 |
| abstract_inverted_index.appearance | 20, 104 |
| abstract_inverted_index.attention, | 89 |
| abstract_inverted_index.challenge, | 28 |
| abstract_inverted_index.coherently | 93 |
| abstract_inverted_index.high-level | 65 |
| abstract_inverted_index.preserving | 50 |
| abstract_inverted_index.MotionShot, | 31 |
| abstract_inverted_index.appearance. | 53 |
| abstract_inverted_index.differences | 18 |
| abstract_inverted_index.establishes | 76 |
| abstract_inverted_index.significant | 17, 103 |
| abstract_inverted_index.demonstrated | 108 |
| abstract_inverted_index.disparities, | 107 |
| abstract_inverted_index.experiments. | 111 |
| abstract_inverted_index.fine-grained | 42 |
| abstract_inverted_index.retargeting. | 83 |
| abstract_inverted_index.high-fidelity | 46 |
| abstract_inverted_index.morphological | 78 |
| abstract_inverted_index.text-to-video | 1 |
| abstract_inverted_index.training-free | 33 |
| abstract_inverted_index.correspondences | 39 |
| abstract_inverted_index.reference-target | 38 |
| abstract_inverted_index.reference-to-target | 81 |
| abstract_inverted_index.https://motionshot.github.io/. | 118 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 6 |
| citation_normalized_percentile |
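
The `abstract_inverted_index.*` rows above store the abstract as a map from each word to the positions where it occurs. A minimal sketch of turning such a map back into running text follows; the function name `rebuild_abstract` is hypothetical, and the sample dictionary is a hand-copied subset covering only positions 0 to 7 of the table, so it is not the full payload.

```python
def rebuild_abstract(inverted_index):
    """Rebuild text from a {word: [positions]} map by placing each word at its positions."""
    positions = {}
    for word, idxs in inverted_index.items():
        for i in idxs:
            positions[i] = word
    # Join the words in ascending position order.
    return " ".join(positions[i] for i in sorted(positions))


# Subset of the rows above, limited to positions 0-7 (an assumption for brevity).
sample = {
    "Existing": [0],
    "text-to-video": [1],
    "methods": [2],
    "struggle": [3],
    "to": [4],
    "transfer": [5],
    "motion": [6],
    "smoothly": [7],
}
text = rebuild_abstract(sample)
print(text)
```

Applied to the complete index, the same loop should reproduce the abstract word for word, since punctuation is kept attached to the tokens (for example `them.` and `MotionShot,`).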