Semantic Object-level Modeling for Robust Visual Camera Relocalization Article Swipe
YOU?
·
· 2024
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2402.06951
Visual relocalization is crucial for autonomous visual localization and navigation of mobile robotics. Due to the improvement of CNN-based object detection algorithm, the robustness of visual relocalization is greatly enhanced especially in viewpoints where classical methods fail. However, ellipsoids (quadrics) generated by axis-aligned object detection may limit the accuracy of the object-level representation and degenerate the performance of visual relocalization system. In this paper, we propose a novel method of automatic object-level voxel modeling for accurate ellipsoidal representations of objects. As for visual relocalization, we design a better pose optimization strategy for camera pose recovery, to fully utilize the projection characteristics of 2D fitted ellipses and the 3D accurate ellipsoids. All of these modules are entirely intergrated into visual SLAM system. Experimental results show that our semantic object-level mapping and object-based visual relocalization methods significantly enhance the performance of visual relocalization in terms of robustness to new viewpoints.
Related Topics
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2402.06951
- https://arxiv.org/pdf/2402.06951
- OA Status
- green
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4391871678
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4391871678Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.48550/arxiv.2402.06951Digital Object Identifier
- Title
-
Semantic Object-level Modeling for Robust Visual Camera RelocalizationWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2024Year of publication
- Publication date
-
2024-02-10Full publication date if available
- Authors
-
Yifan Zhu, Lingjuan Miao, Haitao Wu, Zhiqiang Zhou, Weiyi Chen, Longwen WuList of authors in order
- Landing page
-
https://arxiv.org/abs/2402.06951Publisher landing page
- PDF URL
-
https://arxiv.org/pdf/2402.06951Direct link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://arxiv.org/pdf/2402.06951Direct OA link when available
- Concepts
-
Computer vision, Computer science, Artificial intelligence, Object (grammar), Computer graphics (images)Top concepts (fields/topics) attached by OpenAlex
- Cited by
-
0Total citation count in OpenAlex
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4391871678 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2402.06951 |
| ids.doi | https://doi.org/10.48550/arxiv.2402.06951 |
| ids.openalex | https://openalex.org/W4391871678 |
| fwci | |
| type | preprint |
| title | Semantic Object-level Modeling for Robust Visual Camera Relocalization |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10627 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9966999888420105 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1707 |
| topics[0].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[0].display_name | Advanced Image and Video Retrieval Techniques |
| topics[1].id | https://openalex.org/T10191 |
| topics[1].field.id | https://openalex.org/fields/22 |
| topics[1].field.display_name | Engineering |
| topics[1].score | 0.9922999739646912 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/2202 |
| topics[1].subfield.display_name | Aerospace Engineering |
| topics[1].display_name | Robotics and Sensor-Based Localization |
| topics[2].id | https://openalex.org/T11714 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9879000186920166 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1707 |
| topics[2].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[2].display_name | Multimodal Machine Learning Applications |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C31972630 |
| concepts[0].level | 1 |
| concepts[0].score | 0.6640622019767761 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q844240 |
| concepts[0].display_name | Computer vision |
| concepts[1].id | https://openalex.org/C41008148 |
| concepts[1].level | 0 |
| concepts[1].score | 0.6078320741653442 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[1].display_name | Computer science |
| concepts[2].id | https://openalex.org/C154945302 |
| concepts[2].level | 1 |
| concepts[2].score | 0.6016013026237488 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[2].display_name | Artificial intelligence |
| concepts[3].id | https://openalex.org/C2781238097 |
| concepts[3].level | 2 |
| concepts[3].score | 0.5871143341064453 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q175026 |
| concepts[3].display_name | Object (grammar) |
| concepts[4].id | https://openalex.org/C121684516 |
| concepts[4].level | 1 |
| concepts[4].score | 0.33962398767471313 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q7600677 |
| concepts[4].display_name | Computer graphics (images) |
| keywords[0].id | https://openalex.org/keywords/computer-vision |
| keywords[0].score | 0.6640622019767761 |
| keywords[0].display_name | Computer vision |
| keywords[1].id | https://openalex.org/keywords/computer-science |
| keywords[1].score | 0.6078320741653442 |
| keywords[1].display_name | Computer science |
| keywords[2].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[2].score | 0.6016013026237488 |
| keywords[2].display_name | Artificial intelligence |
| keywords[3].id | https://openalex.org/keywords/object |
| keywords[3].score | 0.5871143341064453 |
| keywords[3].display_name | Object (grammar) |
| keywords[4].id | https://openalex.org/keywords/computer-graphics |
| keywords[4].score | 0.33962398767471313 |
| keywords[4].display_name | Computer graphics (images) |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2402.06951 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2402.06951 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2402.06951 |
| locations[1].id | doi:10.48550/arxiv.2402.06951 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2402.06951 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5101632981 |
| authorships[0].author.orcid | https://orcid.org/0000-0001-8359-6492 |
| authorships[0].author.display_name | Yifan Zhu |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Zhu, Yifan |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5100348913 |
| authorships[1].author.orcid | https://orcid.org/0000-0003-1782-4535 |
| authorships[1].author.display_name | Lingjuan Miao |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Miao, Lingjuan |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5081635469 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-4804-3806 |
| authorships[2].author.display_name | Haitao Wu |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Wu, Haitao |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5029710781 |
| authorships[3].author.orcid | https://orcid.org/0000-0001-6871-8236 |
| authorships[3].author.display_name | Zhiqiang Zhou |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Zhou, Zhiqiang |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5100731420 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-1702-5653 |
| authorships[4].author.display_name | Weiyi Chen |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Chen, Weiyi |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5032451912 |
| authorships[5].author.orcid | https://orcid.org/0000-0002-9442-4781 |
| authorships[5].author.display_name | Longwen Wu |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Wu, Longwen |
| authorships[5].is_corresponding | False |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2402.06951 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Semantic Object-level Modeling for Robust Visual Camera Relocalization |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10627 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9966999888420105 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1707 |
| primary_topic.subfield.display_name | Computer Vision and Pattern Recognition |
| primary_topic.display_name | Advanced Image and Video Retrieval Techniques |
| related_works | https://openalex.org/W2755342338, https://openalex.org/W2058170566, https://openalex.org/W2036807459, https://openalex.org/W2772917594, https://openalex.org/W2775347418, https://openalex.org/W1969923398, https://openalex.org/W2166024367, https://openalex.org/W3116076068, https://openalex.org/W2229312674, https://openalex.org/W2079911747 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2402.06951 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2402.06951 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2402.06951 |
| primary_location.id | pmh:oai:arXiv.org:2402.06951 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2402.06951 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2402.06951 |
| publication_date | 2024-02-10 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 66, 86 |
| abstract_inverted_index.2D | 102 |
| abstract_inverted_index.3D | 107 |
| abstract_inverted_index.As | 80 |
| abstract_inverted_index.In | 61 |
| abstract_inverted_index.by | 41 |
| abstract_inverted_index.in | 31, 141 |
| abstract_inverted_index.is | 2, 27 |
| abstract_inverted_index.of | 10, 17, 24, 49, 57, 69, 78, 101, 111, 138, 143 |
| abstract_inverted_index.to | 14, 95, 145 |
| abstract_inverted_index.we | 64, 84 |
| abstract_inverted_index.All | 110 |
| abstract_inverted_index.Due | 13 |
| abstract_inverted_index.and | 8, 53, 105, 129 |
| abstract_inverted_index.are | 114 |
| abstract_inverted_index.for | 4, 74, 81, 91 |
| abstract_inverted_index.may | 45 |
| abstract_inverted_index.new | 146 |
| abstract_inverted_index.our | 125 |
| abstract_inverted_index.the | 15, 22, 47, 50, 55, 98, 106, 136 |
| abstract_inverted_index.SLAM | 119 |
| abstract_inverted_index.into | 117 |
| abstract_inverted_index.pose | 88, 93 |
| abstract_inverted_index.show | 123 |
| abstract_inverted_index.that | 124 |
| abstract_inverted_index.this | 62 |
| abstract_inverted_index.fail. | 36 |
| abstract_inverted_index.fully | 96 |
| abstract_inverted_index.limit | 46 |
| abstract_inverted_index.novel | 67 |
| abstract_inverted_index.terms | 142 |
| abstract_inverted_index.these | 112 |
| abstract_inverted_index.voxel | 72 |
| abstract_inverted_index.where | 33 |
| abstract_inverted_index.Visual | 0 |
| abstract_inverted_index.better | 87 |
| abstract_inverted_index.camera | 92 |
| abstract_inverted_index.design | 85 |
| abstract_inverted_index.fitted | 103 |
| abstract_inverted_index.method | 68 |
| abstract_inverted_index.mobile | 11 |
| abstract_inverted_index.object | 19, 43 |
| abstract_inverted_index.paper, | 63 |
| abstract_inverted_index.visual | 6, 25, 58, 82, 118, 131, 139 |
| abstract_inverted_index.crucial | 3 |
| abstract_inverted_index.enhance | 135 |
| abstract_inverted_index.greatly | 28 |
| abstract_inverted_index.mapping | 128 |
| abstract_inverted_index.methods | 35, 133 |
| abstract_inverted_index.modules | 113 |
| abstract_inverted_index.propose | 65 |
| abstract_inverted_index.results | 122 |
| abstract_inverted_index.system. | 60, 120 |
| abstract_inverted_index.utilize | 97 |
| abstract_inverted_index.However, | 37 |
| abstract_inverted_index.accuracy | 48 |
| abstract_inverted_index.accurate | 75, 108 |
| abstract_inverted_index.ellipses | 104 |
| abstract_inverted_index.enhanced | 29 |
| abstract_inverted_index.entirely | 115 |
| abstract_inverted_index.modeling | 73 |
| abstract_inverted_index.objects. | 79 |
| abstract_inverted_index.semantic | 126 |
| abstract_inverted_index.strategy | 90 |
| abstract_inverted_index.CNN-based | 18 |
| abstract_inverted_index.automatic | 70 |
| abstract_inverted_index.classical | 34 |
| abstract_inverted_index.detection | 20, 44 |
| abstract_inverted_index.generated | 40 |
| abstract_inverted_index.recovery, | 94 |
| abstract_inverted_index.robotics. | 12 |
| abstract_inverted_index.(quadrics) | 39 |
| abstract_inverted_index.algorithm, | 21 |
| abstract_inverted_index.autonomous | 5 |
| abstract_inverted_index.degenerate | 54 |
| abstract_inverted_index.ellipsoids | 38 |
| abstract_inverted_index.especially | 30 |
| abstract_inverted_index.navigation | 9 |
| abstract_inverted_index.projection | 99 |
| abstract_inverted_index.robustness | 23, 144 |
| abstract_inverted_index.viewpoints | 32 |
| abstract_inverted_index.ellipsoidal | 76 |
| abstract_inverted_index.ellipsoids. | 109 |
| abstract_inverted_index.improvement | 16 |
| abstract_inverted_index.intergrated | 116 |
| abstract_inverted_index.performance | 56, 137 |
| abstract_inverted_index.viewpoints. | 147 |
| abstract_inverted_index.Experimental | 121 |
| abstract_inverted_index.axis-aligned | 42 |
| abstract_inverted_index.localization | 7 |
| abstract_inverted_index.object-based | 130 |
| abstract_inverted_index.object-level | 51, 71, 127 |
| abstract_inverted_index.optimization | 89 |
| abstract_inverted_index.significantly | 134 |
| abstract_inverted_index.relocalization | 1, 26, 59, 132, 140 |
| abstract_inverted_index.representation | 52 |
| abstract_inverted_index.characteristics | 100 |
| abstract_inverted_index.relocalization, | 83 |
| abstract_inverted_index.representations | 77 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 6 |
| citation_normalized_percentile |