Provenance Analysis of Archaeological Artifacts via Multimodal RAG Systems
2025 · Open Access · DOI: https://doi.org/10.48550/arxiv.2509.20769
In this work, we present a retrieval-augmented generation (RAG)-based system for provenance analysis of archaeological artifacts, designed to support expert reasoning by integrating multimodal retrieval and large vision-language models (VLMs). The system constructs a dual-modal knowledge base from reference texts and images, enabling raw visual, edge-enhanced, and semantic retrieval to identify stylistically similar objects. Retrieved candidates are synthesized by the VLM to generate structured inferences, including chronological, geographical, and cultural attributions, alongside interpretive justifications. We evaluate the system on a set of Eastern Eurasian Bronze Age artifacts from the British Museum. Expert evaluation demonstrates that the system produces meaningful and interpretable outputs, offering scholars concrete starting points for analysis and significantly alleviating the cognitive burden of navigating vast comparative corpora.
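The retrieval step described in the abstract can be illustrated with a small sketch. The abstract names three retrieval channels (raw visual, edge-enhanced, semantic) but does not specify the embedding models, the similarity measure, or how per-channel results are combined, so everything below is an assumption made for illustration: the toy two-dimensional vectors, the cosine similarity, the per-channel top-k, the order-preserving union, and the names `cosine`, `retrieve`, and `KB` are all hypothetical and are not taken from the paper.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(query_vecs, knowledge_base, k=2):
    """Rank references per channel, then merge the top-k lists.

    Returns (per_channel, merged): per_channel maps each channel name to its
    top-k reference IDs; merged is their union in first-seen order.
    The merge rule is an assumption, not the paper's method.
    """
    per_channel = {}
    for channel, q in query_vecs.items():
        ranked = sorted(
            knowledge_base,
            key=lambda item: cosine(q, item["vectors"][channel]),
            reverse=True,
        )
        per_channel[channel] = [item["id"] for item in ranked[:k]]

    merged = []
    for ids in per_channel.values():
        for ref_id in ids:
            if ref_id not in merged:
                merged.append(ref_id)
    return per_channel, merged


# Hypothetical knowledge base: each reference object carries one toy vector
# per channel. Real embeddings would come from image and text encoders.
KB = [
    {"id": "ref_a", "vectors": {"raw": [1.0, 0.0], "edge": [0.9, 0.1], "semantic": [0.0, 1.0]}},
    {"id": "ref_b", "vectors": {"raw": [0.0, 1.0], "edge": [0.1, 0.9], "semantic": [1.0, 0.0]}},
    {"id": "ref_c", "vectors": {"raw": [0.7, 0.7], "edge": [0.7, 0.7], "semantic": [0.7, 0.7]}},
]

# Hypothetical query artifact, embedded once per channel.
query = {"raw": [1.0, 0.1], "edge": [1.0, 0.0], "semantic": [1.0, 0.1]}

per_channel, candidates = retrieve(query, KB, k=2)
print(per_channel)
print(candidates)
```

In a pipeline of the kind the abstract outlines, the merged candidate list and the associated reference texts would then be passed to the VLM, which produces the chronological, geographical, and cultural attributions. That generation step is not sketched here because the abstract gives no details about the prompt or the model.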
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2509.20769, https://arxiv.org/pdf/2509.20769
- OA Status: green
- OpenAlex ID: https://openalex.org/W4414789187
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4414789187 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2509.20769 (Digital Object Identifier)
- Title: Provenance Analysis of Archaeological Artifacts via Multimodal RAG Systems (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2025 (year of publication)
- Publication date: 2025-09-25 (full publication date if available)
- Authors: T Zhang, Yilong Sun, Ruiliang Liu (list of authors in order)
- Landing page: https://arxiv.org/abs/2509.20769 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2509.20769 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2509.20769 (direct OA link when available)
- Cited by: 0 (total citation count in OpenAlex)
Full payload
| id | https://openalex.org/W4414789187 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2509.20769 |
| ids.doi | https://doi.org/10.48550/arxiv.2509.20769 |
| ids.openalex | https://openalex.org/W4414789187 |
| fwci | |
| type | preprint |
| title | Provenance Analysis of Archaeological Artifacts via Multimodal RAG Systems |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T14339 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.992900013923645 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1707 |
| topics[0].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[0].display_name | Image Processing and 3D Reconstruction |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2509.20769 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2509.20769 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2509.20769 |
| locations[1].id | doi:10.48550/arxiv.2509.20769 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | cc-by |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | https://openalex.org/licenses/cc-by |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2509.20769 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5090441210 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | T Zhang |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Zhang, Tuo |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5108590109 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Yilong Sun |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Sun, Yuechun |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5043259112 |
| authorships[2].author.orcid | https://orcid.org/0000-0001-9628-827X |
| authorships[2].author.display_name | Ruiliang Liu |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Liu, Ruiliang |
| authorships[2].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2509.20769 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Provenance Analysis of Archaeological Artifacts via Multimodal RAG Systems |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T14339 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.992900013923645 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1707 |
| primary_topic.subfield.display_name | Computer Vision and Pattern Recognition |
| primary_topic.display_name | Image Processing and 3D Reconstruction |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2509.20769 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2509.20769 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2509.20769 |
| primary_location.id | pmh:oai:arXiv.org:2509.20769 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2509.20769 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2509.20769 |
| publication_date | 2025-09-25 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 5, 33, 79 |
| abstract_inverted_index.In | 0 |
| abstract_inverted_index.We | 74 |
| abstract_inverted_index.by | 21, 58 |
| abstract_inverted_index.of | 13, 81, 115 |
| abstract_inverted_index.on | 78 |
| abstract_inverted_index.to | 17, 49, 61 |
| abstract_inverted_index.we | 3 |
| abstract_inverted_index.Age | 85 |
| abstract_inverted_index.The | 30 |
| abstract_inverted_index.VLM | 60 |
| abstract_inverted_index.and | 25, 40, 46, 68, 99, 109 |
| abstract_inverted_index.are | 56 |
| abstract_inverted_index.for | 10, 107 |
| abstract_inverted_index.raw | 43 |
| abstract_inverted_index.set | 80 |
| abstract_inverted_index.the | 59, 76, 88, 95, 112 |
| abstract_inverted_index.base | 36 |
| abstract_inverted_index.from | 37, 87 |
| abstract_inverted_index.that | 94 |
| abstract_inverted_index.this | 1 |
| abstract_inverted_index.vast | 117 |
| abstract_inverted_index.large | 26 |
| abstract_inverted_index.texts | 39 |
| abstract_inverted_index.work, | 2 |
| abstract_inverted_index.Bronze | 84 |
| abstract_inverted_index.Expert | 91 |
| abstract_inverted_index.burden | 114 |
| abstract_inverted_index.expert | 19 |
| abstract_inverted_index.models | 28 |
| abstract_inverted_index.points | 106 |
| abstract_inverted_index.system | 9, 31, 77, 96 |
| abstract_inverted_index.(VLMs). | 29 |
| abstract_inverted_index.British | 89 |
| abstract_inverted_index.Eastern | 82 |
| abstract_inverted_index.Museum. | 90 |
| abstract_inverted_index.images, | 41 |
| abstract_inverted_index.present | 4 |
| abstract_inverted_index.similar | 52 |
| abstract_inverted_index.support | 18 |
| abstract_inverted_index.visual, | 44 |
| abstract_inverted_index.Eurasian | 83 |
| abstract_inverted_index.analysis | 12, 108 |
| abstract_inverted_index.concrete | 104 |
| abstract_inverted_index.corpora. | 119 |
| abstract_inverted_index.cultural | 69 |
| abstract_inverted_index.designed | 16 |
| abstract_inverted_index.enabling | 42 |
| abstract_inverted_index.evaluate | 75 |
| abstract_inverted_index.generate | 62 |
| abstract_inverted_index.identify | 50 |
| abstract_inverted_index.objects. | 53 |
| abstract_inverted_index.offering | 102 |
| abstract_inverted_index.outputs, | 101 |
| abstract_inverted_index.produces | 97 |
| abstract_inverted_index.scholars | 103 |
| abstract_inverted_index.semantic | 47 |
| abstract_inverted_index.starting | 105 |
| abstract_inverted_index.Retrieved | 54 |
| abstract_inverted_index.alongside | 71 |
| abstract_inverted_index.artifacts | 86 |
| abstract_inverted_index.cognitive | 113 |
| abstract_inverted_index.including | 65 |
| abstract_inverted_index.knowledge | 35 |
| abstract_inverted_index.reasoning | 20 |
| abstract_inverted_index.reference | 38 |
| abstract_inverted_index.retrieval | 24, 48 |
| abstract_inverted_index.artifacts, | 15 |
| abstract_inverted_index.candidates | 55 |
| abstract_inverted_index.constructs | 32 |
| abstract_inverted_index.dual-modal | 34 |
| abstract_inverted_index.evaluation | 92 |
| abstract_inverted_index.generation | 7 |
| abstract_inverted_index.meaningful | 98 |
| abstract_inverted_index.multimodal | 23 |
| abstract_inverted_index.navigating | 116 |
| abstract_inverted_index.provenance | 11 |
| abstract_inverted_index.structured | 63 |
| abstract_inverted_index.(RAG)-based | 8 |
| abstract_inverted_index.alleviating | 111 |
| abstract_inverted_index.comparative | 118 |
| abstract_inverted_index.inferences, | 64 |
| abstract_inverted_index.integrating | 22 |
| abstract_inverted_index.synthesized | 57 |
| abstract_inverted_index.demonstrates | 93 |
| abstract_inverted_index.interpretive | 72 |
| abstract_inverted_index.attributions, | 70 |
| abstract_inverted_index.geographical, | 67 |
| abstract_inverted_index.interpretable | 100 |
| abstract_inverted_index.significantly | 110 |
| abstract_inverted_index.stylistically | 51 |
| abstract_inverted_index.archaeological | 14 |
| abstract_inverted_index.chronological, | 66 |
| abstract_inverted_index.edge-enhanced, | 45 |
| abstract_inverted_index.justifications. | 73 |
| abstract_inverted_index.vision-language | 27 |
| abstract_inverted_index.retrieval-augmented | 6 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile | |
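The `abstract_inverted_index.*` rows in the payload store the abstract as a map from each token to the word positions where it occurs. A minimal sketch of turning such rows back into running text follows; it assumes the flattened `| key | value |` row layout used in the table above, and the helper names `parse_rows` and `rebuild_abstract` are hypothetical. The sample uses only the five rows covering positions 0 to 4, copied from the table, so the output is the opening of the abstract rather than the full text.

```python
PREFIX = "abstract_inverted_index."


def parse_rows(rows):
    """Turn flattened table rows into {token: [positions]}.

    Rows whose key does not start with PREFIX are ignored.
    """
    index = {}
    for row in rows:
        cells = [c.strip() for c in row.strip().strip("|").split("|")]
        key, value = cells[0], cells[1]
        if key.startswith(PREFIX):
            index[key[len(PREFIX):]] = [int(p) for p in value.split(",")]
    return index


def rebuild_abstract(inverted_index):
    """Place each token at its positions and join the tokens in position order."""
    positions = {}
    for token, idxs in inverted_index.items():
        for i in idxs:
            positions[i] = token
    return " ".join(positions[i] for i in sorted(positions))


# Rows copied from the payload table (positions 0 through 4 only).
rows = [
    "| abstract_inverted_index.In | 0 |",
    "| abstract_inverted_index.this | 1 |",
    "| abstract_inverted_index.work, | 2 |",
    "| abstract_inverted_index.we | 3 |",
    "| abstract_inverted_index.present | 4 |",
]

print(rebuild_abstract(parse_rows(rows)))  # prints "In this work, we present"
```

Feeding all `abstract_inverted_index.*` rows through the same two functions should yield the complete abstract, since tokens with several positions (for example `a` at 5, 33, 79) are simply placed at each of them.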