Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs
2025 · Open Access · DOI: https://doi.org/10.48550/arxiv.2503.00151
As large language models (LLMs) become increasingly integrated into daily life, ensuring their cultural sensitivity and inclusivity is paramount. We introduce our dataset, a year-long community-driven project covering all 22 Arab countries. The dataset includes instructions (input, response pairs) in both Modern Standard Arabic (MSA) and dialectal Arabic (DA), spanning 20 diverse topics. Built by a team of 44 researchers across the Arab world, all of whom are authors of this paper, our dataset offers a broad, inclusive perspective. We use our dataset to evaluate the cultural and dialectal capabilities of several frontier LLMs, revealing notable limitations. For instance, while closed-source LLMs generally exhibit strong performance, they are not without flaws, and smaller open-source models face greater challenges. Moreover, certain countries (e.g., Egypt, the UAE) appear better represented than others (e.g., Iraq, Mauritania, Yemen). Our annotation guidelines, code, and data for reproducibility are publicly available.
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2503.00151, https://arxiv.org/pdf/2503.00151
- OA Status: green
- OpenAlex ID: https://openalex.org/W4415081309
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4415081309 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2503.00151 (Digital Object Identifier)
- Title: Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2025 (year of publication)
- Publication date: 2025-02-28 (full publication date if available)
- Authors: Fakhraddin Alwajih, Abdellah El Mekki, Samar M. Magdy, AbdelRahim Elmadany, Omer Nacar, El Moatez Billah Nagoudi, Reem Abdel‐Salam, Hanin Atwany, Youssef Nafea, Azmi Yahya, Rahaf Alhamouri, Hamzah A. Alsayadi, Hiba Zayed, Sara Shatnawi, Serry Sibaee, Yasir Ech-chammakhy, Walid Al-Dhabyani, M. A. B. Md Ali, Imen Jarraya, Ahmed Oumar El-Shangiti, Aisha Alraeesi, Mohammed Anwar Al-Ghrawi, Abdulrahman S. Al-Batati, Elgizouli Mohamed, Noha Taha Elgindi, Muhammed Saeed, Houdaifa Atou, Issam Ait Yahia, Abdelhak Bouayad, Mohammed Machrouh, Amal Makouar, Dania Alkawi, Mukhtar Mohamed, Safaa Taher Abdelfadil, Amine Ziad Ounnoughene, Rouabhia Anfel, Rwaa Assi, Ahmed Sorkatti, Mohamedou Cheikh Tourad, Anis Koubâa, Ismaïl Berrada, Mustafa Jarrar, Shady Shehata, Muhammad Abdul-Mageed (list of authors in order)
- Landing page: https://arxiv.org/abs/2503.00151 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2503.00151 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2503.00151 (direct OA link when available)
- Cited by: 0 (total citation count in OpenAlex)
Full payload
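The payload table uses dotted keys such as `topics[0].field.id` and comma-joined values such as `arxiv, datacite`, which suggests it was produced by flattening a nested JSON record. The sketch below shows one way such a flattening could work. The `flatten` function and its rules (lists of objects get an index, lists of scalars are joined with a comma, `None` becomes an empty cell) are assumptions inferred from how the rows look, not the page's actual code, and the `record` sample contains only a handful of fields copied from this page.

```python
def flatten(value, prefix=""):
    """Flatten nested dicts/lists into {dotted_key: string_value} rows.

    Hypothetical helper: the rules mirror how the payload table appears
    to be laid out; they are not taken from any OpenAlex tooling.
    """
    if isinstance(value, dict):
        rows = {}
        for key, child in value.items():
            child_prefix = f"{prefix}.{key}" if prefix else key
            rows.update(flatten(child, child_prefix))
        return rows
    if isinstance(value, list):
        # Lists that contain objects get one indexed key per element.
        if any(isinstance(item, (dict, list)) for item in value):
            rows = {}
            for i, item in enumerate(value):
                rows.update(flatten(item, f"{prefix}[{i}]"))
            return rows
        # Lists of scalars collapse into a single comma-joined cell.
        return {prefix: ", ".join(str(item) for item in value)}
    # Scalars: None becomes an empty cell, everything else is stringified.
    return {prefix: "" if value is None else str(value)}


# A small excerpt of the record, with values copied from this page.
record = {
    "id": "https://openalex.org/W4415081309",
    "fwci": None,
    "indexed_in": ["arxiv", "datacite"],
    "open_access": {"is_oa": True, "oa_status": "green"},
    "topics": [
        {
            "id": "https://openalex.org/T10181",
            "field": {"display_name": "Computer Science"},
        }
    ],
}

rows = flatten(record)
for key, value in rows.items():
    print(f"| {key} | {value} |")
```

Reading a row back is then a plain dictionary lookup, for example `rows["topics[0].field.display_name"]`.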
| Field | Value |
|---|---|
| id | https://openalex.org/W4415081309 |
| doi | https://doi.org/10.48550/arxiv.2503.00151 |
| ids.doi | https://doi.org/10.48550/arxiv.2503.00151 |
| ids.openalex | https://openalex.org/W4415081309 |
| fwci | |
| type | preprint |
| title | Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10181 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9980999827384949 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Natural Language Processing Techniques |
| topics[1].id | https://openalex.org/T13629 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9466000199317932 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | Text Readability and Simplification |
| topics[2].id | https://openalex.org/T12353 |
| topics[2].field.id | https://openalex.org/fields/12 |
| topics[2].field.display_name | Arts and Humanities |
| topics[2].score | 0.9222999811172485 |
| topics[2].domain.id | https://openalex.org/domains/2 |
| topics[2].domain.display_name | Social Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1203 |
| topics[2].subfield.display_name | Language and Linguistics |
| topics[2].display_name | Lexicography and Language Studies |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2503.00151 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2503.00151 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2503.00151 |
| locations[1].id | doi:10.48550/arxiv.2503.00151 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2503.00151 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5054154345 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-8530-4243 |
| authorships[0].author.display_name | Fakhraddin Alwajih |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Alwajih, Fakhraddin |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5024996988 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-7394-3611 |
| authorships[1].author.display_name | Abdellah El Mekki |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Mekki, Abdellah El |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5089179548 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | Samar M. Magdy |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Magdy, Samar Mohamed |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5025365353 |
| authorships[3].author.orcid | |
| authorships[3].author.display_name | AbdelRahim Elmadany |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Elmadany, Abdelrahim A. |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5093933812 |
| authorships[4].author.orcid | https://orcid.org/0000-0001-7493-9318 |
| authorships[4].author.display_name | Omer Nacar |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Nacar, Omer |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5041553790 |
| authorships[5].author.orcid | |
| authorships[5].author.display_name | El Moatez Billah Nagoudi |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Nagoudi, El Moatez Billah |
| authorships[5].is_corresponding | False |
| authorships[6].author.id | https://openalex.org/A5083129152 |
| authorships[6].author.orcid | https://orcid.org/0000-0002-4594-211X |
| authorships[6].author.display_name | Reem Abdel‐Salam |
| authorships[6].author_position | middle |
| authorships[6].raw_author_name | Abdel-Salam, Reem |
| authorships[6].is_corresponding | False |
| authorships[7].author.id | https://openalex.org/A5107607217 |
| authorships[7].author.orcid | |
| authorships[7].author.display_name | Hanin Atwany |
| authorships[7].author_position | middle |
| authorships[7].raw_author_name | Atwany, Hanin |
| authorships[7].is_corresponding | False |
| authorships[8].author.id | https://openalex.org/A5094122220 |
| authorships[8].author.orcid | |
| authorships[8].author.display_name | Youssef Nafea |
| authorships[8].author_position | middle |
| authorships[8].raw_author_name | Nafea, Youssef |
| authorships[8].is_corresponding | False |
| authorships[9].author.id | https://openalex.org/A5007002553 |
| authorships[9].author.orcid | https://orcid.org/0000-0003-2495-6974 |
| authorships[9].author.display_name | Azmi Yahya |
| authorships[9].author_position | middle |
| authorships[9].raw_author_name | Yahya, Abdulfattah Mohammed |
| authorships[9].is_corresponding | False |
| authorships[10].author.id | https://openalex.org/A5062548482 |
| authorships[10].author.orcid | |
| authorships[10].author.display_name | Rahaf Alhamouri |
| authorships[10].author_position | middle |
| authorships[10].raw_author_name | Alhamouri, Rahaf |
| authorships[10].is_corresponding | False |
| authorships[11].author.id | https://openalex.org/A5045319295 |
| authorships[11].author.orcid | https://orcid.org/0000-0002-6062-0899 |
| authorships[11].author.display_name | Hamzah A. Alsayadi |
| authorships[11].author_position | middle |
| authorships[11].raw_author_name | Alsayadi, Hamzah A. |
| authorships[11].is_corresponding | False |
| authorships[12].author.id | https://openalex.org/A5111361940 |
| authorships[12].author.orcid | |
| authorships[12].author.display_name | Hiba Zayed |
| authorships[12].author_position | middle |
| authorships[12].raw_author_name | Zayed, Hiba |
| authorships[12].is_corresponding | False |
| authorships[13].author.id | https://openalex.org/A5046591555 |
| authorships[13].author.orcid | |
| authorships[13].author.display_name | Sara Shatnawi |
| authorships[13].author_position | middle |
| authorships[13].raw_author_name | Shatnawi, Sara |
| authorships[13].is_corresponding | False |
| authorships[14].author.id | https://openalex.org/A5093081405 |
| authorships[14].author.orcid | https://orcid.org/0009-0009-5649-4111 |
| authorships[14].author.display_name | Serry Sibaee |
| authorships[14].author_position | middle |
| authorships[14].raw_author_name | Sibaee, Serry |
| authorships[14].is_corresponding | False |
| authorships[15].author.id | https://openalex.org/A5109022186 |
| authorships[15].author.orcid | |
| authorships[15].author.display_name | Yasir Ech-chammakhy |
| authorships[15].author_position | middle |
| authorships[15].raw_author_name | Ech-Chammakhy, Yasir |
| authorships[15].is_corresponding | False |
| authorships[16].author.id | https://openalex.org/A5036322379 |
| authorships[16].author.orcid | https://orcid.org/0000-0001-8934-2178 |
| authorships[16].author.display_name | Walid Al-Dhabyani |
| authorships[16].author_position | middle |
| authorships[16].raw_author_name | Al-Dhabyani, Walid |
| authorships[16].is_corresponding | False |
| authorships[17].author.id | https://openalex.org/A5084491596 |
| authorships[17].author.orcid | https://orcid.org/0000-0001-7203-0610 |
| authorships[17].author.display_name | M. A. B. Md Ali |
| authorships[17].author_position | middle |
| authorships[17].raw_author_name | Ali, Marwa Mohamed |
| authorships[17].is_corresponding | False |
| authorships[18].author.id | https://openalex.org/A5054692006 |
| authorships[18].author.orcid | https://orcid.org/0000-0003-1746-3143 |
| authorships[18].author.display_name | Imen Jarraya |
| authorships[18].author_position | middle |
| authorships[18].raw_author_name | Jarraya, Imen |
| authorships[18].is_corresponding | False |
| authorships[19].author.id | https://openalex.org/A5091963092 |
| authorships[19].author.orcid | |
| authorships[19].author.display_name | Ahmed Oumar El-Shangiti |
| authorships[19].author_position | middle |
| authorships[19].raw_author_name | El-Shangiti, Ahmed Oumar |
| authorships[19].is_corresponding | False |
| authorships[20].author.id | https://openalex.org/A5093976033 |
| authorships[20].author.orcid | |
| authorships[20].author.display_name | Aisha Alraeesi |
| authorships[20].author_position | middle |
| authorships[20].raw_author_name | Alraeesi, Aisha |
| authorships[20].is_corresponding | False |
| authorships[21].author.id | https://openalex.org/A5119181828 |
| authorships[21].author.orcid | |
| authorships[21].author.display_name | Mohammed Anwar Al-Ghrawi |
| authorships[21].author_position | middle |
| authorships[21].raw_author_name | Al-Ghrawi, Mohammed Anwar |
| authorships[21].is_corresponding | False |
| authorships[22].author.id | https://openalex.org/A5114270655 |
| authorships[22].author.orcid | |
| authorships[22].author.display_name | Abdulrahman S. Al-Batati |
| authorships[22].author_position | middle |
| authorships[22].raw_author_name | Al-Batati, Abdulrahman S. |
| authorships[22].is_corresponding | False |
| authorships[23].author.id | https://openalex.org/A5112340879 |
| authorships[23].author.orcid | |
| authorships[23].author.display_name | Elgizouli Mohamed |
| authorships[23].author_position | middle |
| authorships[23].raw_author_name | Mohamed, Elgizouli |
| authorships[23].is_corresponding | False |
| authorships[24].author.id | https://openalex.org/A5119181829 |
| authorships[24].author.orcid | |
| authorships[24].author.display_name | Noha Taha Elgindi |
| authorships[24].author_position | middle |
| authorships[24].raw_author_name | Elgindi, Noha Taha |
| authorships[24].is_corresponding | False |
| authorships[25].author.id | https://openalex.org/A5111378972 |
| authorships[25].author.orcid | |
| authorships[25].author.display_name | Muhammed Saeed |
| authorships[25].author_position | middle |
| authorships[25].raw_author_name | Saeed, Muhammed |
| authorships[25].is_corresponding | False |
| authorships[26].author.id | https://openalex.org/A5107642704 |
| authorships[26].author.orcid | |
| authorships[26].author.display_name | Houdaifa Atou |
| authorships[26].author_position | middle |
| authorships[26].raw_author_name | Atou, Houdaifa |
| authorships[26].is_corresponding | False |
| authorships[27].author.id | https://openalex.org/A5107746264 |
| authorships[27].author.orcid | |
| authorships[27].author.display_name | Issam Ait Yahia |
| authorships[27].author_position | middle |
| authorships[27].raw_author_name | Yahia, Issam Ait |
| authorships[27].is_corresponding | False |
| authorships[28].author.id | https://openalex.org/A5113051799 |
| authorships[28].author.orcid | |
| authorships[28].author.display_name | Abdelhak Bouayad |
| authorships[28].author_position | middle |
| authorships[28].raw_author_name | Bouayad, Abdelhak |
| authorships[28].is_corresponding | False |
| authorships[29].author.id | https://openalex.org/A5119181830 |
| authorships[29].author.orcid | |
| authorships[29].author.display_name | Mohammed Machrouh |
| authorships[29].author_position | middle |
| authorships[29].raw_author_name | Machrouh, Mohammed |
| authorships[29].is_corresponding | False |
| authorships[30].author.id | https://openalex.org/A5109022187 |
| authorships[30].author.orcid | |
| authorships[30].author.display_name | Amal Makouar |
| authorships[30].author_position | middle |
| authorships[30].raw_author_name | Makouar, Amal |
| authorships[30].is_corresponding | False |
| authorships[31].author.id | https://openalex.org/A5119181831 |
| authorships[31].author.orcid | |
| authorships[31].author.display_name | Dania Alkawi |
| authorships[31].author_position | middle |
| authorships[31].raw_author_name | Alkawi, Dania |
| authorships[31].is_corresponding | False |
| authorships[32].author.id | https://openalex.org/A5104329997 |
| authorships[32].author.orcid | |
| authorships[32].author.display_name | Mukhtar Mohamed |
| authorships[32].author_position | middle |
| authorships[32].raw_author_name | Mohamed, Mukhtar |
| authorships[32].is_corresponding | False |
| authorships[33].author.id | https://openalex.org/A5117865113 |
| authorships[33].author.orcid | |
| authorships[33].author.display_name | Safaa Taher Abdelfadil |
| authorships[33].author_position | middle |
| authorships[33].raw_author_name | Abdelfadil, Safaa Taher |
| authorships[33].is_corresponding | False |
| authorships[34].author.id | https://openalex.org/A5042385961 |
| authorships[34].author.orcid | |
| authorships[34].author.display_name | Amine Ziad Ounnoughene |
| authorships[34].author_position | middle |
| authorships[34].raw_author_name | Ounnoughene, Amine Ziad |
| authorships[34].is_corresponding | False |
| authorships[35].author.id | https://openalex.org/A5119957002 |
| authorships[35].author.orcid | |
| authorships[35].author.display_name | Rouabhia Anfel |
| authorships[35].author_position | middle |
| authorships[35].raw_author_name | Anfel, Rouabhia |
| authorships[35].is_corresponding | False |
| authorships[36].author.id | https://openalex.org/A5034915270 |
| authorships[36].author.orcid | |
| authorships[36].author.display_name | Rwaa Assi |
| authorships[36].author_position | middle |
| authorships[36].raw_author_name | Assi, Rwaa |
| authorships[36].is_corresponding | False |
| authorships[37].author.id | https://openalex.org/A5119181833 |
| authorships[37].author.orcid | |
| authorships[37].author.display_name | Ahmed Sorkatti |
| authorships[37].author_position | middle |
| authorships[37].raw_author_name | Sorkatti, Ahmed |
| authorships[37].is_corresponding | False |
| authorships[38].author.id | https://openalex.org/A5035607022 |
| authorships[38].author.orcid | https://orcid.org/0000-0003-1273-7512 |
| authorships[38].author.display_name | Mohamedou Cheikh Tourad |
| authorships[38].author_position | middle |
| authorships[38].raw_author_name | Tourad, Mohamedou Cheikh |
| authorships[38].is_corresponding | False |
| authorships[39].author.id | https://openalex.org/A5024626195 |
| authorships[39].author.orcid | https://orcid.org/0000-0003-3787-7423 |
| authorships[39].author.display_name | Anis Koubâa |
| authorships[39].author_position | middle |
| authorships[39].raw_author_name | Koubaa, Anis |
| authorships[39].is_corresponding | False |
| authorships[40].author.id | https://openalex.org/A5091770877 |
| authorships[40].author.orcid | https://orcid.org/0000-0003-4225-911X |
| authorships[40].author.display_name | Ismaïl Berrada |
| authorships[40].author_position | middle |
| authorships[40].raw_author_name | Berrada, Ismail |
| authorships[40].is_corresponding | False |
| authorships[41].author.id | https://openalex.org/A5050418400 |
| authorships[41].author.orcid | https://orcid.org/0000-0003-4351-4207 |
| authorships[41].author.display_name | Mustafa Jarrar |
| authorships[41].author_position | middle |
| authorships[41].raw_author_name | Jarrar, Mustafa |
| authorships[41].is_corresponding | False |
| authorships[42].author.id | https://openalex.org/A5075022926 |
| authorships[42].author.orcid | https://orcid.org/0000-0002-3258-6734 |
| authorships[42].author.display_name | Shady Shehata |
| authorships[42].author_position | middle |
| authorships[42].raw_author_name | Shehata, Shady |
| authorships[42].is_corresponding | False |
| authorships[43].author.id | https://openalex.org/A5004629670 |
| authorships[43].author.orcid | https://orcid.org/0000-0002-8590-2040 |
| authorships[43].author.display_name | Muhammad Abdul-Mageed |
| authorships[43].author_position | last |
| authorships[43].raw_author_name | Abdul-Mageed, Muhammad |
| authorships[43].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2503.00151 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-12T00:00:00 |
| display_name | Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10181 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9980999827384949 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Natural Language Processing Techniques |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2503.00151 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2503.00151 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2503.00151 |
| primary_location.id | pmh:oai:arXiv.org:2503.00151 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2503.00151 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2503.00151 |
| publication_date | 2025-02-28 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 23, 55, 75 |
| abstract_inverted_index.20 | 50 |
| abstract_inverted_index.22 | 29 |
| abstract_inverted_index.44 | 58 |
| abstract_inverted_index.As | 0 |
| abstract_inverted_index.We | 19, 79 |
| abstract_inverted_index.by | 54 |
| abstract_inverted_index.in | 39 |
| abstract_inverted_index.is | 17 |
| abstract_inverted_index.of | 57, 65, 69, 90 |
| abstract_inverted_index.to | 83 |
| abstract_inverted_index.For | 97 |
| abstract_inverted_index.Our | 134 |
| abstract_inverted_index.The | 32 |
| abstract_inverted_index.all | 28, 64 |
| abstract_inverted_index.and | 15, 45, 87, 111, 138 |
| abstract_inverted_index.are | 67, 107, 142 |
| abstract_inverted_index.for | 140 |
| abstract_inverted_index.not | 108 |
| abstract_inverted_index.our | 21, 72, 81 |
| abstract_inverted_index.the | 61, 85, 123 |
| abstract_inverted_index.use | 80 |
| abstract_inverted_index.Arab | 30, 62 |
| abstract_inverted_index.LLMs | 101 |
| abstract_inverted_index.UAE) | 124 |
| abstract_inverted_index.both | 40 |
| abstract_inverted_index.data | 139 |
| abstract_inverted_index.face | 115 |
| abstract_inverted_index.into | 8 |
| abstract_inverted_index.team | 56 |
| abstract_inverted_index.than | 128 |
| abstract_inverted_index.they | 106 |
| abstract_inverted_index.this | 70 |
| abstract_inverted_index.whom | 66 |
| abstract_inverted_index.(DA), | 48 |
| abstract_inverted_index.(MSA) | 44 |
| abstract_inverted_index.Built | 53 |
| abstract_inverted_index.Iraq, | 131 |
| abstract_inverted_index.LLMs, | 93 |
| abstract_inverted_index.code, | 137 |
| abstract_inverted_index.daily | 9 |
| abstract_inverted_index.large | 1 |
| abstract_inverted_index.life, | 10 |
| abstract_inverted_index.their | 12 |
| abstract_inverted_index.while | 99 |
| abstract_inverted_index.(LLMs) | 4 |
| abstract_inverted_index.(e.g., | 121, 130 |
| abstract_inverted_index.Arabic | 43, 47 |
| abstract_inverted_index.Egypt, | 122 |
| abstract_inverted_index.Modern | 41 |
| abstract_inverted_index.across | 60 |
| abstract_inverted_index.appear | 125 |
| abstract_inverted_index.become | 5 |
| abstract_inverted_index.better | 126 |
| abstract_inverted_index.broad, | 76 |
| abstract_inverted_index.flaws, | 110 |
| abstract_inverted_index.models | 3, 114 |
| abstract_inverted_index.offers | 74 |
| abstract_inverted_index.others | 129 |
| abstract_inverted_index.pairs) | 38 |
| abstract_inverted_index.paper, | 71 |
| abstract_inverted_index.strong | 104 |
| abstract_inverted_index.world, | 63 |
| abstract_inverted_index.(input, | 36 |
| abstract_inverted_index.Yemen). | 133 |
| abstract_inverted_index.authors | 68 |
| abstract_inverted_index.certain | 119 |
| abstract_inverted_index.dataset | 33, 73, 82 |
| abstract_inverted_index.diverse | 51 |
| abstract_inverted_index.exhibit | 103 |
| abstract_inverted_index.greater | 116 |
| abstract_inverted_index.notable | 95 |
| abstract_inverted_index.project | 26 |
| abstract_inverted_index.several | 91 |
| abstract_inverted_index.smaller | 112 |
| abstract_inverted_index.topics. | 52 |
| abstract_inverted_index.without | 109 |
| abstract_inverted_index.Standard | 42 |
| abstract_inverted_index.covering | 27 |
| abstract_inverted_index.cultural | 13, 86 |
| abstract_inverted_index.dataset, | 22 |
| abstract_inverted_index.ensuring | 11 |
| abstract_inverted_index.evaluate | 84 |
| abstract_inverted_index.frontier | 92 |
| abstract_inverted_index.includes | 34 |
| abstract_inverted_index.language | 2 |
| abstract_inverted_index.publicly | 143 |
| abstract_inverted_index.response | 37 |
| abstract_inverted_index.spanning | 49 |
| abstract_inverted_index.Moreover, | 118 |
| abstract_inverted_index.countries | 120 |
| abstract_inverted_index.dialectal | 46, 88 |
| abstract_inverted_index.generally | 102 |
| abstract_inverted_index.inclusive | 77 |
| abstract_inverted_index.instance, | 98 |
| abstract_inverted_index.introduce | 20 |
| abstract_inverted_index.revealing | 94 |
| abstract_inverted_index.year-long | 24 |
| abstract_inverted_index.annotation | 135 |
| abstract_inverted_index.available. | 144 |
| abstract_inverted_index.countries. | 31 |
| abstract_inverted_index.integrated | 7 |
| abstract_inverted_index.paramount. | 18 |
| abstract_inverted_index.Mauritania, | 132 |
| abstract_inverted_index.challenges. | 117 |
| abstract_inverted_index.guidelines, | 136 |
| abstract_inverted_index.inclusivity | 16 |
| abstract_inverted_index.open-source | 113 |
| abstract_inverted_index.represented | 127 |
| abstract_inverted_index.researchers | 59 |
| abstract_inverted_index.sensitivity | 14 |
| abstract_inverted_index.capabilities | 89 |
| abstract_inverted_index.increasingly | 6 |
| abstract_inverted_index.instructions | 35 |
| abstract_inverted_index.limitations. | 96 |
| abstract_inverted_index.performance, | 105 |
| abstract_inverted_index.perspective. | 78 |
| abstract_inverted_index.closed-source | 100 |
| abstract_inverted_index.reproducibility | 141 |
| abstract_inverted_index.community-driven | 25 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 44 |
| citation_normalized_percentile |
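
The `abstract_inverted_index.*` rows above store the abstract as a map from each token to the word positions where it occurs. A minimal sketch of how the running text can be rebuilt from such a map follows. The function name `rebuild_abstract` and its `max_position` parameter are hypothetical, and the `sample` dictionary holds only the first eleven tokens, copied from the rows above.

```python
def rebuild_abstract(inverted_index, max_position=None):
    """Rebuild text from a {token: [positions]} inverted index.

    Hypothetical helper. Positions missing from the index are skipped;
    max_position optionally limits output to positions <= that value.
    """
    by_position = {}
    for token, positions in inverted_index.items():
        for pos in positions:
            if max_position is None or pos <= max_position:
                by_position[pos] = token
    # Emit tokens in ascending position order, separated by single spaces.
    return " ".join(by_position[pos] for pos in sorted(by_position))


# Tokens and positions copied from the table rows above (excerpt only).
sample = {
    "As": [0],
    "large": [1],
    "language": [2],
    "models": [3, 114],
    "(LLMs)": [4],
    "become": [5],
    "increasingly": [6],
    "integrated": [7],
    "into": [8],
    "daily": [9],
    "life,": [10],
}

opening = rebuild_abstract(sample, max_position=10)
print(opening)
# As large language models (LLMs) become increasingly integrated into daily life,
```

The `max_position` cut-off is only there because the excerpt includes the second occurrence of `models` at position 114; with the complete index it can be left at its default.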