Hend S. Al‐Khalifa
YOU?
Author Swipe
Integrating Linguistic and Eye Movements Features for Arabic Text Readability Assessment Using ML and DL Models Open
Evaluating text readability is crucial for supporting both language learners and native readers in selecting appropriate materials. Cognitive psychology research, leveraging behavioral data such as eye-tracking and electroencephalogram (EE…
The Prompting Brain: Neurocognitive Markers of Expertise in Guiding Large Language Models Open
Prompt engineering has rapidly emerged as a critical skill for effective interaction with large language models (LLMs). However, the cognitive and neural underpinnings of this expertise remain largely unexplored. This paper presents findin…
Enhancing Propaganda Detection in Arabic News Context Through Multi-Task Learning Open
Social media has become a platform for the rapid spread of persuasion techniques that can negatively affect individuals and society. Propaganda detection, a crucial task in natural language processing, aims to identify manipulative content…
Eye Movement Patterns as Indicators of Text Complexity in Arabic: A Comparative Analysis of Classical and Modern Standard Arabic Open
This study investigates eye movement patterns as indicators of text complexity in Arabic, focusing on the comparative analysis of Classical Arabic (CA) and Modern Standard Arabic (MSA) text. Using the AraEyebility corpus, which contains ey…
Constructing and evaluating ArabicStanceX: a social media dataset for Arabic stance detection Open
Arabic stance detection has attracted significant interest due to the growing importance of social media in shaping public opinion. However, the lack of comprehensive datasets has limited research progress in Arabic Natural Language Proces…
English-Arabic Hybrid Semantic Text Chunking Based on Fine-Tuning BERT Open
Semantic text chunking refers to segmenting text into coherently semantic chunks, i.e., into sets of statements that are semantically related. Semantic chunking is an essential pre-processing step in various NLP tasks e.g., document summar…
View article: The Landscape of Arabic Large Language Models (ALLMs): A New Era for Arabic Language Technology
The Landscape of Arabic Large Language Models (ALLMs): A New Era for Arabic Language Technology Open
The emergence of ChatGPT marked a transformative milestone for Artificial Intelligence (AI), showcasing the remarkable potential of Large Language Models (LLMs) to generate human-like text. This wave of innovation has revolutionized how we…
AraEyebility: Eye-Tracking Data for Arabic Text Readability Open
Assessing text readability is important for helping language learners and readers select texts that match their proficiency levels. Research in cognitive psychology, which uses behavioral data such as eye-tracking and electroencephalogram …
MultiProSE: A Multi-label Arabic Dataset for Propaganda, Sentiment, and Emotion Detection Open
Propaganda is a form of persuasion that has been used throughout history with the intention goal of influencing people's opinions through rhetorical and psychological persuasion techniques for determined ends. Although Arabic ranked as the…
Arabic Temporal Common Sense Understanding Open
Natural language understanding (NLU) includes temporal text understanding, which can be complex and encompasses temporal common sense understanding. There are many challenges in comprehending common sense within a text. Currently, there is…
GLARE: Google Apps Arabic Reviews Dataset Open
This paper introduces GLARE an Arabic Apps Reviews dataset collected from Saudi Google PlayStore. It consists of 76M reviews, 69M of which are Arabic reviews of 9,980 Android Applications. We present the data collection methodology, along …
Arabic Temporal Common Sense Understanding Open
Natural Language Understanding (NLU) includes temporal text understanding, which can be complex and encompasses temporal common sense understanding. There are many challenges in comprehending common sense within a text. Currently, there ar…
A Survey of Large Language Models for Arabic Language and its Dialects Open
This survey offers a comprehensive overview of Large Language Models (LLMs) designed for Arabic language and its dialects. It covers key architectures, including encoder-only, decoder-only, and encoder-decoder models, along with the datase…
Arabic paraphrased parallel synthetic dataset Open
The Arabic paraphrased parallel dataset plays a crucial role in advancing NLP and other language-related applications by leveraging data from diverse sources and expanding it through data augmentation techniques. This dataset enhances mach…
A Benchmark Evaluation of Multilingual Large Language Models for Arabic Cross-Lingual Named-Entity Recognition Open
Multilingual large language models (MLLMs) have demonstrated remarkable performance across a wide range of cross-lingual Natural Language Processing (NLP) tasks. The emergence of MLLMs made it possible to achieve knowledge transfer from hi…
CLEANANERCorp: Identifying and Correcting Incorrect Labels in the ANERcorp Dataset Open
Label errors are a common issue in machine learning datasets, particularly for tasks such as Named Entity Recognition. Such label errors might hurt model training, affect evaluation results, and lead to an inaccurate assessment of model pe…
The Qiyas Benchmark: Measuring ChatGPT Mathematical and Language Understanding in Arabic Open
Despite the growing importance of Arabic as a global language, there is a notable lack of language models pre-trained exclusively on Arabic data. This shortage has led to limited benchmarks available for assessing language model performanc…
Natural Language Processing Patents Landscape Analysis Open
Understanding NLP patents provides valuable insights into innovation trends and competitive dynamics in artificial intelligence. This study uses the Lens patent database to investigate the landscape of NLP patents. The overall patent outpu…
View article: Accessible Metaverse: A Theoretical Framework for Accessibility and Inclusion in the Metaverse
Accessible Metaverse: A Theoretical Framework for Accessibility and Inclusion in the Metaverse Open
The following article investigates the Metaverse and its potential to bolster digital accessibility for persons with disabilities. Through qualitative analysis, we examine responses from eleven experts in digital accessibility, Metaverse d…
Error Analysis of Pretrained Language Models (PLMs) in English-to-Arabic Machine Translation Open
Advances in neural machine translation utilizing pretrained language models (PLMs) have shown promise in improving the translation quality between diverse languages. However, translation from English to languages with complex morphology, s…
Arabic Paraphrase Generation Using Transformer-Based Approaches Open
Paraphrasing, a ubiquitous linguistic practice involving the rephrasing of sentences while preserving their underlying meaning, holds substantial significance across various Natural Language Processing (NLP) applications. This research foc…
Quantifying Gender Bias in Arabic Pre-Trained Language Models Open
The current renaissance in the development of Arabic Pre-trained Language models (APLMs) has yielded significant advancement across many fields. Nevertheless, no study has explored the dimensions of gender bias in these models. It is argue…
ChatGPT across Arabic Twitter: A Study of Topics, Sentiments, and Sarcasm Open
While ChatGPT has gained global significance and widespread adoption, its exploration within specific cultural contexts, particularly within the Arab world, remains relatively limited. This study investigates the discussions among early Ar…
A Data-Driven Exploration of a New Islamic Fatwas Dataset for Arabic NLP Tasks Open
Islamic content is a broad and diverse domain that encompasses various sources, topics, and perspectives. However, there is a lack of comprehensive and reliable datasets that can facilitate conducting studies on Islamic content. In this pa…
Handwritten Arabic Character Recognition for Children Writing Using Convolutional Neural Network and Stroke Identification Open
Automatic Arabic handwritten recognition is one of the recently studied problems in the field of Machine Learning. Unlike Latin languages, Arabic is a Semitic language that forms a harder challenge, especially with the variability of patte…
Towards Designing a ChatGPT Conversational Companion for Elderly People Open
Loneliness and social isolation are serious and widespread problems among older people, affecting their physical and mental health, quality of life, and longevity. In this paper, we propose a ChatGPT-based conversational companion system f…
The Saudi Privacy Policy Dataset Open
This paper introduces the Saudi Privacy Policy Dataset, a diverse compilation of Arabic privacy policies from various sectors in Saudi Arabia, annotated according to the 10 principles of the Personal Data Protection Law (PDPL); the PDPL wa…
Fine-Tuning BERT-Based Pre-Trained Models for Arabic Dependency Parsing Open
With the advent of pre-trained language models, many natural language processing tasks in various languages have achieved great success. Although some research has been conducted on fine-tuning BERT-based models for syntactic parsing, and …
Intelligent Framework for Detecting Predatory Publishing Venues Open
Predatory publishing venues publish questionable articles and pose a global threat to the integrity and quality of the scientific literature. They have given rise to the dark side of scholarly publishing and their effects have reached poli…