Salam Khalifa
Computational Morphology and Lexicography Modeling of Modern Standard Arabic Nominals
Modern Standard Arabic (MSA) nominals present many morphological and lexical modeling challenges that have not been consistently addressed previously. This paper attempts to define the space of such challenges, and leverage a recently prop…
Exploring Linguistic Probes for Morphological Generalization
Modern work on the cross-linguistic computational modeling of morphological inflection has typically employed language-independent data splitting algorithms. In this paper, we supplement that approach with language-specific probes designed…
Morphological Inflection: A Reality Check
Morphological inflection is a popular task in sub-word NLP with both practical and cognitive applications. For years now, state-of-the-art systems have reported high, but also highly variable, performance across data sets and languages. We…
Deep Active Learning for Morphophonological Processing
Seyed Morteza Mirbostani, Yasaman Boreshban, Salam Khalifa, SeyedAbolghasem Mirroshandel, Owen Rambow. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2023.
SIGMORPHON–UniMorph 2023 Shared Task 0: Typologically Diverse Morphological Inflection
Omer Goldman, Khuyagbaatar Batsuren, Salam Khalifa, Aryaman Arora, Garrett Nicolai, Reut Tsarfaty, Ekaterina Vylomova. Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology. 2023.
A Cautious Generalization Goes a Long Way: Learning Morphophonological Rules
Explicit linguistic knowledge, encoded by resources such as rule-based morphological analyzers, continues to prove useful in downstream NLP tasks, especially for low-resource languages and dialects. Rules are an important asset in descript…
Exploring Linguistic Probes for Morphological Inflection
Modern work on the cross-linguistic computational modeling of morphological inflection has typically employed language-independent data splitting algorithms. In this paper, we supplement that approach with language-specific probes designed…
UniMorph 4.0: Universal Morphology
The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a lang…
Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator
Arabic is a morphologically rich and complex language, with numerous dialectal variants. Previous efforts on Arabic morphology modeling focused on specific variants and specific domains using a range of techniques with different degrees of…
Towards Learning Arabic Morphophonology
One core challenge facing morphological inflection systems is capturing language-specific morphophonological changes. This is particularly true of languages like Arabic which are morphologically complex. In this paper, we learn explicit mo…
Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its Dialects
We present state-of-the-art results on morphosyntactic tagging across different varieties of Arabic using fine-tuned pre-trained transformer language models. Our models consistently outperform existing systems in Modern Standard Arabic and…
SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection
The 2022 SIGMORPHON–UniMorph shared task on large scale morphological inflection generation included a wide range of typologically diverse languages: 33 languages from 11 top-level language families: Arabic (Modern Standard), Assamese, Bra…
SIGMORPHON–UniMorph 2022 Shared Task 0: Modeling Inflection in Language Acquisition
This year’s iteration of the SIGMORPHON–UniMorph shared task on “human-like” morphological inflection generation focuses on generalization and errors in language acquisition. Systems are trained on data sets extracted from corpora of child-…
SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages
This year's iteration of the SIGMORPHON Shared Task on morphological reinflection focuses on typological diversity and cross-lingual variation of morphosyntactic features. In terms of the task, we enrich UniMorph with new data for 32 langu…
A Little Linguistics Goes a Long Way: Unsupervised Segmentation with Limited Language Specific Guidance
We present de-lexical segmentation, a linguistically motivated alternative to greedy or other unsupervised methods, requiring only minimal language specific input. Our technique involves creating a small grammar of closed-class affixes whi…
MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction
In this paper, we introduce MADARi, a joint morphological annotation and spelling correction system for texts in Standard and Dialectal Arabic. The MADARi framework provides intuitive interfaces for annotating text and managing the annotat…
An Arabic Morphological Analyzer and Generator with Copious Features
We introduce CALIMA-Star, a very rich Arabic morphological analyzer and generator that provides functional and form-based morphological features as well as built-in tokenization, phonological representation, lexical rationality and much mo…
A Morphological Analyzer for Gulf Arabic Verbs
We present CALIMAGLF, a Gulf Arabic morphological analyzer currently covering over 2,600 verbal lemmas. We describe in detail the process of building the analyzer starting from phonetic dictionary entries to fully inflected orthographic pa…
A Large Scale Corpus of Gulf Arabic
Most Arabic natural language processing tools and resources are developed to serve Modern Standard Arabic (MSA), which is the official written language in the Arab World. Some Dialectal Arabic varieties, notably Egyptian Arabic, have recei…
Improving Arabic Diacritization through Syntactic Analysis
We present an approach to Arabic automatic diacritization that integrates syntactic analysis with morphological tagging through improving the prediction of case and state features. Our best system increases the accuracy of word diacritizati…