Romanization ≈ RomanizationRomanization
View article
Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages Open
Under-resourced languages are a significant challenge for statistical approaches to machine translation, and recently it has been shown that the usage of training data from closely-related languages can improve machine translation quality …
View article
Analyzing Sentiments in eLearning: A Comparative Study of Bangla and Romanized Bangla Text Using Transformers Open
In the modern world, learning is becoming increasingly critical due to rapid technological breakthroughs, which highlight the need for continuous skill development in both the personal and professional spheres. As a result, eLearning is a …
View article
Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent Open
We present Brahmi-Net -an online system for transliteration and script conversion for all major Indian language pairs (306 pairs).The system covers 13 Indo-Aryan languages, 4 Dravidian languages and English.For training the transliteration…
View article
Sentiment Analysis on Bangla and Romanized Bangla Text (BRBT) using Deep Recurrent models Open
Sentiment Analysis (SA) is an action research area in the digital age. With rapid and constant growth of online social media sites and services, and the increasing amount of textual data such as - statuses, comments, reviews etc. available…
View article
Social Factors in the Latinization of the Roman West Open
Latinization is a strangely overlooked topic. Historians have noted it has been ‘taken for granted’ and viewed as an unremarkable by-product of ‘Romanization’, despite its central importance for understanding the Roman provincial world, it…
View article
Exploring the Multilingual Applications of ChatGPT Open
ChatGPT's ability to realistically mimic human conversation and its high level of ability to handle linguistic ambiguity opens new and exciting avenues in language learning. Building upon the technical affordances of ChatGPT, this study ex…
View article
Building CMU Sphinx language model for the Holy Quran using simplified Arabic phonemes Open
In Malaysia, breast cancer is the commonest cancer in all ethnic. Breast cancer in
\nMalaysian women occurs in the younger age group compared with Western countries and
\nthe fourth most common cause of death among all cancers in Malaysia.…
View article
Natural language processing and machine learning based cyberbullying detection for Bangla and Romanized Bangla texts Open
The popularity of social media has been increasing tremendously in recent times and thus cyberbullying towards people has also increased at an alarming rate. Many cyberbullying texts can be found in the comment sections of many well-known …
View article
Out-of-the-box Universal Romanization Tool uroman Open
We present uroman, a tool for converting text in myriads of languages and scripts such as Chinese, Arabic and Cyrillic into a common Latin-script representation. The tool relies on Unicode data and other tables, and handles nearly all char…
View article
Romanization and Latinization of the Roman Empire in the light of data in the Computerized Historical Linguistic Database of Latin Inscriptions of the Imperial Age Open
The present study demonstrates that the process of linguistic Romanization, i.e. Latinization of the Roman Empire, is traceable by the data of the Computerized Historical Linguistic Database of Latin Inscriptions of the Imperial Age (LLDB)…
View article
Culture and Society in Later Roman Antioch: Papers from a Colloquium, London, 15th December 2001 Open
Introduction (Isabella Sandwell) Libanius and higher education at Antioch (Samuel N C Lieu) Communality and theatre in Libanius' Oration LXIV In Defence of the Pantomimes (Johannes Haubold and Richard Miles) Christian self-definition in th…
View article
Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset Open
This paper describes the Dakshina dataset, a new resource consisting of text in both the Latin and native scripts for 12 South Asian languages. The dataset includes, for each language: 1) native script Wikipedia text; 2) a romanization lex…
View article
‘Provincial Law’ in Britannia Open
‘In contrast to the Hellenized provinces of the East, the Western provinces—and especially those within the Libyan, Iberian, Celtic and Germanic linguistic zones—seem to present a relatively “barren” pre-Roman legal landscape….’ (Humfress …
View article
Archaeology and Zooarchaeology of the Late Iron Age-Roman Transition in the Province of Raetia (100<span>bc</span>–100<span>ad</span>) Open
The incorporation of the region north of the Alpine divide and its foreland into the Imperium Romanum initiated major changes in economic and social structure and in everyday life in the newly-founded province of Raetia. Controversy exists…
View article
Arabic–Chinese Neural Machine Translation: Romanized Arabic as Subword Unit for Arabic-sourced Translation Open
Morphologically rich and complex languages such as Arabic, pose a major challenge to neural machine translation (NMT) due to the large number of rare words and the inability of NMT to translate them. Unknown word (UNK) symbols are used to …
View article
Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users Open
Transliteration is very important in the Indian language context due to the usage of multiple scripts and the widespread use of romanized inputs. However, few training and evaluation sets are publicly available. We introduce Aksharantar, t…
View article
The Roman Legacy on European Chestnut and Walnut Arboriculture Open
The political and administrative unification process under the Roman Empire resulted not only in a progressive linguistic, religious, and cultural homogenisation of the concerned population, but also in the need of trade and exchanges for …
View article
Atar: Attention-based LSTM for Arabizi transliteration Open
A non-standard romanization of Arabic script, known as Arbizi, is widely used in Arabic online and SMS/chat communities. However, since state-of-the-art tools and applications for Arabic NLP expects Arabic to be written in Arabic script, h…
View article
Global English-related Digraphia and Roman-Cyrillic Biscriptal Practices Open
This paper deals with the new sociolinguistic phenomenon of global English-related digraphia, or the use of the Roman script, commonly associated with English, to represent local languages alongside the native scripts. In non-English speak…
View article
Knowledge Representation and Phonological Rules for the Automatic Transliteration of Balinese Script on Palm Leaf Manuscript Open
"Balinese ancient palm leaf manuscripts record many important kn owledges about world civilization histories. They vary from ordinary texts to Bali’s most sacred writings. In reality, the majority of Balinese can not read it because of lan…
View article
Toward Normalizing Romanized Gurumukhi Text from Social Media Open
Roman characters are used to write Indian language text on social media like facebook and twitter. Processing this text for NLP applications is not a trivial task. This text needs to be transliterated as well as conversion to canonical for…
View article
Identification of Issues and Challenges in Romanized Sindhi Text Open
Now-a-days Sindhi language is widely used in internet for the various purposes such as: newspapers, Sindhi literature, books, educational/official websites and social networks communications, teaching and learning processes. Having develop…
View article
Dystypia after ischemic stroke: a disturbance of linguistic processing for Romaji (Romanized Japanese)? Open
"Dystypia", characterized by an impairment of typing on a keyboard, is a unique neurobehavioral syndrome. A 77-year-old right-handed woman developed a relatively selective impairment of typing after ischemic stroke. The MRI documented new …
View article
Writing Arabizi: Orthographic Variation in Romanized Lebanese Arabic on Twitter Open
Over the past few decades, a new form of writing has emerged across the Arab world. Known as Arabizi, it is
\na type of Romanized Arabic that uses Latin characters instead of Arabic script. It is mainly used by youth in
\ntechnology-relate…
View article
Joint Approach to Deromanization of Code-mixed Texts Open
The conversion of romanized texts back to the native scripts is a challenging task because of the inconsistent romanization conventions and non-standard language use. This problem is compounded by code-mixing, i.e., using words from more t…
View article
GRT: Gurmukhi to Roman Transliteration System using Character Mapping and Handcrafted Rules Open
In the last two decades, the transliteration system has got significant research attention. It is observed that Punjabi to English transliteration for all type of part-of-speech words is comparably less studied. Currently, some research wo…
View article
Analysing bilingualism and biscriptality in medieval Scandinavian epigraphic sources: a sociolinguistic approach Open
Written culture in high and late medieval Scandinavia is characterized by a long and complex relationship between the Latin written tradition and the older native runic one. One product of the intersection of these traditions are several e…
View article
Linguistic Landscape for Korean Learning: A Survey of Perception, Attitude, and Practice of Korean Beginners at a Korean University Open
This study aimed to investigate the perception of, attitude to and practice of linguistic landscape for Korean learning among the international Korean beginners. A questionnaire as a self-assessment instrument was given to a group of 41 in…
View article
Türkiye’deki Eski Uygurca Metin Neşirleri İçin Kullanılacak Harfçevrim ve Yazıçevrim Kılavuzu Open
Transliteration (< trans + litera) is the spelling of words from one script with characters from the another. Transcription (< trans + script) is the representation of the sound of words in a language using any set of symbols you may…
View article
Building a Religious Empire : Tibetan Buddhism, Bureaucracy, and the Rise of the Gelukpa Open
"This book focuses on the story of the Geluk (Tibetan Dge lugs) school of Tibetan Buddhism, the most widespread school of Tibetan Buddhism, best known through its symbolic head, the Dalai Lama. The vast majority of the monasteries in Tibet…