Shantipriya Parida
Multiscalar brain adaptability in AI systems Open
The advent of Generative AI has transformed creative and analytical landscapes, leveraging vast datasets to produce sophisticated outputs with remarkable efficiency. Despite these advancements, human judgment and adaptability remain indisp…
Modern Technology: A Potential Tool to Impart English Language Skills Open
The present paper discusses the Lingua Franca of the world ‘English language’ and the role of technology in learning and teaching the English language effectively. The 21st-century modern technology has enabled learning and teaching of Eng…
Building pre-train LLM Dataset for the INDIC Languages: a case study on Hindi Open
Large language models (LLMs) demonstrated transformative capabilities in many applications that require automatically generating responses based on human instruction. However, the major challenge for building LLMs, particularly in Indic la…
‘Quantum of information’ functionality as a measure of subjectivity beyond the capabilities of deep learning. Open
The potential of conscious artificial intelligence (AI), with its functional systems that surpass automation and rely on elements of understanding, is a beacon of hope in the AI revolution. The shift from automation to conscious AI, once r…
Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language Open
Sámi, an indigenous language group comprising multiple languages, faces digital marginalization due to the limited availability of data and sophisticated language models designed for its linguistic intricacies. This work focuses on increas…
A Reinforcement Learning Approach for Intelligent Conversational Chatbot For Enhancing Mental Health Therapy Open
Building a Llama2-finetuned LLM for Odia Language Utilizing Domain Knowledge Instruction Set Open
Building LLMs for languages other than English is in great demand due to the unavailability and performance of multilingual LLMs, such as understanding the local context. The problem is critical for low-resource languages due to the need f…
Intentionality for better communication in minimally conscious AI design Open
Consciousness is the ability to have intentionality, which is a process that operates at various temporal scales. To qualify as conscious, an artificial device must express functionality capable of solving the Intrinsicality problem, where…
HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language Open
This paper presents HaVQA, the first multimodal dataset for visual question-answering (VQA) tasks in the Hausa language. The dataset was created by manually translating 6,022 English question-answer pairs, which are associated with 1,555 u…
Machine Translation by Projecting Text into the Same Phonetic-Orthographic Space Using a Common Encoding Open
The use of subword embedding has proved to be a major innovation in Neural Machine Translation (NMT). It helps NMT to learn better context vectors for Low Resource Languages (LRLs) so as to predict the target words by better modelling the …
HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language Open
Shantipriya Parida, Idris Abdulmumin, Shamsuddeen Hassan Muhammad, Aneesh Bose, Guneet Singh Kohli, Ibrahim Said Ahmad, Ketan Kotwal, Sayan Deb Sarkar, Ondřej Bojar, Habeebah Kakudi. Findings of the Association for Computational Linguistic…
Silo NLP's Participation at WAT2022 Open
This paper provides the system description of "Silo NLP's" submission to the Workshop on Asian Translation (WAT2022). We have participated in the Indic Multimodal tasks (English->Hindi, English->Malayalam, and English->Bengali Multimodal T…
Universal Dependency Treebank for Odia Language Open
This paper presents the first publicly available treebank of Odia, a morphologically rich low resource Indian language. The treebank contains approx. 1082 tokens (100 sentences) in Odia selected from "Samantar", the largest available paral…
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation Open
Multi-modal Machine Translation (MMT) enables the use of visual information to enhance the quality of translations. The visual information can serve as a valuable piece of context information to decrease the ambiguity of input sentences. D…
Bengali Visual Genome: A Multimodal Dataset for Machine Translation and Image Captioning Open
Multimodal machine translation (MMT) refers to the extraction of information from more than one modality aiming at performance improvement by utilizing information collected from the modalities other than pure text. The availability of mul…
A Blockchain and NLP Based Electronic Health Record System: Indian Subcontinent Context Open
The healthcare system in the Indian subcontinent is plagued with numerous issues related to the access, transfer, and storage of patient's medical records. The lack of infrastructure to properly communicate and track records between all ke…
Extreme Learning Machines with feature selection using GA for effective prediction of fetal heart disease: A Novel Approach Open
Heart disease is considered to be the most life-threatening ailment in the entire world and has been a major concern of developing countries. Heart disease also affects the fetus, which can be detected by cardiotocography tests conducted o…
Applying Attention-Based Models for Detecting Cognitive Processes and Mental Health Conditions Open
Malayalam Visual Genome 1.0 Open
Malayalam Visual Genome (MVG for short) 1.0 has similar goals as Hindi Visual Genome (HVG) 1.1: to support the Malayalam language. Malayalam Visual Genome 1.0 is the first multi-modal dataset in Malayalam for machine translati…
NLPHut’s Participation at WAT2021 Open
Shantipriya Parida, Subhadarshi Panda, Ketan Kotwal, Amulya Ratna Dash, Satya Ranjan Dash, Yashvardhan Sharma, Petr Motlicek, Ondřej Bojar. Proceedings of the 8th Workshop on Asian Translation (WAT2021). 2021.
Overview of the 8th Workshop on Asian Translation Open
Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Sadao Kurohashi.…
Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution) Open
This paper describes the team (“Tamalli”)’s submission to AmericasNLP2021 shared task on Open Machine Translation for low resource South American languages. Our goal was to evaluate different Machine Translation (MT) techniques, statistica…
Multimodal Neural Machine Translation System for English to Bengali Open
Multimodal Machine Translation (MMT) systems utilize additional information from other modalities beyond text to improve the quality of machine translation (MT). The additional modality is typically in the form of images. Despite proven adva…
Predicting the Causal Effect Relationship Between COPD and Cardio Vascular Diseases Open
Chronic Obstructive Pulmonary Disease (COPD) is one of the critical factors affecting the health of the population worldwide, and in most cases it affects patients with cardiovascular diseases and their mortality. The onset of COP…
Inferring Highly-dense Representations for Clustering Broadcast Media Content Open
We propose to employ a low-resolution representation for accurately categorizing spoken documents. Our proposed approach guarantees document clusters using a highly dense representation. Performed experiments, using a dataset from a German T…
OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation Open
The preparation of parallel corpora is a challenging task, particularly for languages that suffer from under-representation in the digital world. In a multi-lingual country like India, the need for such parallel corpora is stringent for se…
Epileptic seizure detection: a comparative study between deep and traditional machine learning techniques Open
Electroencephalography is the recording of brain electrical activities that can be used to diagnose brain seizure disorders. By identifying brain activity patterns and their correspondence between symptoms and diseases, it is possible to g…
BertAA : BERT fine-tuning for Authorship Attribution. Open
Idiap Submission to Swiss-German Language Detection Shared Task. Open
Language detection is a key part of the NLP pipeline for text processing. The task of automatically detecting languages belonging to disjoint groups is relatively easy. It is considerably challenging to detect languages that have similar o…
Idiap and UAM Participation at MEX-A3T Evaluation Campaign. Open
This paper describes our participation in the shared evaluation campaign of MexA3T 2020. Our main goal was to evaluate a Supervised Autoencoder (SAE) learning algorithm in text classification tasks. For our experiments, we used three differe…