Maite Taboada
YOU?
Author Swipe
View article: Attitude in Reported and Non-reported News: A Critique of Sentiment Analysis in Corpus Pragmatics
Attitude in Reported and Non-reported News: A Critique of Sentiment Analysis in Corpus Pragmatics Open
This study uses natural language processing (NLP) tools to examine a large Canadian English-language news corpus with respect to quotation and positive/negative sentiment. Specifically, we analyse sentiment in reported/quoted speech in com…
View article: Reported speech and gender in the news: Who is quoted, how are they quoted, and why it matters
Reported speech and gender in the news: Who is quoted, how are they quoted, and why it matters Open
News stories have a well-defined generic structure, consisting of components such as headline, lede, and body, with reported speech a prominent feature, especially in hard news stories. Reported speech serves multiple purposes, from provid…
View article: The ‘adverb-ly adjective’ construction in English: meanings, distribution and discourse functions
The ‘adverb-ly adjective’ construction in English: meanings, distribution and discourse functions Open
We investigate a class of adjective phrases composed of a deadjectival adverb ending in -ly and an adjective head (e.g. staggeringly incompetent , absolutely terrific , fiscally responsible ), a compact construction whereby two adjectives …
View article: Dimensions of Online Conflict: Towards Modeling Agonism
Dimensions of Online Conflict: Towards Modeling Agonism Open
Agonism plays a vital role in democratic dialogue by fostering diverse perspectives and robust discussions. Within the realm of online conflict there is another type: hateful antagonism, which undermines constructive dialogue. Detecting co…
View article: Radar de Parité: An NLP system to measure gender representation in French news stories
Radar de Parité: An NLP system to measure gender representation in French news stories Open
We present the Radar de Parité, an automated Natural Language Processing (NLP) system that measures the proportion of women and men quoted daily in six Canadian French-language media outlets. We outline the systemâs architecture and det…
View article: Radar de Parité: An NLP system to measure gender representation in French news stories
Radar de Parité: An NLP system to measure gender representation in French news stories Open
We present the Radar de Parité, an automated Natural Language Processing (NLP) system that measures the proportion of women and men quoted daily in six Canadian French-language media outlets. We outline the system's architecture and detail…
View article: Classifying constructive comments
Classifying constructive comments Open
We introduce the Constructive Comments Corpus (C3), comprised of 12,000 annotated news comments, intended to help build new tools for online communities to improve the quality of their discussions. We define constructive comments as high-q…
View article: Dimensions of Online Conflict: Towards Modeling Agonism
Dimensions of Online Conflict: Towards Modeling Agonism Open
Matt Canute, Mali Jin, Hannah Holtzclaw, Alberto Lusoli, Philippa Adams, Mugdha Pandya, Maite Taboada, Diana Maynard, Wendy Hui Kyong Chun. Findings of the Association for Computational Linguistics: EMNLP 2023. 2023.
View article: Gender Bias in the News: A Scalable Topic Modelling and Visualization Framework
Gender Bias in the News: A Scalable Topic Modelling and Visualization Framework Open
We present a topic modelling and data visualization methodology to examine gender-based disparities in news articles by topic. Existing research in topic modelling is largely focused on the text mining of closed corpora, i.e., those that i…
View article: Characterising Online News Comments: A Multi-Dimensional Cruise Through Online Registers
Characterising Online News Comments: A Multi-Dimensional Cruise Through Online Registers Open
News organisations often allow public comments at the bottom of their news stories. These comments constitute a fruitful source of data to investigate linguistic variation online; their characteristics, however, are rather understudied. Th…
View article: A corpus analysis of online news comments using the Appraisal framework
A corpus analysis of online news comments using the Appraisal framework Open
We present detailed analyses of the distribution of Appraisal categories (Martin and White, 2005) in a corpus of online news comments. The corpus consists of just over one thousand comments posted in response to a variety of opinion pieces…
View article: NLE volume 27 issue 2 Cover and Back matter
NLE volume 27 issue 2 Cover and Back matter Open
An abstract is not available for this content so a preview has been provided. As you have access to this content, a full PDF is available via the ‘Save PDF’ action button.
View article: The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media
The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media Open
We examine gender bias in media by tallying the number of men and women quoted in news text, using the Gender Gap Tracker, a software system we developed specifically for this purpose. The Gender Gap Tracker downloads and analyzes the onli…
View article: Generic Structure and Rhetorical Relations of Online Book Reviews in English, Japanese and Chinese
Generic Structure and Rhetorical Relations of Online Book Reviews in English, Japanese and Chinese Open
We examine the generic structure and rhetorical relations that characterise online book reviews in English, Japanese and Chinese to describe the pragmatic features of this emerging genre in a contrastive light. The corpus we analyse contai…
View article: Classifying Constructive Comments
Classifying Constructive Comments Open
We introduce the Constructive Comments Corpus (C3), comprised of 12,000 annotated news comments, intended to help build new tools for online communities to improve the quality of their discussions. We define constructive comments as high-q…
View article: The semantics of evaluational adjectives
The semantics of evaluational adjectives Open
We apply the Natural Semantic Metalanguage (NSM) approach ( Goddard & Wierzbicka 2014 ) to the lexical-semantic analysis of English evaluational adjectives and compare the results with the picture developed in the Appraisal Framework ( Mar…
View article: The SFU Opinion and Comments Corpus: A Corpus for the Analysis of Online News Comments
The SFU Opinion and Comments Corpus: A Corpus for the Analysis of Online News Comments Open
We present the SFU Opinion and Comments Corpus (SOCC ), a collection of opinion articles and the comments posted in response to the articles. The articles include all the opinion pieces published in the Canadian newspaper The Globe and Mai…
View article: Multiple Signals of Coherence Relations
Multiple Signals of Coherence Relations Open
In this paper, we investigate the signalling of coherence relations when they are simultaneously indicated by more than one signal. In particular, we examine the co-occurrence of discourse markers and other relational signals when they are…
View article: Big Data and quality data for fake news and misinformation detection
Big Data and quality data for fake news and misinformation detection Open
Fake news has become an important topic of research in a variety of disciplines including linguistics and computer science. In this paper, we explain how the problem is approached from the perspective of natural language processing, with t…
View article: Introduction to the Special Issue on Language in Social Media: Exploiting Discourse and Other Contextual Information
Introduction to the Special Issue on Language in Social Media: Exploiting Discourse and Other Contextual Information Open
Social media content is changing the way people interact with each other and share information, personal messages, and opinions about situations, objects, and past experiences. Most social media texts are short online conversational posts …
View article: The Data Challenge in Misinformation Detection: Source Reputation vs. Content Veracity
The Data Challenge in Misinformation Detection: Source Reputation vs. Content Veracity Open
Misinformation detection at the level of full news articles is a text classification problem. Reliably labeled data in this domain is rare. Previous work relied on news articles collected from so-called "reputable" and "suspicious" website…
View article: Subtopic annotation and automatic segmentation for news texts in Brazilian Portuguese
Subtopic annotation and automatic segmentation for news texts in Brazilian Portuguese Open
Subtopic segmentation aims to break documents into subtopical text passages, which develop a main topic in a text. Being capable of automatically detecting subtopics is very useful for several Natural Language Processing applications. For …
View article: The Good, the Bad, and the Disagreement: Complex ground truth in rhetorical structure analysis
The Good, the Bad, and the Disagreement: Complex ground truth in rhetorical structure analysis Open
We present a proposal to analyze disagreement in Rhetorical Structure Theory annotation which takes into account what we consider "legitimate" disagreements.In rhetorical analysis, as in many other pragmatic annotation tasks, a certain amo…
View article: Constructive Language in News Comments
Constructive Language in News Comments Open
We discuss the characteristics of constructive news comments, and present methods to identify them. First, we define the notion of constructiveness. Second, we annotate a corpus for constructiveness. Third, we explore whether available arg…
View article: Using lexical level information in discourse structures for Basque sentiment analysis
Using lexical level information in discourse structures for Basque sentiment analysis Open
Systems for opinion and sentiment analysis rely on different resources: a lexicon, annotated corpora and constraints (morphological, syntactic or discursive), depending on the nature of the language or text type.In this respect, Basque is …
View article: Using New York Times Picks to Identify Constructive Comments
Using New York Times Picks to Identify Constructive Comments Open
We examine the extent to which we are able to automatically identify constructive online comments. We build several classifiers using New York Times Picks as positive examples and non-constructive thread comments from the Yahoo News Annota…
View article: Evaluative Language Beyond Bags of Words: Linguistic Insights and Computational Applications
Evaluative Language Beyond Bags of Words: Linguistic Insights and Computational Applications Open
The study of evaluation, affect, and subjectivity is a multidisciplinary enterprise, including sociology, psychology, economics, linguistics, and computer science. A number of excellent computational linguistics and linguistic surveys of t…
View article: Semantic descriptions of 24 evaluational adjectives, for application in sentiment analysis
Semantic descriptions of 24 evaluational adjectives, for application in sentiment analysis Open
We apply the Natural Semantic Metalanguage (NSM) approach (Goddard and Wierzbicka 2014) to the lexical-semantic analysis of English evaluational adjectives and compare the results with the picture developed in the Appraisal Framework (Mart…