Anna Hätty
YOU?
Author Swipe
PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims Open
Patent claims define the scope of protection for an invention. If there are ambiguities in a claim, it is rejected by the patent office. In the US, this is referred to as indefiniteness (35 U.S.C § 112(b)) and is among the most frequent re…
Pap2Pat: Benchmarking Outline-Guided Long-Text Patent Generation with Patent-Paper Pairs Open
Dealing with long and highly complex technical text is a challenge for Large Language Models (LLMs), which still have to unfold their potential in supporting expensive and timeintensive processes like patent drafting. Within patents, the d…
SURel: Synchronic Usage Relatedness Open
This data collection contains synchronic semantic relatedness judgments for German word usage pairs drawn from general language and the domain of cooking. Find a description of the data format, code to process the data and further datasets…
SURel: Synchronic Usage Relatedness Open
This data collection contains synchronic semantic relatedness judgments for German word usage pairs drawn from general language and the domain of cooking. Find a description of the data format, code to process the data and further datasets…
SURel: Synchronic Usage Relatedness Open
This data collection contains synchronic semantic relatedness judgments for German word usage pairs drawn from general language and the domain of cooking. Find a description of the data format, code to process the data and further datasets…
Compound or Term Features? Analyzing Salience in Predicting the Difficulty of German Noun Compounds across Domains Open
Predicting the difficulty of domain-specific vocabulary is an important task towards a better understanding of a domain, and to enhance the communication between lay people and experts. We investigate German closed noun compounds and focus…
Automatic term extraction for conventional and extended term definitions across domains Open
A terminology is the entirety of concepts which constitute the vocabulary of a domain or subject field. Automatically identifying various linguistic forms of terms in domain-specific corpora is an important basis for further natural langua…
Predicting Degrees of Technicality in Automatic Terminology Extraction Open
While automatic term extraction is a well-researched area, computational approaches to distinguish between degrees of technicality are still understudied. We semi-automatically create a German gold standard of technicality across four doma…
A Wind of Change: Detecting and Evaluating Lexical Semantic Change across Times and Domains Open
We perform an interdisciplinary large-scale evaluation for detecting lexical semantic divergences in a diachronic and in a synchronic task: semantic sense changes across time, and semantic sense changes across domains. Our work addresses t…
A Wind of Change: Detecting and Evaluating Lexical Semantic Change\n across Times and Domains Open
We perform an interdisciplinary large-scale evaluation for detecting lexical\nsemantic divergences in a diachronic and in a synchronic task: semantic sense\nchanges across time, and semantic sense changes across domains. Our work\naddresse…
SURel: Synchronic Usage Relatedness Open
------------------------------------- Siehe unten für die deutsche Version. ------------------------------------- Synchronic Usage Relatedness (SURel) - Test Set and Annotation Data This data collection supplementing the paper referenced b…
SURel: A Gold Standard for Incorporating Meaning Shifts into Term Extraction Open
We introduce SURel, a novel dataset with human-annotated meaning shifts between general-language and domain-specific contexts. We show that meaning shifts of term candidates cause errors in term extraction, and demonstrate that the SURel a…
A Wind of Change: Detecting and Evaluating Lexical Semantic Change across Times and Domains Open
We perform an interdisciplinary large-scale evaluation for detecting lexical semantic divergences in a diachronic and in a synchronic task: semantic sense changes across time, and semantic sense changes across domains. Our work addresses t…
A Laypeople Study on Terminology Identification across Domains and Task Definitions Open
Anna Hätty, Sabine Schulte im Walde. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers). 2018.
Evaluating the Reliability and Interaction of Recursively Used Feature Classes for Terminology Extraction Open
Feature design and selection is a crucial aspect when treating terminology extraction as a machine learning classification problem. We designed feature classes which characterize different properties of terms based on distributions, and pr…
The Role of Modifier and Head Properties in Predicting the Compositionality of English and German Noun-Noun Compounds: A Vector-Space Perspective Open
In this paper, we explore the role of constituent properties in English and German noun-noun compounds (corpus frequencies of the compounds and their constituents; productivity and ambiguity of the constituents; and semantic relations betw…