Dolf Trieschnigg
YOU?
Author Swipe
View article: "How Old Do You Think I Am?" A Study of Language and Age in Twitter
"How Old Do You Think I Am?" A Study of Language and Age in Twitter Open
In this paper we focus on the connection between age and language use, exploring age prediction of Twitter users based on their tweets. We discuss the construction of a fine-grained annotation effort to assign ages and life stages to Twitt…
View article: Audience and the Use of Minority Languages on Twitter
Audience and the Use of Minority Languages on Twitter Open
On Twitter, many users tweet in more than one language. In this study, we examine the use of two Dutch minority languages. Users can engage with different audiences and by analyzing different types of tweets, we find that characteristics o…
View article: Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records
Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records Open
A major hurdle in the development of natural language processing (NLP) methods for Electronic Health Records (EHRs) is the lack of large, annotated datasets. Privacy concerns prevent the distribution of EHRs, and the annotation of data is …
View article: Comparing Rule-based, Feature-based and Deep Neural Methods for De-identification of Dutch Medical Records
Comparing Rule-based, Feature-based and Deep Neural Methods for De-identification of Dutch Medical Records Open
Unstructured information in electronic health records provide an invaluable resource for medical research. To protect the confidentiality of patients and to conform to privacy regulations, de-identification methods automatically remove per…
View article: Comparing rule-based, feature-based and deep neural methods for de-identification of Dutch medical records
Comparing rule-based, feature-based and deep neural methods for de-identification of Dutch medical records Open
Unstructured information in electronic health records provide an invaluable resource for medical research. To protect the confidentiality of patients and to conform to privacy regulations, de-identification methods automatically remove per…
View article: Supporting the Exploration of Online Cultural Heritage Collections : The Case of the Dutch Folktale Database
Supporting the Exploration of Online Cultural Heritage Collections : The Case of the Dutch Folktale Database Open
This paper demonstrates the use of a user-centred design approach for the development of generous interfaces/rich prospect browsers for an online cultural heritage collection, determining its primary user groups and designing different bro…
View article: Resource Selection for Federated Search on the Web
Resource Selection for Federated Search on the Web Open
A publicly available dataset for federated search reflecting a real web environment has long been absent, making it difficult for researchers to test the validity of their federated search algorithms for the web setting. We present several…
View article: Automatic Enrichment and Classification of Folktales in the Dutch Folktale Database
Automatic Enrichment and Classification of Folktales in the Dutch Folktale Database Open
This paper describes the development of the Dutch Folktale Database as a digital archive of intangible heritage and a sophisticated research instrument. Current research focuses on automating the assignment of metadata to folktales and on …
View article: FedWeb Greatest Hits
FedWeb Greatest Hits Open
This paper presents 'FedWeb Greatest Hits', a large new test collection for research in web information retrieval. As a combination and extension of the datasets used in the TREC Federated Web Search Track, this collection opens up new res…