Birgit Pfitzmann
YOU?
Author Swipe
View article: INDUS: Effective and Efficient Language Models for Scientific Applications
INDUS: Effective and Efficient Language Models for Scientific Applications Open
Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specia…
View article: Wealth over Woe: global biases in hydro-hazard research
Wealth over Woe: global biases in hydro-hazard research Open
Floods, droughts, and rainfall-induced landslides are hydro-geomorphic hazards that affect millions of people every year. Anticipation, mitigation, and adaptation to these hazards is increasingly outpaced by their changing magnitude and fr…
View article: DocLayNet: A Large Human-Annotated Dataset for Document-Layout Segmentation
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Segmentation Open
Accurate document layout analysis is a key requirement for high-quality PDF document conversion. With the recent availability of public, large ground-truth datasets such as PubLayNet and DocBank, deep-learning models have proven to be very…