Cèsar Berrospi
YOU?
Author Swipe
View article: Advanced Layout Analysis Models for Docling
Advanced Layout Analysis Models for Docling Open
This technical report documents the development of novel Layout Analysis models integrated into the Docling document-conversion pipeline. We trained several state-of-the-art object detectors based on the RT-DETR, RT-DETRv2 and DFINE archit…
View article: <scp>ChemQuery</scp>: A Natural Language Query‐Driven Service for Comprehensive Exploration of Chemistry Patent Literature
<span>ChemQuery</span>: A Natural Language Query‐Driven Service for Comprehensive Exploration of Chemistry Patent Literature Open
Patents are integral to our shared scientific knowledge, requiring companies and inventors to stay informed about them to conduct research, find licensing opportunities, and manage legal risks. However, the rising rate of filings has made …
View article: Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems Open
Retrieval Augmented Generation (RAG) systems are a widespread application of Large Language Models (LLMs) in the industry. While many tools exist empowering developers to build their own systems, measuring their performance locally, with d…
View article: Wealth Over Woe: Global Biases in Hydro‐Hazard Research
Wealth Over Woe: Global Biases in Hydro‐Hazard Research Open
Floods, droughts, and rainfall‐induced landslides are hydro‐hazards that affect millions of people every year. Anticipation, mitigation, and adaptation to these hazards is increasingly outpaced by their changing magnitude and frequency due…
View article: Knowledge Enhanced Representation Learning for Drug Discovery
Knowledge Enhanced Representation Learning for Drug Discovery Open
Recent research on predicting the binding affinity between drug molecules and proteins use representations learned, through unsupervised learning techniques, from large databases of molecule SMILES and protein sequences. While these repres…
View article: ESG Accountability Made Easy: DocQA at Your Service
ESG Accountability Made Easy: DocQA at Your Service Open
We present Deep Search DocQA. This application enables information extraction from documents via a question-answering conversational assistant. The system integrates several technologies from different AI disciplines consisting of document…
View article: Identifying global biases in hydro-hazard research by mining the scientific literature
Identifying global biases in hydro-hazard research by mining the scientific literature Open
Floods, droughts, and rainfall-induced landslides are hydro-geomorphic hazards that affect millions of people every year. These hazards are therefore heavily researched topics with several hundred thousand articles published. The large num…
View article: Wealth over Woe: global biases in hydro-hazard research
Wealth over Woe: global biases in hydro-hazard research Open
Floods, droughts, and rainfall-induced landslides are hydro-geomorphic hazards that affect millions of people every year. Anticipation, mitigation, and adaptation to these hazards is increasingly outpaced by their changing magnitude and fr…
View article: ESG Accountability Made Easy: DocQA at Your Service
ESG Accountability Made Easy: DocQA at Your Service Open
We present Deep Search DocQA. This application enables information extraction from documents via a question-answering conversational assistant. The system integrates several technologies from different AI disciplines consisting of document…
View article: Robust PDF Document Conversion using Recurrent Neural Networks
Robust PDF Document Conversion using Recurrent Neural Networks Open
The number of published PDF documents in both the academic and commercial world has increased exponentially in recent decades. There is a growing need to make their rich content discoverable to information retrieval tools. Achieving high-q…
View article: Linear-complexity relaxed word Mover's distance with GPU acceleration
Linear-complexity relaxed word Mover's distance with GPU acceleration Open
The amount of unstructured text-based data is growing every day. Querying, clustering, and classifying this big data requires similarity computations across large sets of documents. Whereas low-complexity similarity metrics are available, …