Constantin Eichenberg
Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models
Tokenization is a fundamental step in natural language processing, breaking text into units that computational models can process. While learned subword tokenizers have become the de-facto standard, they present challenges such as large vo…
Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization
Large Language Models (LLMs) have reshaped natural language processing with their impressive capabilities. However, their ever-increasing size has raised concerns about their effective deployment and the need for LLM compression. This stud…
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
The recent popularity of text-to-image diffusion models (DM) can largely be attributed to the intuitive interface they provide to users. The intended generation can be expressed in natural language, with the model producing faithful interp…
M-VADER: A Model for Diffusion with Multimodal Context
We introduce M-VADER: a diffusion model (DM) for image generation where the output can be specified using arbitrary combinations of images and text. We show how M-VADER enables the generation of images specified using combinations of image…
MAGMA – Multimodal Augmentation of Generative Models through Adapter-based Finetuning
Large-scale pretraining is fast becoming the norm in Vision-Language (VL) modeling. However, prevailing VL approaches are limited by the requirement for labeled data and the use of complex multi-step pretraining objectives. We present MAGM…