Sofian Chaybouti

REVEAL: Relation-based Video Representation Learning for Video-Question-Answering Open

Sofian Chaybouti, Walid Bousselham, Moritz Wolter, Hilde Kuehne · 2025

Video-Question-Answering (VideoQA) requires capturing complex changes in visual relations over time and remains a challenge even for advanced Video Language Models (VLMs), among other reasons because of the need to represent the visual content to a re…

MaskInversion: Localized Embeddings via Optimization of Explainability Maps Open

Walid Bousselham, Sofian Chaybouti, Christian Rupprecht, Vittorio Ferrari, Hilde Kuehne · 2024

Computer science Mathematics

Vision-language foundation models such as CLIP have achieved tremendous results in global vision-language alignment, but still show some limitations in creating representations for specific image regions. To address this problem, we prop…

LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity Open

Walid Bousselham, Angie Boggust, Sofian Chaybouti, Hendrik Strobelt, Hilde Kuehne · 2024

Computer science Engineering

Vision Transformers (ViTs), with their ability to model long-range dependencies through self-attention mechanisms, have become a standard architecture in computer vision. However, the interpretability of these models remains a challenge. T…

EfficientQA : a RoBERTa Based Phrase-Indexed Question-Answering System Open

Sofian Chaybouti, Achraf Saghe, Aymen Shabou · 2021

Computer science Geography Biology

State-of-the-art extractive question-answering models achieve superhuman performance on the SQuAD benchmark. Yet they are unreasonably heavy and need expensive GPU computing to answer questions in a reasonable time. Thus, they cannot be …

MIX : a Multi-task Learning Approach to Solve Open-Domain Question Answering Open

Sofian Chaybouti, Achraf Saghe, Aymen Shabou · 2020

Computer science Geography Economics

This paper introduces MIX, a multi-task deep learning approach to open-ended question answering. First, we design our system as a multi-stage pipeline of three building blocks: a BM25-based Retriever to reduce the search space, a RoBERTa…
