Albert Clapés
YOU?
Author Swipe
View article: Enhancing Clinical Psychology Practice through Data-driven Machine Learning Monitoring Systems
Enhancing Clinical Psychology Practice through Data-driven Machine Learning Monitoring Systems Open
Mental health is a critical global challenge, with significant societal impacts. Machinelearning has emerged as a valuable and innovative approach to enhancing psychologicalassessment and intervention. This study focuses on leveraging ment…
View article: Action Valuation in Sports: A Survey
Action Valuation in Sports: A Survey Open
Action Valuation (AV) has emerged as a key topic in Sports Analytics, offering valuable insights by assigning scores to individual actions based on their contribution to desired outcomes. Despite a few surveys addressing related concepts s…
View article: Developing a Digital Mental Health Ecosystem for Workplaces: Rationale, Objectives, and Methods of the MetrikaMind Project
Developing a Digital Mental Health Ecosystem for Workplaces: Rationale, Objectives, and Methods of the MetrikaMind Project Open
Background: Depression and anxiety are among the leading causes of disability worldwide, significantly impacting workplace productivity through absenteeism and presenteeism. The MetrikaMind platform offers a scalable, digital solution for …
View article: SoccerNet 2024 Challenges Results
SoccerNet 2024 Challenges Results Open
The SoccerNet 2024 challenges represent the fourth annual video understanding challenges organized by the SoccerNet team. These challenges aim to advance research across multiple themes in football, including broadcast video understanding,…
View article: AI Competitions and Benchmarks: Dataset Development
AI Competitions and Benchmarks: Dataset Development Open
Machine learning is now used in many applications thanks to its ability to predict, generate, or discover patterns from large quantities of data. However, the process of collecting and transforming data for practical use is intricate. Even…
View article: T-DEED: Temporal-Discriminability Enhancer Encoder-Decoder for Precise Event Spotting in Sports Videos
T-DEED: Temporal-Discriminability Enhancer Encoder-Decoder for Precise Event Spotting in Sports Videos Open
In this paper, we introduce T-DEED, a Temporal-Discriminability Enhancer Encoder-Decoder for Precise Event Spotting in sports videos. T-DEED addresses multiple challenges in the task, including the need for discriminability among frame rep…
View article: ASTRA: An Action Spotting TRAnsformer for Soccer Videos
ASTRA: An Action Spotting TRAnsformer for Soccer Videos Open
In this paper, we introduce ASTRA, a Transformer-based model designed for the task of Action Spotting in soccer matches. ASTRA addresses several challenges inherent in the task and dataset, including the requirement for precise action loca…
View article: SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization
SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization Open
Temporal Action Localization (TAL) is a complex task that poses relevant challenges, particularly when attempting to generalize on new -- unseen -- domains in real-world applications. These scenarios, despite realistic, are often neglected…
View article: SoccerNet 2023 Challenges Results
SoccerNet 2023 Challenges Results Open
The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first th…
View article: Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining
Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining Open
Sign Language Translation (SLT) is a challenging task due to its cross-domain nature, involving the translation of visual-gestural language to text. Many previous methods employ an intermediate representation, i.e., gloss sequences, to fac…
View article: SoccerNet 2022 Challenges Results
SoccerNet 2022 Challenges Results Open
The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team. In 2022, the challenges were composed of 6 vision-based tasks: (1) action spotting, focusing on retrieving action timestam…
View article: Deep learning with self-supervision and uncertainty regularization to count fish in underwater images
Deep learning with self-supervision and uncertainty regularization to count fish in underwater images Open
Effective conservation actions require effective population monitoring. However, accurately counting animals in the wild to inform conservation decision-making is difficult. Monitoring populations through image sampling has made data colle…
View article: Deep learning with self-supervision and uncertainty regularization to count fish in underwater images
Deep learning with self-supervision and uncertainty regularization to count fish in underwater images Open
Effective conservation actions require effective population monitoring. However, accurately counting animals in the wild to inform conservation decision-making is difficult. Monitoring populations through image sampling has made data colle…
View article: Video Transformers: A Survey
Video Transformers: A Survey Open
Transformer models have shown great success handling long-range interactions, making them a promising tool for modeling video. However, they lack inductive biases and scale quadratically with input length. These limitations are further exa…
View article: Dyadformer: A Multi-modal Transformer for Long-Range Modeling of Dyadic Interactions
Dyadformer: A Multi-modal Transformer for Long-Range Modeling of Dyadic Interactions Open
Personality computing has become an emerging topic in computer vision, due to the wide range of applications it can be used for. However, most works on the topic have focused on analyzing the individual, even when applied to interaction sc…
View article: Dyadformer: A Multi-modal Transformer for Long-Range Modeling of Dyadic\n Interactions
Dyadformer: A Multi-modal Transformer for Long-Range Modeling of Dyadic\n Interactions Open
Personality computing has become an emerging topic in computer vision, due to\nthe wide range of applications it can be used for. However, most works on the\ntopic have focused on analyzing the individual, even when applied to\ninteraction…
View article: Context-Aware Personality Inference in Dyadic Scenarios: Introducing the UDIVA Dataset
Context-Aware Personality Inference in Dyadic Scenarios: Introducing the UDIVA Dataset Open
This paper introduces UDIVA, a new non-acted dataset of face-to-face dyadic interactions, where interlocutors perform competitive and collaborative tasks with different behavior elicitation and cognitive workload. The dataset consists of 9…
View article: Action Recognition Using Single-Pixel Time-of-Flight Detection
Action Recognition Using Single-Pixel Time-of-Flight Detection Open
Action recognition is a challenging task that plays an important role in many robotic systems, which highly depend on visual input feeds. However, due to privacy concerns, it is important to find a method which can recognise actions withou…
View article: From Apparent to Real Age: Gender, Age, Ethnic, Makeup, and Expression Bias Analysis in Real Age Estimation
From Apparent to Real Age: Gender, Age, Ethnic, Makeup, and Expression Bias Analysis in Real Age Estimation Open
Real age estimation in still images of faces is an active area of research in the computer vision community. However, very few works attempted to analyse the apparent age as perceived by observers. Apparent age estimation is a subjective t…
View article: A Survey on Deep Learning Based Approaches for Action and Gesture Recognition in Image Sequences
A Survey on Deep Learning Based Approaches for Action and Gesture Recognition in Image Sequences Open
International audience
View article: ChaLearn Joint Contest on Multimedia Challenges Beyond Visual Analysis: An overview
ChaLearn Joint Contest on Multimedia Challenges Beyond Visual Analysis: An overview Open
This paper provides an overview of the Joint Contest on Multimedia Challenges Beyond Visual Analysis. We organized an academic competition that focused on four problems that require effective processing of multimodal information in order t…
View article: Keep it accurate and diverse: Enhancing action recognition performance by ensemble learning
Keep it accurate and diverse: Enhancing action recognition performance by ensemble learning Open
The performance of different action recognition techniques has recently been studied by several computer vision researchers. However, the potential improvement in classification through classifier fusion by ensemble-based methods has remai…