Mark Coates
PriviRec: Confidential and Decentralized Graph Filtering for Recommender Systems
It Takes Two: Your GRPO Is Secretly DPO
Group Relative Policy Optimization (GRPO) is a prominent reinforcement learning algorithm for post-training Large Language Models (LLMs). It is commonly believed that GRPO necessitates a large group size to ensure stable training via preci…
Omni-Thinker: Scaling Multi-Task RL in LLMs with Hybrid Reward and Task Scheduling
The pursuit of general-purpose artificial intelligence depends on large language models (LLMs) that can handle both structured reasoning and open-ended generation. We present Omni-Thinker, a unified reinforcement learning (RL) framework th…
Communication Efficient, Differentially Private Distributed Optimization using Correlation-Aware Sketching
Federated learning with differential privacy suffers from two major costs: each client must transmit $d$-dimensional gradients every round, and the magnitude of DP noise grows with $d$. Yet empirical studies show that gradient updates exhi…
SKOLR: Structured Koopman Operator Linear RNN for Time-Series Forecasting
Koopman operator theory provides a framework for nonlinear dynamical system analysis and time-series forecasting by mapping dynamics to a space of real-valued measurement functions, enabling a linear operator representation. Despite the ad…
One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration
Pre-trained large language models (LLMs) show promise for robotic task planning but often struggle to guarantee correctness in long-horizon problems. Task and motion planning (TAMP) addresses this by grounding symbolic plans in low-level e…
Half Search Space is All You Need
Neural Architecture Search (NAS) is a powerful tool for automating architecture design. One-Shot NAS techniques, such as DARTS, have gained substantial popularity due to their combination of search efficiency with simplicity of implementat…
Plain Transformers Can be Powerful Graph Learners
Transformers have attained outstanding performance across various modalities, owing to their simple but powerful scaled-dot-product (SDP) attention mechanisms. Researchers have attempted to migrate Transformers to graph learning, but most …
Variation Matters: from Mitigating to Embracing Zero-Shot NAS Ranking Function Variation
Neural Architecture Search (NAS) is a powerful automatic alternative to manual design of a neural network. In the zero-shot version, a fast ranking function is used to compare architectures without training them. The outputs of the ranking…
InnerThoughts: Disentangling Representations and Predictions in Large Language Models
Large language models (LLMs) contain substantial factual knowledge which is commonly elicited by multiple-choice question-answering prompts. Internally, such models process the prompt through multiple transformer layers, building varying r…
Secure Federated Graph-Filtering for Recommender Systems
Recommender systems often rely on graph-based filters, such as normalized item-item adjacency matrices and low-pass filters. While effective, the centralized computation of these components raises concerns about privacy, security, and the …
Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models
Large language models (LLMs) possess vast semantic knowledge but often struggle with complex reasoning tasks, particularly in relational reasoning problems such as kinship or spatial reasoning. In this paper, we present Path-of-Thoughts (P…
Refining Answer Distributions for Improved Large Language Model Reasoning
Large Language Models (LLMs) have exhibited an impressive capability to perform reasoning tasks, especially if they are encouraged to generate a sequence of intermediate steps. Reasoning performance can be improved by suitably combining mu…
Differentially private and decentralized randomized power method
The randomized power method has gained significant interest due to its simplicity and efficient handling of large-scale spectral analysis and recommendation tasks. However, its application to large datasets containing personal information …
Sparse Decomposition of Graph Neural Networks
Graph Neural Networks (GNNs) exhibit superior performance in graph representation learning, but their inference cost can be high, due to an aggregation operation that can require a memory fetch for a very large number of nodes. This inferen…
Enhancing Click-through Rate Prediction in Recommendation Domain with Search Query Representation
Many platforms, such as e-commerce websites, offer both search and recommendation services simultaneously to better meet users' diverse needs. Recommendation services suggest items based on user preferences, while search services allow …
HardCore Generation: Generating Hard UNSAT Problems for Data Augmentation
Efficiently determining the satisfiability of a Boolean formula -- known as the SAT problem for brevity -- is crucial in various industrial problems. Recently, the advent of deep learning methods has introduced significant potential for e…
Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data
Despite recent advances in training and prompting strategies for Large Language Models (LLMs), these models continue to face challenges with complex logical reasoning tasks that involve long reasoning chains. In this work, we explore the p…
Predicting Probabilities of Error to Combine Quantization and Early Exiting: QuEE
Machine learning models can solve complex tasks but often require significant computational resources during inference. This has led to the development of various post-training computation reduction methods that tackle this issue in differ…
Graph Knowledge Distillation to Mixture of Experts
In terms of accuracy, Graph Neural Networks (GNNs) are the best architectural choice for the node classification task. Their drawback in real-world deployment is the latency that emerges from the neighbourhood processing operation. One sol…
MODL: Multilearner Online Deep Learning
Online deep learning tackles the challenge of learning from data streams by balancing two competing goals: fast learning and deep learning. However, existing research primarily emphasizes deep learning solutions, which are more adept at ha…
GraSS: Combining Graph Neural Networks with Expert Knowledge for SAT Solver Selection
Boolean satisfiability (SAT) problems are routinely solved by SAT solvers in real-life applications, yet solving time can vary drastically between solvers for the same instance. This has motivated research into machine learning models that…
CKGConv: General Graph Convolution with Continuous Kernels
The existing definitions of graph convolution, either from spatial or spectral perspectives, are inflexible and not unified. Defining a general convolution operator in the graph domain is challenging due to the lack of canonical coordinate…
Personalized Negative Reservoir for Incremental Learning in Recommender Systems
Recommender systems have become an integral part of online platforms. Every day the volume of training data is expanding and the number of user interactions is constantly increasing. The exploration of larger and more expressive models has…
Population Monte Carlo with Normalizing Flow
Adaptive importance sampling (AIS) methods provide a useful alternative to Markov Chain Monte Carlo (MCMC) algorithms for performing inference of intractable distributions. Population Monte Carlo (PMC) algorithms constitute a family of AIS…
Multi-resolution Time-Series Transformer for Long-term Forecasting
The performance of transformers for time-series forecasting has improved significantly. Recent architectures learn complex temporal patterns by segmenting a time-series into patches and using the patches as tokens. The patch size controls …
Interacting Diffusion Processes for Event Sequence Forecasting
Neural Temporal Point Processes (TPPs) have emerged as the primary framework for predicting sequences of events that occur at irregular time intervals, but their sequential nature can hamper performance for long-horizon forecasts. To addre…
Jointly-Learned Exit and Inference for a Dynamic Neural Network: JEI-DNN
Large pretrained models, coupled with fine-tuning, are slowly becoming established as the dominant architecture in machine learning. Even though these models offer impressive performance, their practical application is often limited by the…
Substituting Data Annotation with Balanced Updates and Collective Loss in Multi-label Text Classification
Multi-label text classification (MLTC) is the task of assigning multiple labels to a given text, and has a wide range of application domains. Most existing approaches require an enormous amount of annotated data to learn a classifier and/o…
Neighbor Auto-Grouping Graph Neural Networks for Handover Parameter Configuration in Cellular Network
The mobile communication enabled by cellular networks is one of the main foundations of our modern society. Optimizing the performance of cellular networks and providing massive connectivity with improved coverage and user experience h…