Timon Deschamps

Research

Utility-based soft masking for continual multi-objective reinforcement learning

Timon Deschamps, Rémy Chaput, Mathieu Guillermin, Laëtitia Matignon

AAMAS 2026

paper

Real-world decision-making usually involves balancing multiple and sometimes conflicting objectives, according to user preferences that can be complex and potentially non-linear. These preferences, or utilities, can also be subject to change over time, requiring continual adaptation. This challenge is at the center of continual multi-objective reinforcement learning (CMORL), and remains vastly understudied, with existing work limited to linear utilities. In this paper, we take a first step towards CMORL with non-linear utilities by proposing utility-based soft masking (UBSM). By generating a discretized representation of the utility and using it to soft-mask the policy's parameters, UBSM harnesses the structure of utility functions, allowing for greater knowledge transfer among them and supporting the learning of policies that adapt to dynamic preferences. We evaluate UBSM on classic multi-objective reinforcement learning environments, demonstrating its improvements over baselines and providing insights on the evaluation of CMORL algorithms.
Multi-objective reinforcement learning: an ethical perspective

Timon Deschamps, Rémy Chaput, Laëtitia Matignon

Multi-Objective Decision Making workshop, ECAI 2024

* also presented at RJCIA 2024

paper slides

Reinforcement learning (RL) is becoming more prevalent in practical domains with human implications, raising ethical questions. Specifically, multi-objective RL has been argued to be an ideal framework for modeling real-world problems and developing human-aligned artificial intelligence. However, the ethical dimension remains underexplored in the field, and no survey covers this aspect. Hence, we propose a review of multi-objective RL from an ethical perspective, highlighting existing works, gaps in the literature, important considerations, and potential areas for future research.
TDMD: A Database for Dynamic Color Mesh Quality Assessment Study

Qi Yang, Joel Jung, Timon Deschamps, Xiaozhong Xu, Shan Liu

IEEE TVCG

* internship at Tencent Media Lab

paper preprint website

Dynamic colored meshes (DCM) are widely used in various applications. However, this kind of meshes may undergo different processes, such as compression or transmission, which can distort them and degrade their quality. To facilitate the development of objective metrics for DCMs and study the influence of typical distortions on their perception, we create the Tencent - Dynamic colored Mesh Database (TDMD) containing eight reference DCM objects with six typical distortions. Using processed video sequences (PVS) derived from the DCM, we conduct a large-scale subjective experiment that resulted in 303 distorted DCM samples with mean opinion scores, making the TDMD the largest available DCM database to our knowledge. This database enables us to study the impact of different types of distortion on human perception and offers recommendations for DCM compression and related tasks. Additionally, we have evaluated three types of state-of-the-art objective metrics on the TDMD, including image-based, point-based, and video-based metrics, on the TDMD. Our experimental results highlight the strengths and weaknesses of each metric, and we provide suggestions about the selection of metrics in practical DCM applications.

Teaching

Algorithms, Programming and Complexity

Teaching assistant · BSc

Spring 2026
Introduction to Deep Learning

Lecturer · MSc AI · MSc DISS (English-taught) · Polytech Lyon (engineering school)

slides exercises PyTorch mini-tutorial

Fall 2025
(Deep) Reinforcement Learning

Teaching assistant · MSc AI

Fall 2025
Applications of Mathematics and Computer Science

Teaching assistant · BSc

Spring 2025
Introduction to Imperative Programming

Teaching assistant · BSc

Fall 2024