PhD Student Seminars [LaMME]

Location: I.B.G.B.I. building, 23 Bd. de France, seminar room on the 4th floor.
When: Tuesdays at 14:00.
Organizers: Anna Bahrii, Jiarong Fan, Maxence Mansais.

Upcoming seminars

Speaker: Karen BOZANIAN (Université d'Evry Paris-Saclay)

Date: 10 March 2026

Title:

Abstract:

Speaker: Bình Thuận Trần (LAMA, LIGM, Université Gustave Eiffel)

Date: 31 March 2026

Title: Minimax-Optimal Two-Sample Test with Sliced Wasserstein

Abstract: We study the problem of nonparametric two-sample testing using the sliced Wasserstein (SW) distance. While prior theoretical and empirical work indicates that the SW distance offers a promising balance between strong statistical guarantees and computational efficiency, its theoretical foundations for hypothesis testing remain limited. We address this gap by proposing a permutation-based SW test and analyzing its performance. The test inherits finite-sample Type I error control from the permutation principle. Moreover, we establish non-asymptotic power bounds and show that the procedure achieves the minimax separation rate $n^{-1/2}$ over multinomial and bounded-support alternatives, matching the optimal guarantees of kernel-based tests while building on the geometric foundations of Wasserstein distances. Our analysis further quantifies the trade-off between the number of projections and statistical power. Finally, numerical experiments demonstrate that the test combines finite-sample validity with competitive power and scalability, and—unlike kernel-based tests, which require careful kernel tuning—it performs consistently well across all scenarios we consider.
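
To make the permutation principle mentioned in the abstract concrete, here is a minimal sketch of a permutation test built on a Monte Carlo sliced Wasserstein statistic. It is an illustration under stated assumptions, not the speaker's procedure: the order-1 distance, equal sample sizes, and the function names and default parameters (`n_proj`, `n_perm`) are all choices made here for simplicity.

```python
import math
import random


def sliced_wasserstein(xs, ys, directions):
    """Monte Carlo sliced 1-Wasserstein distance between two equal-size samples."""
    total = 0.0
    for theta in directions:
        # Project both samples onto the direction, then sort: in 1D the optimal
        # coupling between equal-size samples matches order statistics.
        px = sorted(sum(a * b for a, b in zip(x, theta)) for x in xs)
        py = sorted(sum(a * b for a, b in zip(y, theta)) for y in ys)
        total += sum(abs(a - b) for a, b in zip(px, py)) / len(px)
    return total / len(directions)


def random_directions(dim, count, rng):
    """Draw `count` directions uniformly on the unit sphere of R^dim."""
    dirs = []
    for _ in range(count):
        g = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        norm = math.sqrt(sum(v * v for v in g))
        dirs.append([v / norm for v in g])
    return dirs


def permutation_sw_test(xs, ys, n_proj=20, n_perm=199, seed=0):
    """Return the permutation p-value of the sliced Wasserstein statistic."""
    rng = random.Random(seed)
    directions = random_directions(len(xs[0]), n_proj, rng)
    observed = sliced_wasserstein(xs, ys, directions)
    pooled = list(xs) + list(ys)
    n = len(xs)
    exceed = 0
    for _ in range(n_perm):
        # Under the null hypothesis the pooled sample is exchangeable,
        # so any relabelling is as likely as the observed one.
        rng.shuffle(pooled)
        if sliced_wasserstein(pooled[:n], pooled[n:], directions) >= observed:
            exceed += 1
    # Counting the observed statistic itself ("+1") gives finite-sample validity.
    return (1 + exceed) / (1 + n_perm)
```

Rejecting when the returned p-value is at most the chosen level controls the Type I error at that level for any sample size; the number of projections only affects power and cost, which is the trade-off the abstract refers to.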

Past seminars

Speaker: Nicoleta Cazacu (École Polytechnique)

Date: 17 February 2026

Title: From Microscopic to Macroscopic: Convergence of Particle Systems with Singular Interactions to Nonlinear Fokker–Planck PDEs

Abstract: Deriving macroscopic evolution equations from interacting particle systems is a classical problem in mathematical physics, dating back to Kac’s work on the Boltzmann equation. In the mean-field regime, these systems are expected to converge, as the number of particles goes to infinity, to solutions of nonlinear Fokker–Planck PDEs. In this limit, the particles become asymptotically independent, a property known as propagation of chaos.

While convergence is well understood for regular interaction kernels, many important models involve singular interactions, such as those in the Keller–Segel model, Coulomb interactions, or Riesz potentials, where fundamental questions remain open.

In this talk, I will introduce the framework of interacting particle systems and the concept of propagation of chaos. Then I will describe recent techniques for proving convergence with singular kernels and present a result for a stochastic numerical approach based on the Euler–Maruyama scheme, together with quantitative convergence estimates.

Finally, I will discuss long-time behavior and uniform-in-time propagation of chaos, highlighting key techniques and ongoing work that uses a mild formulation of the limiting equation and semigroup estimates.
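
For readers unfamiliar with the setting, the following sketch simulates a one-dimensional mean-field particle system with the Euler–Maruyama scheme. It is only an illustration: the smooth kernel below is a regular stand-in chosen here (the talk concerns singular kernels, which need extra care), and the function names and default parameters are assumptions.

```python
import math
import random


def kernel(x):
    """Smooth attractive interaction kernel (an illustrative stand-in, not singular)."""
    return -x / (1.0 + x * x)


def euler_maruyama_particles(n_particles=50, n_steps=200, dt=0.01, sigma=1.0, seed=0):
    """Simulate dX_i = (1/N) sum_j K(X_i - X_j) dt + sigma dW_i with Euler-Maruyama."""
    rng = random.Random(seed)
    xs = [rng.gauss(0.0, 1.0) for _ in range(n_particles)]
    for _ in range(n_steps):
        # Mean-field drift: each particle feels the average interaction with all others.
        drifts = [sum(kernel(xi - xj) for xj in xs) / n_particles for xi in xs]
        # One explicit Euler step plus an independent Gaussian increment per particle.
        xs = [
            xi + b * dt + sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
            for xi, b in zip(xs, drifts)
        ]
    return xs
```

As the number of particles grows, the empirical measure of the returned positions is expected to approximate the solution of the corresponding nonlinear Fokker–Planck equation; for an odd kernel such as this one, the drift leaves the center of mass unchanged.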

Speaker: Pei SU (Charles University, Prague)

Date: 27 January 2026 (Zoom)

Title: On the motion of a rigid body in a perfect compressible fluid

Abstract: We consider a rigid body moving in an inviscid compressible fluid within a bounded domain. The fluid is described by the compressible Euler equations, while the rigid body obeys the conservation of linear and angular momentum. This gives a coupled system comprising an ODE and the initial boundary value problem (IBVP) for a hyperbolic system with characteristic boundary, where the fluid velocity matches the solid velocity along the normal direction of the solid boundary. We establish the existence of a unique classical solution to this coupled system. Our approach involves constructing an approximate system with a non-characteristic boundary, which enables the decoupling of the fluid and solid equations. We then derive uniform estimates and finally obtain the solution by a compactness argument. This is joint work with F. Rousset (Orsay).

Speaker: Cyril Nefzaoui Blanchard (Université d'Evry Paris-Saclay)

Date: 9 December 2025 (13:00)

Title: Deep BSDE method for Quantile Hedging

Abstract: We consider the popular Deep BSDE method of E–Han–Jentzen and tailor it to the quantile hedging problem in the weak stochastic-target framework initiated by Bouchard–Elie–Touzi and Bouchard–Elie–Réveillac. The success-probability state is modeled as a constrained martingale with square-integrable unbounded controls. We combine piecewise-constant controls with a projected Euler time discretization and an endogenous stopping rule to preserve the constraint, yielding an implementable discrete-time control problem. Within the policy-timestep framework of Krylov and Jakobsen–Picarelli–Reisinger, we provide a convergence study tailored to the state constraint, unbounded controls, and BSDE-type criteria with convex drivers. Unlike previous works that focus mainly on pricing, our approach outputs both the value function and the associated quantile hedging strategy within the same solver. On classical benchmarks, numerical experiments are consistent with the Föllmer–Leukert results and confirm the accuracy and interpretability of the method. This is joint work with Cyril Bénézet and Sergio Pulido.

Speaker: Anna BAHRII (Université d'Evry Paris-Saclay)

Date: 18 November 2025

Title: Existence and uniqueness of solutions to McKean-Vlasov SDEs with distributional drift

Abstract: We study the existence and uniqueness of solutions (a.k.a. well-posedness) in weak and strong senses for McKean-Vlasov stochastic differential equations driven by an $\alpha$-stable process, $\alpha\in(1,2]$, when the interaction kernel is not a function but a distribution. We consider a particular structure of dependence on the law given by a spatial convolution. This structure has a double regularization by noise effect, which allows us to handle kernels belonging to the Besov space of negative regularity $-\alpha+\varepsilon$. In this talk, I will first briefly discuss the main phenomena of regularization by noise, illustrating how stochastic perturbations can restore well-posedness in otherwise ill-posed ODEs. Then, I will present the main results obtained for the considered McKean-Vlasov equation. I will also highlight the main subtleties of this SDE and the main techniques used in the proofs. This is a project under the supervision of Stéphane Menozzi.

Speaker: Jiarong FAN (Université d'Evry Paris-Saclay)

Date: 4 November 2025

Title: Mask-Conditional Conformal Prediction: Valid Uncertainty For All Missing Data Mechanisms

Abstract: Conformal prediction (CP) offers a principled framework for uncertainty quantification, but it fails to guarantee coverage when faced with missing covariates. In addressing the heterogeneity induced by various missing patterns, Mask-Conditional Valid (MCV) Coverage has emerged as a more desirable property than Marginal Coverage. In this work, we adapt split CP to handle missing values by proposing a preimpute-mask-then-correct framework that can offer valid coverage. We show that our method provides guaranteed Marginal Coverage and Mask-Conditional Validity for general missing data mechanisms. A key component of our approach is a reweighted conformal prediction procedure that corrects the prediction sets after distributional imputation (multiple imputation) of the calibration dataset, making our method compatible with standard imputation pipelines. We derive two algorithms and show that they are approximately marginally valid and MCV. We evaluate them on synthetic and real-world datasets: they significantly reduce the width of prediction intervals compared with standard MCV methods while maintaining the target guarantees.
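
As background for the abstract above, this is a minimal sketch of plain split conformal prediction with absolute-residual scores, the baseline that the talk adapts. It deliberately ignores missing values and reweighting, so it is not the mask-conditional method itself; the function names are hypothetical.

```python
import math


def split_conformal_quantile(calib_residuals, alpha):
    """Conformal quantile of absolute calibration residuals at miscoverage level alpha."""
    n = len(calib_residuals)
    # Rank ceil((n + 1) * (1 - alpha)) among the sorted scores gives marginal
    # coverage of at least 1 - alpha when the data are exchangeable.
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        return math.inf  # too few calibration points for this alpha
    return sorted(calib_residuals)[k - 1]


def prediction_interval(predict, x, q):
    """Symmetric interval of half-width q around the point prediction."""
    center = predict(x)
    return center - q, center + q
```

Here `calib_residuals` would hold the values |y - predict(x)| computed on a held-out calibration set, and `predict` is any fitted regression function.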

Speaker: Maxence MANSAIS (Université d'Evry Paris-Saclay)

Date: 7 October 2025

Title: Some general external forces and critical mild solutions for the fractional Navier-Stokes equations

Abstract: In this talk, I will discuss the construction of global-in-time critical mild solutions for the forced incompressible fractional Navier-Stokes equations, which differ from the classical Navier-Stokes equations through the introduction of a fractional diffusion. These solutions are obtained via a fixed-point formulation that relies on suitable estimates for the nonlinearity, the initial velocity and the external force. Many functional spaces have been considered in the literature, but the focus of this talk will be on some Banach function spaces exploiting the pointwise decay of the kernel appearing in the nonlinearity.

After first recalling the general framework of mild solutions and a few main difficulties in the classical case, I will discuss some functional spaces that allow us to construct solutions with initial data in the maximal critical Besov space, which is only reachable in the fractional case. One key point of the discussion will be to highlight how, despite the presence of a natural embedding of such spaces for the velocity, such a structure is much less obvious for the choice of the external force.

Speaker: Xiaozhen WANG (Université Paris-Dauphine PSL)

Date: 1 July 2025

Title: Convergence of Sinkhorn's Algorithm for Entropic Martingale Optimal Transport Problem

Abstract: We study the Entropic Martingale Optimal Transport (EMOT) problem on R. We begin by introducing the dual formulation and prove the exponential convergence of Sinkhorn's algorithm on the dual potential coefficients. Our analysis does not require prior knowledge of the optimal potential and confirms that there is no primal-dual gap. Our findings provide a theoretical guarantee for solving the EMOT problem using Sinkhorn's algorithm. In applications, our result provides insight into the calibration of stochastic volatility models, as proposed by Henry-Labordère.
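
To recall what Sinkhorn's algorithm does, here is a minimal sketch for the classical entropic optimal transport problem between two discrete measures. It omits the martingale constraint that is the subject of the talk, so it is the standard algorithm rather than the EMOT variant; the function name and the default values of `eps` and `n_iter` are assumptions.

```python
import math


def sinkhorn(mu, nu, cost, eps=0.5, n_iter=500):
    """Sinkhorn iterations for entropic optimal transport between discrete measures."""
    gibbs = [[math.exp(-c / eps) for c in row] for row in cost]
    u = [1.0] * len(mu)
    v = [1.0] * len(nu)
    for _ in range(n_iter):
        # Alternately rescale so the coupling matches the first, then the second marginal.
        u = [m / sum(g * vj for g, vj in zip(row, v)) for m, row in zip(mu, gibbs)]
        v = [
            n / sum(gibbs[i][j] * u[i] for i in range(len(mu)))
            for j, n in enumerate(nu)
        ]
    # Coupling P_ij = u_i * K_ij * v_j, where K is the Gibbs kernel exp(-cost / eps).
    return [[u[i] * gibbs[i][j] * v[j] for j in range(len(nu))] for i in range(len(mu))]
```

The scaling vectors `u` and `v` are the exponentials of the dual potentials divided by `eps`, so the convergence of the iterations can be read directly on the dual side.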

Speaker: Thi Bao Trâm NGO (Université d’Évry, LaMME)

Date: 10 June 2025

Title: Optimal Guaranteed Estimation Methods for the Cox–Ingersoll–Ross Models

Abstract: We study parameter estimation problems for Cox–Ingersoll–Ross (CIR) processes. For the first time, sequential estimation procedures are proposed for such models. In a non-asymptotic framework, the proposed procedures provide estimators with a guaranteed fixed mean square accuracy. For scalar parameter estimation problems, the proposed estimators exhibit non-asymptotic normality properties—even in cases where classical (non-sequential) maximum likelihood estimators cannot be computed. Moreover, Laplace transforms for the mean observation durations are derived. In the asymptotic regime, we find the limiting forms of the mean observation durations and show that the constructed sequential estimators converge uniformly in distribution to normal random variables. Finally, using the Local Asymptotic Normality (LAN) property, we establish sharp asymptotic lower bounds for the minimax risk over the class of all sequential procedures with the same mean observation duration, showing that our procedures are optimal in the minimax sense within this class.

Speaker: Yichuan Huang (Université Paris Nanterre, MODAL’X)

Date: 27 May 2025

Title: Model Selection for Nonparametric Estimation of the Diffusion Coefficient in SDEs

Joint work with Fabienne Comte (Univ. Paris Cité) and Nicolas Marie (Univ. Paris Nanterre).

Abstract: We propose an adaptive, nonparametric estimator for the diffusion function $\sigma(x)$ of a one-dimensional stochastic differential equation, based on a single high-frequency sample path. Our approach projects squared increments onto a finite Laguerre or Hermite basis, then selects the number of basis functions by minimizing a penalized least-squares criterion. This data-driven procedure automatically balances bias and variance. Numerical experiments, including a hyperbolic diffusion and a Cox–Ingersoll–Ross model, demonstrate both high accuracy and practical flexibility, making the method well suited to financial-modeling applications.

Speaker: Ewen Lallinec (Université Paris-Saclay, LMO)

Date: 13 May 2025

Title: High-order methods for Brillouin Zone integration

Abstract: The physical properties of materials are often described by singular integrals over the Brillouin zone, a structure with the topology of a torus. In this talk, we will introduce the quantum formalism and the types of integrals of interest, with a particular focus on the simple case of the Density of States (DOS). First, we will establish the framework for these integrals and outline the computational challenges they present. We will then review standard computational methods for evaluating these integrals and discuss recent advancements. Finally, we will present two recent approaches: the Brillouin Contour Deformation (BCD) technique, and the implicit integration method. Through their application to DOS computations, we demonstrate the efficiency of both methods while highlighting the specific challenges associated with this class of integrals.

Speaker: Aurélien Velleret (Université d’Évry, LaMME)

Date: 29 April 2025

Title: Epidemic dynamics on large random graphs and interaction kernels

Abstract: Predicting the spread of an epidemic within a population relies largely on simplified models in which interactions are represented through individual traits. These traits, such as the age or occupation of individuals, characterize the heterogeneity of the population. Standard sampling methods aim to provide estimates of the relationship between traits and interaction levels, that is, of the function defining the interaction kernel in the model. I include here the framework of stochastic block models, with a finite number of traits.

Starting from a stochastic individual-based description of an epidemic spreading on a random graph, we will analyze the dynamics as the size $n$ of the graph tends to infinity. How general is such a model reduction beyond the case of dense graphs, for which the notion of graphon (as a particular case of interaction kernel) was originally proposed?

With the typical SIS infection process between individuals, we recover in the limit an infinite-dimensional integro-differential equation studied by Delmas, Dronnier and Zitt (2022) for an SIS epidemic spreading on a graphon. This convergence covers both dense and sparse graphs, when the number of edges is of order $O(n^a)$ with $a\in(1,2]$ (the case of very sparse graphs with $a=1$ and bounded numbers of neighbors is of a different nature). This provides a validation of the current statistical assessment, even though individuals are generally in contact with only a small portion of the total population.

These results can be extended to more elaborate infection histories. When considering infectivity profiles that vary with the time elapsed since the infection event, we recover in the limit an infection-age structured process that extends the description proposed by Forien, Pang and Pardoux (2022) for a finite number of individual traits.

Speaker: Leonardo Martins Bianco (Université Paris-Saclay, LMO)

Date: 25 March 2025

Title: Robust Estimation and Outlier Detection for Stochastic Block Models via Subgraph Search

Abstract: Community detection is a fundamental task in graph analysis, with methods often relying on fitting models like the Stochastic Block Model (SBM) to observed networks. While many algorithms can accurately estimate SBM parameters when the input graph is a perfect sample from the model, real-world graphs rarely conform to such idealized assumptions. Therefore, robust algorithms are crucial: ones that can recover model parameters even when the data deviates from the assumed distribution. We propose SubSearch, an algorithm for robustly estimating SBM parameters by exploring the space of subgraphs in search of one that aligns with the model's assumptions. Our approach also functions as an outlier detection method, properly identifying nodes responsible for the graph's deviation from the model and going beyond simple techniques like pruning high-degree nodes. Experiments on both synthetic and real-world datasets demonstrate the effectiveness of our method.

Speaker: Yadh Hafsi (Université d’Évry, LaMME)

Date: 11 March 2025

Title: Optimal Execution under Incomplete Information

Abstract: We study optimal liquidation strategies under partial information for a single asset within a finite time horizon. We propose a model tailored for high-frequency trading, capturing price formation driven solely by order flow through mutually stimulating marked Hawkes processes. The model assumes a limit order book framework, accounting for both permanent price impact and transient market impact. Importantly, we incorporate liquidity as a hidden Markov process, influencing the intensities of the point processes governing bid and ask prices. Within this setting, we formulate the optimal liquidation problem as an impulse control problem. We elucidate the dynamics of the hidden Markov chain's filter and determine the related normalized filtering equations. We then express the value function as the limit of a sequence of auxiliary continuous functions, defined recursively. This characterization enables the use of a dynamic programming principle for optimal stopping problems and the determination of an optimal strategy. It also facilitates the development of an implementable algorithm to approximate the original liquidation problem. We enrich our analysis with numerical results and visualizations of candidate optimal strategies.

Speaker: Philippe Anjolras (Université Paris-Saclay, LMO)

Date: 21 February 2025

Title: Space-time resonances in the Born-Infeld system

Abstract: The Born-Infeld system is a nonlinear model of electromagnetism derived in 1934 which, in particular, makes the energy of an electron at rest finite without resorting to renormalization; its nonlinearity, however, is difficult to analyze. In 2004, Brenier proposed an extended model that simplifies the nonlinearity while preserving its remarkable properties. By studying this augmented system with the space-time resonance method introduced by Germain, Masmoudi and Shatah in 2008, we will explain how to obtain an asymptotic stability and scattering result for the Born-Infeld equations.

Speaker: Claire Alamichel

Date: 18 June 2024

Title: Modeling of cell motility and numerical simulations.

Abstract: After presenting the biological context of cell motility, I will present a model of this biological phenomenon that takes the cell nucleus into account. I will present a numerical scheme for simulating the model. I will then present numerical results that highlight the role of the nucleus in the dynamics of the cell and in its trajectories.

Speaker: Perrine Chassat

Date: 7 May 2024

Title: Functional Data Analysis and Shape Analysis in the Frenet-Serret Framework: Application to the Analysis of Sign Language Motion Trajectories.

Abstract: This thesis aims to determine the most suitable mathematical framework and relevant descriptors for analyzing motion trajectories in sign language. Building on the principles of motor control, we identified the framework defined by the Frenet-Serret formulas as particularly relevant for this task. By introducing new approaches to curve analysis based on the Frenet framework, this thesis thus contributes to the development of new methods in functional data analysis and shape analysis. The first part of this work addresses the challenge of smooth estimation of the Frenet curvature parameters, treating the problem as the estimation of the parameters of a differential equation in SO(d) (d >= 1). We introduce a functional EM algorithm that defines a unified estimation method for the variables in the group SE(3), providing smooth estimators that are more reliable and robust than existing methods. In the second part, two new representations of smooth curves in R^d are introduced, including the Square Root Curvatures (SRC) transform, which establishes a new Riemannian geometric framework that uses higher-order geometric information and depends on the parametrization, thereby outperforming the state-of-the-art Square-Root Velocity Function (SRVF) representation on synthetic results. Given a collection of curves, this type of geometry allows us to define efficient statistical criteria for estimating Karcher mean shapes on the associated Riemannian shape spaces, which prove particularly effective on noisy data. Finally, the framework developed here paves the way for more practical applications in sign language processing, including the study of power laws on our data and the development of a generative model for the motion of a point in sign language.

Speaker: Ludivine Obry

Date: 19 March 2024

Title: Weighted multiple testing procedures in genome-wide association studies

Abstract: With the recent development of sequencing technologies, it is now possible to carry out genome-wide association studies (GWAS) on a very large scale. In this context, the standard approach is to test each genetic marker individually. To limit the number of false positives, multiple testing procedures that control a global error rate are applied. However, classical approaches are limited, on the one hand, because the initial selection does not take advantage of prior information and expert knowledge and, on the other hand, by the difficulty of identifying rare variants that may nevertheless have large effects. Incorporating weights into multiple testing procedures can then be a solution. In this work, several weighted multiple testing procedures were evaluated in the specific context of GWAS. An original approach that improves the power to detect rare variants while maintaining good overall power was also introduced. The procedures were evaluated through a simulation study, whose results show that the proposed approach performs well compared with existing procedures. The methods were also applied to a real dataset.

Speaker: Gaston Vergara-Hermosilla

Date: 9 January 2024

Title: Some new ideas in fluid mechanics

Abstract: In this talk we will present some recently obtained results on the existence, uniqueness and regularity of solutions to certain nonlinear equations arising in fluid mechanics. The talk will begin with a presentation of the basic elements with which we will develop our ideas, and will then proceed to the proofs of the theorems.

Speaker: Paulin Aubert

Date: 5 December 2023

Title: Solving stochastic control problems by reinforcement learning

Abstract: Many problems in quantitative finance require solving stochastic control problems. In the absence of explicit solutions, the solutions of these problems can be estimated by methods that quickly run into limitations as the dimension increases. Machine learning appears as a natural way to overcome this difficulty, often referred to as the “curse of dimensionality”. In this presentation, we introduce the numerical method developed to solve stochastic control problems. Based on reinforcement learning, this method allows us to recover the theoretical results stated by M. Jeanblanc-Picqué and A. N. Shiryaev in 1995, and shows convincing performance when the problem becomes more complex and the explicit solution remains unknown.

Speaker: Arnaud Liehrmann

Date: 14 November 2023

Title: Multiscale analysis of transcriptome: methodological and algorithmic developments

Abstract: My work can be divided into two main parts. First, I have designed tools dedicated to the differential analysis of the transcriptome. Second, I have developed and applied multiple changepoint detection methods for genomic datasets.

The remarkable diversity of RNA isoforms, besides alternative transcription initiation sites, is primarily attributable to post-transcriptional modifications. These alterations span an array of events that can occur along RNA molecules including splicing, processing, alternative polyadenylation, editing, and base modification. The advent of high-throughput transcriptomics has catalyzed an unprecedented understanding of this diversity. However, the analysis of such data presents substantial statistical, computational, technical, and biological challenges.

I actively contributed to the development of two methods, DiffSegR and comaturationTrackeR, dedicated to the differential analysis of transcriptomes. These methods are built to alleviate the complications arising from studying, often unannotated, individual isoforms, focusing instead on event-by-event or pairwise analyses. DiffSegR empowers the identification of transcriptome-wide expression differences across two biological conditions using RNA-Seq data. With the integration of a multiple changepoint detection algorithm, it precisely delineates the boundaries of differentially expressed regions/events, eliminating the necessity for prior annotations. On the other hand, comaturationTrackeR, utilizing long-read RNA-seq data, is tailored for the detection of transcriptome-wide co-maturations, that is, dependencies between pairs of maturation events such as editing and splicing. Crucially, both methods are integrated with the DESeq2 statistical framework. This inclusion allows for rigorous testing of expression differences and co-maturations. Furthermore, these methods have been encapsulated into R packages, ensuring user-friendliness for both biologists and bioinformaticians. The output from these packages is designed to create IGV (Integrative Genomics Viewer) tracks and/or Bioconductor objects. These approaches have proven their effectiveness through practical applications on the transcriptomes of chloroplasts, mitochondria, and bacteria. Importantly, many of the findings have been validated molecularly. This includes a published list of co-matured events within the chloroplast of Arabidopsis thaliana, a comprehensive list of 3' and 5' termini extensions of transcripts, as well as the accumulation of antisense RNA and introns from two A. thaliana mutants for chloroplast ribonucleases, Mini-III and PNPase. It also includes potential candidates for direct degradation by Rae1 in Bacillus subtilis.

Another facet of my thesis involves the development and application of multiple changepoint detection methodologies on genomic datasets. The popularity of these models in genomics stems from their inherent capability to reveal unannotated biological events along the genome, such as expression differences resulting from splicing variations (as exemplified in DiffSegR). Various dynamic programming algorithms aimed at maximizing a penalized likelihood have been proposed over the years. These algorithms and the contrasts they optimize display remarkable computational and statistical properties, with their speed performance being a rationale for their use with genomic data. Building upon this line of research, I have designed and implemented an exact and efficient dynamic programming algorithm, Ms.FPOP. This algorithm optimizes a least squares criterion and incorporates a multiscale penalty, which has been demonstrated to possess superior statistical properties compared to the standard least squares criterion with a Bayesian information criterion. Ms.FPOP employs functional pruning techniques to accelerate the computation time from quadratic (the best-known algorithmic speed so far) to on average log-linear relative to the length of the signal. Ms.FPOP is implemented in C++ and is interfaced with R for user-friendly access. I have conducted extensive testing of Ms.FPOP across a wide variety of simulated scenarios, and the results have been promising. Concurrently, I have applied multiple changepoint detection algorithms to genomic datasets, and observed that these methods improve on the current state-of-the-art methods for detecting differentially expressed regions in RNA-Seq data and peaks in ChIP-Seq data.

Speaker: Assia Benmehdia

Date: 6 June 2023

Title: Analysis of the genomic structure of Drosophila melanogaster: is the evolution of duplicated genes related to their environment in transposable elements?

Abstract: Within genomes, duplicated genes (paralogs) are formed by different mechanisms such as the complete duplication of the whole genome, the action of transposable elements (TEs), segmental duplications, and tandem duplications (1). These genes, after duplication, can be subjected to various evolutionary processes allowing their maintenance or their loss (acquisition of a new function (neo-functionalization), sharing of the ancestral function (sub-functionalization), pseudogenization, functional redundancy by dosage effect) (1). Duplicated genes constitute families of genes and are of great importance in the formation of new genes and in creating genetic novelty in organisms. Many new gene functions have evolved through this mechanism. However, the processes allowing the maintenance of duplicated genes within genomes remain poorly understood. In particular, little is known about the influence of TEs at this level. TEs are repeated sequences that have the ability to move within the genome. They are now recognized as having a significant impact on the evolution of genomes and the adaptation of species (2). In the model species Drosophila melanogaster, it has been shown that duplicated genes constitute around 40% of all genes (1), the majority of which are thought to be the result of tandem duplications (3). The most recent duplicated genes seem more often subject to the neo-functionalization mechanism (4) and their functions are mainly linked to responses to environmental stresses (5). Within this genome, we find around 15% of TEs, the distribution of which is not random (6). We can therefore wonder about the importance of TEs in the maintenance of the different families of genes in this species.

(1) Zhang. Trends Genet. 2003 
(2) Bourque et al. Genome Biol. 2018 
(3) Zhou et al. Genome Res. 2008 
(4) Assis and Bachtrog. PNAS 2013
(5) Zhong et al. BMC genomics 2013
(6) Adams et al. Science 2000

Invité : Kylliann De Santiago

Date : 25 mai 2023

Titre : Mixture of stochastic block models for multiview clustering

Résumé : Networks are widely used to model complex problems because they describe relationships efficiently. These networks often come from different sources of information, which do not necessarily carry the same knowledge. In this work, we propose an original method for aggregating multiple clusterings obtained from different sources of information. Each partition is encoded by a co-membership matrix between observations. Our approach uses a mixture of Stochastic Block Models (SBM) to group co-membership matrices with similar information into components and to partition observations into different clusters, taking into account their specificities within the components.

Invitée : Liudmila Pishchagina

Date : 9 mai 2023

Titre : Geometric-Based Pruning Rules For Change-Point Detection in Multivariate Time-Series

Résumé : We study multiple change-point detection problems for multivariate independent time-series by pruned dynamic programming algorithms optimizing a penalized likelihood. When the number of changes is proportional to the data length, an inequality-based pruning rule (as in PELT) leads to a linear time complexity. Another method, called functional pruning, gives a close-to-linear time complexity whatever the number of changes, but only for univariate models. Functional pruning works by updating the set of parameter values for which a change is optimal. As soon as this set is empty, the change is pruned. In dimension p = 1, this set is a union of intervals in R that is easy to describe and update. When the dimension p is greater than or equal to 2, this set can be non-convex and disconnected because it is obtained as the intersection and difference of sets in R^p. This complicates the implementation of pruning. We propose an extension of functional pruning using simple geometric shapes (balls and rectangular parallelotopes) for some multivariate parametric models (Gaussian, Poisson, Negative Binomial). In a simulation study, we empirically assess the efficiency of our geometric-based pruning rule and show that it is faster than PELT when the dimension p is less than 5.
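
The inequality-based pruning rule mentioned in this abstract can be illustrated with a minimal sketch. It assumes a univariate signal with changes in mean, a least-squares cost, and a constant penalty `beta` per change. The function name `pelt_mean_changes` and these modelling choices are illustrative assumptions, not the speaker's implementation, and the geometric functional pruning proposed in the talk is not shown.

```python
from itertools import accumulate


def pelt_mean_changes(y, beta):
    """Penalized least-squares changepoint detection with PELT-style pruning.

    Returns the sorted list of indices t such that a new segment starts at y[t].
    """
    n = len(y)
    # Prefix sums give the cost of any segment in O(1).
    s1 = [0.0] + list(accumulate(y))
    s2 = [0.0] + list(accumulate(v * v for v in y))

    def cost(a, b):
        # Residual sum of squares of y[a:b] around its mean (requires a < b).
        total = s1[b] - s1[a]
        return (s2[b] - s2[a]) - total * total / (b - a)

    best = [0.0] * (n + 1)  # best[t]: optimal penalized cost of y[:t]
    best[0] = -beta         # so that the first segment carries no penalty
    last = [0] * (n + 1)    # last[t]: start index of the final segment of y[:t]
    candidates = [0]        # segment starts that can still be optimal

    for t in range(1, n + 1):
        base = [best[s] + cost(s, t) for s in candidates]
        k = min(range(len(candidates)), key=base.__getitem__)
        best[t] = base[k] + beta
        last[t] = candidates[k]
        # Inequality-based pruning: drop s once best[s] + cost(s, t) > best[t],
        # since such an s cannot start the last segment at any later time.
        candidates = [s for s, b in zip(candidates, base) if b <= best[t]]
        candidates.append(t)

    # Backtrack through the stored segment starts.
    changes, t = [], n
    while last[t] > 0:
        t = last[t]
        changes.append(t)
    return sorted(changes)
```

For example, `pelt_mean_changes([0.0] * 10 + [5.0] * 10, beta=2.0)` recovers the single change at index 10. When changes are frequent, the candidate list stays short, which is the source of the linear behaviour described above; on a signal with no change, few candidates are discarded and the cost approaches that of the unpruned quadratic recursion.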

Invité : Mathis Fitoussi

Date : 11 avril 2023

Titre : Heat kernel estimates for stable SDEs with a singular drift

Résumé : We consider the SDE $\mathrm{d} X_t = b(t,X_t) \mathrm{d}t + \mathrm{d} Z_t \quad (E)$, where the drift $b$ is highly irregular (a distribution, for example) and the noise $Z_t$ is a stable process. For this type of equation, it has been proved that the noise can restore a form of existence and uniqueness of the solution. This talk is an introduction to singular drifts, to the different notions of solution compatible with equation $(E)$, and to how they are handled. We will present recently obtained results, which consist of heat kernel estimates for the density of the process $X_t$ solving $(E)$.

Invité : Salim Amoukou

Date : 28 mars 2023

Titre : Distribution-Free Uncertainty Quantification

Résumé : Machine learning techniques offer single point predictions, such as mean estimates for regression and class labels for classification, without providing any indication of uncertainty or reliability. This can be a major concern in high-stakes applications where precision is vital. Accuracy alone does not suffice for reliable, consequential decision-making; we also need uncertainty. Distribution-free Uncertainty Quantification gives finite-sample statistical guarantees for any predictive model, no matter how bad or misspecified, and any data distribution, even if unknown. I will introduce Conformal Prediction, a universal framework that constructs a prediction interval $C(X_{n+1})$ for the unseen response $Y_{n+1}$ given a new feature $X_{n+1}$, with a finite-sample (non-asymptotic) coverage guarantee and without making any assumptions about the distribution or the model.
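
As an illustration of how such an interval $C(X_{n+1})$ can be built, here is a minimal sketch of split conformal prediction for regression. It assumes a model already fitted on separate data, absolute residuals as the nonconformity score, and calibration and test points that are exchangeable. The function name `split_conformal_interval` and its parameters are hypothetical and not taken from the talk.

```python
import math


def split_conformal_interval(predict, x_cal, y_cal, x_new, alpha=0.1):
    """Interval for the response at x_new with marginal coverage >= 1 - alpha.

    `predict` is any fitted point predictor; (x_cal, y_cal) is held-out
    calibration data that was not used to fit it.
    """
    # Nonconformity scores: absolute residuals on the calibration set.
    scores = sorted(abs(y - predict(x)) for x, y in zip(x_cal, y_cal))
    n = len(scores)
    # Finite-sample corrected rank: ceil((n + 1) * (1 - alpha)).
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this level: only the trivial
        # interval carries the guarantee.
        return (-math.inf, math.inf)
    q = scores[k - 1]  # k-th smallest calibration score
    center = predict(x_new)
    return (center - q, center + q)
```

The interval width depends only on the calibration residuals, so a poor model still yields valid coverage, only with wider intervals. The guarantee is marginal (averaged over calibration and test draws), not conditional on the value of $X_{n+1}$.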

Invité : Alejandro Bandera Moreno (Université de Séville)

Date : 14 mars 2023

Titre : An introduction to model order reduction in differential equations.

Résumé : In this talk, we present the difficulties that arise when numerically solving a parametric partial differential equation (PDE) or a parametric system of ordinary differential equations (SDE). We will then explain some methods developed to deal with these problems. More precisely, we will focus on three methods: Proper Orthogonal Decomposition (POD) for SDE, Reduced Basis (RB) for a turbulence PDE model and Proper Generalized Decomposition (PGD) for symmetric PDE problems.

Invité : David Llerena

Date : 14 février 2023

Titre : On the local regularity of some fluid mechanics models

Résumé : In this talk, we are interested in the local regularity of the incompressible micropolar fluid equations. This system, which involves three variables (the velocity, the microrotation velocity and the pressure), describes the behavior of fluids with microstructures. Our goal is to study the regularity of this system and to highlight some relations between the variables. We will first present a recent result that yields a gain of integrability for the microrotation velocity from certain Morrey-type information on the velocity, so that one variable dominates the other. Finally, we will present its application to partial regularity theory.

Invitée : Elisabetta Brocchieri

Date : 18 janvier 2023

Titre : Cross-diffusion systems induced by dietary diversity.

Résumé : Cross-diffusion systems are nonlinear parabolic systems arising in biology and ecology. In this talk, we study the existence of weak solutions to a class of triangular cross-diffusion systems, induced by dietary diversity, that apply to population dynamics. We rigorously justify the passage from a reaction-diffusion system with linear diffusion and competitive interactions to a cross-diffusion system, obtained as a fast-reaction limit. The tools used to pass rigorously to the limit include a priori estimates, given by the analysis of an entropy functional, and a compactness argument.
