Summary
Guider les projets d'IA porteurs de demain en santé, environnement et énergie.
Professional experiences
Data scientist student
Paris Digital Lab , Paris - Others
From September 2024 to Today
Agile prototyping at the @ParisDigitalLab Incubator, developing 3 Minimum Viable Products (MVP) in 7-week constraints for real-world businesses, using scrum methodology.
- Implemented an embedded VLM model for real-time VQA, object detection and tracking on iPhone @ Oorion
Ai research intern
Université Paris-Dauphine, PSL , Paris - Temporary contract
From March 2024 to August 2024
I conducted a security audit of Medinote, a medical language model, focusing on patient data leakage risks. My research introduced a novel approach by executing targeted extraction attacks using auxiliary public data to recover private information. I explored how statistical patterns, such as distributions of patient attributes, were encoded within the model and developed sequence-based perplexity metrics to distinguish in-distribution from out-of-distribution data. This could enhance the filtering of non-member data from generated candidate samples.
This work was done at PariSanté Campus within the MILES team, under the supervision of Prof. Jamal Atif and Prof. Olivier Capet and collaboration of Danil Savine. The MILES project-team is hosted by the LAMSADE research unit at Université Paris Dauphine-PSL, France and is heavily involved in the Paris Artificial Research Intelligence Institute (PRAIRIE), a national AI research initiative funded by the 3AI Plan.
In addition to my research work, I:
• Conducted a comprehensive literature review of over 100 papers covering topics such as medical LLMs, memorization, extraction attacks, membership inference attacks, DP-preserving techniques, unlearning, in-context learning (ICL), and cutting-edge transformer architectures (e.g., KNN-LLM, Matryoshka Embedding, Mixture of Experts, State Space Models).
• Re-implemented bigram models, BPE, decoder-only transformers, and MoE from scratch using PyTorch to deepen my understanding of these architectures.
• Re-proved theorems on benign overfitting from Bartlett et al.'s work.
• Fast-tracked Differential Privacy (DP) by following the "DP for ML" course from M2 IASD @ Paris Dauphine and completing Dwork's textbook and programming notebooks on DP.
• Attended reading groups on advanced mathematical topics like higher-order optimization, optimal transport, automatic architecture search, etc.
Additionals trainings
Polytechnique Paris
Échange - Maths Appliquées
2022 à 2023
Université Paris-Saclay
M1 Mathématiques et Applications track Maths&IA - Maths appliquées et Intelligence Artificielle
2023 à 2024
CentraleSupélec
Digital Tech Year (DTY) - Data Science, AI Research/Engineer, Software Engineer
2024 à 2025
The Digital Tech Year is a highly selective excellence program, admitting only 50 top candidates, most from CentralSupélec, with the rest from Paris-Saclay University and postdoctoral researchers. It combines intensive prototyping with real-world projects, followed by a six-month paid internship in industry.
In 2024, I was honored to be one of the four laureates of the MathTech gap year, funded by the FMJH. As I aspire to pursue a PhD, this program offers a unique opportunity to bridge cutting-edge AI applications with my research ambitions.
Associations
Recipient of the MathTech scholarship
https://www.fondation-hadamard.fr/en/
Fondation Mathématique Jacques Hadamard (FMJH)
Degree
Skills
Hobbies
- Guitar
- Powerlifting
- Running