Résumé
Mathematician & AI researcher seeking a 6-month pre-PhD internship starting April 2025 to advance multimodal human-computer interaction through AR/VR and brain-computer interface.
My recent projects combine research and engineering under complex constraints for health and assistive applications :
• Applying differential privacy to LLMs trained on structured health records
• Embedding VLMs on edge devices for real-time applications
• Building an E2E video collection, annotation and pre-processing pipeline to train AVSR model for a post-laryngectomy voice-preserving app
• Developing interpretable computational histopathology for clinically actionable solutions
Award-winning in AI challenges (QRT, Doctolib, Dust, Lovable…)
Expériences professionnelles
Data scientist intern
OWKIN , Paris
De Mars 2025 à Août 2025
Advisor Pierre-Antoine Bannier
Medical Imaging for histo-interpretability.
Data scientist student
Paris Digital Lab , Paris - Autres
De Septembre 2024 à Février 2025
- Florence-2 Deployment @ OOrion
Embedded Florence-2 on iPhone for real-time object detection (TTFT 400ms, TPOT 20ms) for visually impaired
- French Lipreading @ La Ligue Contre Le Cancer
Built a video processing pipeline that ultimately gathered a high-quality, French, lip-focused annotated dataset (+500h) and benchmarked and trained SOTA open-source English Audio-Visual Speech Recognition models for a post-laryngectomy voice-preserving app.
- Software Automation @ Biogoup
Automated FSE processing for Biogroup labs by integrating browser (Playwright) and desktop (RobotJS) automation into a NestJS backend with React frontend, enabling secure CPS card authentication and streamlined teletransmission via Kalisil and Pyxbio under strict constraints.
Ai research intern
Université PSL , Paris
De Mars 2024 à Août 2024
Advisors J. Atif and O. Cappé
During my research internship with the MILES team, I investigated privacy vulnerabilities in language models applied to structured medical data. Using Medinote—a fine-tuned version of Meditron trained on JSON medical records—I introduced a novel targeted extraction (linkage) attack that leverages auxiliary public data to recover sensitive patient information. I designed and evaluated conditional sampling strategies, including prefix conditioning, and analyzed sampling schemes aimed at maximizing joint probability over full sequences. I also applied sequence-level perplexity metrics to perform Membership Inference Attacks, enabling more accurate discrimination between in- and out-of-distribution data to improve the filtering of non-member candidate samples.
Alongside my research, I conducted an extensive literature review on medical LLMs, privacy attacks (extraction, membership inference), DP techniques, unlearning, ICL, and advanced transformer variants (e.g., KNN-LLM, MoE, SSMs). I deepened my understanding by re-implementing models like bigrams, BPE, decoder-only transformers, and MoE in PyTorch, and by revisiting theoretical results on benign overfitting. I also fast-tracked my training in Differential Privacy through coursework (M2 IASD) and Dwork’s materials, and participated in reading groups on advanced topics such as higher-order optimization and optimal transport.
Math tutor
Complétude , Paris
De Septembre 2023 à Avril 2025
Helped students build strong foundations and deepen conceptual understanding through structured exercises, step-by-step explanations, and advanced problem-solving, fostering autonomy and broader insight into mathematical methods.
Software engineer intern
Evenium , Paris
De Juin 2018 à Juin 2018
During my high school internship at Evenium, a Paris-based company specializing in digital tools for events, I explored different departments and gained insights into various job roles. With a strong interest in programming, I dedicated my second week to developing a Web Data Connector that pulled event data from Evenium’s API and integrated it with Tableau for reporting. I also created a basic UI using HTML and CSS to make the tool user-friendly. This experience deepened my passion for coding and gave me hands-on exposure to real-world software development.
Formations complémentaires
Polytechnique Paris
Échange - Maths Appliquées
2022 à 2023
Relevant courses
• Deep Learning
• Foundation of Machine Learning
• Optimization and Control
• Monte Carlo Methods
• Modelling Random Phenomena
• Game Theory
• Stochastic Models in Finance
• Algorithmic Geometry: From Theory to Application
• Large Scale Mathematical Optimization
• Modal d'informatique - Mining, Learning and Reasoning on Web Graphs
Université Paris-Saclay
Master 1 (M1) Mathématiques et Applications - Math&IA
2023 à 2024
Relevant courses
• Mathematics for Artificial Intelligence
• Mathematics of image processing and analysis, and their surprising applications
• From Modeling to Statistical Learning
• Supervised Statistical Learning
• Data Challenge in Predictive Modeling
• Introduction to Deep Learning
• ML Algorithms
• Foundational Principles of ML
• Numerical Optimisation
• Database
CentraleSupélec
Digital Tech Year (DTY) - Data Science, AI Research/Engineer, Software Engineer
2024 à 2025
Selected for the Digital Tech Year, a top-tier prototyping program (50 students/year), and awarded the MathTech Gap Year fellowship (4 laureates, FMJH). This experience bridges real-world AI innovation with my PhD-oriented research goals.
ENS Paris-Saclay
(M2) MRes Mathematics, Vision, Learning (MVA) - Track Health
2025 à 2026
Associations
Recipient of the MathTech scholarship
https://www.fondation-hadamard.fr/en/
Fondation Mathématique Jacques Hadamard (FMJH)
Parcours officiels
Langues
Anglais - Courant
Français - Langue maternelle
Compétences
Centres d'intérêt
- Guitar
- Powerlifting
- Running