profile photo of a member

Gary KLAJER

Summary

Mathematician & AI researcher seeking a 6-month pre-PhD internship starting April 2025 to advance multimodal human-computer interaction through AR/VR and brain-computer interface.

My recent projects combine research and engineering under complex constraints for health and assistive applications :
• Applying differential privacy to LLMs trained on structured health records
• Embedding VLMs on edge devices for real-time applications
• Building an E2E video collection, annotation and pre-processing pipeline to train AVSR model for a post-laryngectomy voice-preserving app
• Developing interpretable computational histopathology for clinically actionable solutions

Award-winning in AI challenges (QRT, Doctolib, Dust, Lovable…)

Professional experiences

Data scientist intern

OWKIN , Paris

From March 2025 to August 2025

Advisor Pierre-Antoine Bannier
Medical Imaging for histo-interpretability.

Data scientist student

Paris Digital Lab , Paris - Others

From September 2024 to February 2025

- Florence-2 Deployment @ OOrion
Embedded Florence-2 on iPhone for real-time object detection (TTFT 400ms, TPOT 20ms) for visually impaired

- French Lipreading @ La Ligue Contre Le Cancer
Built a video processing pipeline that ultimately gathered a high-quality, French, lip-focused annotated dataset (+500h) and benchmarked and trained SOTA open-source English Audio-Visual Speech Recognition models for a post-laryngectomy voice-preserving app.

- Software Automation @ Biogoup
Automated FSE processing for Biogroup labs by integrating browser (Playwright) and desktop (RobotJS) automation into a NestJS backend with React frontend, enabling secure CPS card authentication and streamlined teletransmission via Kalisil and Pyxbio under strict constraints.

Ai research intern

Université PSL , Paris

From March 2024 to August 2024

Advisors J. Atif and O. Cappé
During my research internship with the MILES team, I investigated privacy vulnerabilities in language models applied to structured medical data. Using Medinote—a fine-tuned version of Meditron trained on JSON medical records—I introduced a novel targeted extraction (linkage) attack that leverages auxiliary public data to recover sensitive patient information. I designed and evaluated conditional sampling strategies, including prefix conditioning, and analyzed sampling schemes aimed at maximizing joint probability over full sequences. I also applied sequence-level perplexity metrics to perform Membership Inference Attacks, enabling more accurate discrimination between in- and out-of-distribution data to improve the filtering of non-member candidate samples.

Alongside my research, I conducted an extensive literature review on medical LLMs, privacy attacks (extraction, membership inference), DP techniques, unlearning, ICL, and advanced transformer variants (e.g., KNN-LLM, MoE, SSMs). I deepened my understanding by re-implementing models like bigrams, BPE, decoder-only transformers, and MoE in PyTorch, and by revisiting theoretical results on benign overfitting. I also fast-tracked my training in Differential Privacy through coursework (M2 IASD) and Dwork’s materials, and participated in reading groups on advanced topics such as higher-order optimization and optimal transport.

Math tutor

Complétude , Paris

From September 2023 to April 2025

Helped students build strong foundations and deepen conceptual understanding through structured exercises, step-by-step explanations, and advanced problem-solving, fostering autonomy and broader insight into mathematical methods.

Software engineer intern

Evenium , Paris

From June 2018 to June 2018

During my high school internship at Evenium, a Paris-based company specializing in digital tools for events, I explored different departments and gained insights into various job roles. With a strong interest in programming, I dedicated my second week to developing a Web Data Connector that pulled event data from Evenium’s API and integrated it with Tableau for reporting. I also created a basic UI using HTML and CSS to make the tool user-friendly. This experience deepened my passion for coding and gave me hands-on exposure to real-world software development.

Additionals trainings

Polytechnique Paris

Échange - Maths Appliquées

2022 à 2023

Relevant courses
• Deep Learning
• Foundation of Machine Learning
• Optimization and Control
• Monte Carlo Methods
• Modelling Random Phenomena
• Game Theory
• Stochastic Models in Finance
• Algorithmic Geometry: From Theory to Application
• Large Scale Mathematical Optimization
• Modal d'informatique - Mining, Learning and Reasoning on Web Graphs

Université Paris-Saclay

Master 1 (M1) Mathématiques et Applications - Math&IA

2023 à 2024

Relevant courses
• Mathematics for Artificial Intelligence
• Mathematics of image processing and analysis, and their surprising applications
• From Modeling to Statistical Learning
• Supervised Statistical Learning
• Data Challenge in Predictive Modeling
• Introduction to Deep Learning
• ML Algorithms
• Foundational Principles of ML
• Numerical Optimisation
• Database

CentraleSupélec

Digital Tech Year (DTY) - Data Science, AI Research/Engineer, Software Engineer

2024 à 2025

Selected for the Digital Tech Year, a top-tier prototyping program (50 students/year), and awarded the MathTech Gap Year fellowship (4 laureates, FMJH). This experience bridges real-world AI innovation with my PhD-oriented research goals.

ENS Paris-Saclay

(M2) MRes Mathematics, Vision, Learning (MVA) - Track Health

2025 à 2026

Associations

Recipient of the MathTech scholarship

https://www.fondation-hadamard.fr/en/

Fondation Mathématique Jacques Hadamard (FMJH)

Degree

Bachelor – Mathématiques – Mathématiques (MA) – SB – 2023

Languages

Anglais - Fluent

Français - Native language

Skills

PyTorch
Scikit-learn
numpy
Pandas
seaborn
R, Python, Matlab coding expertise
postgresql
C++
Java
Deep learning
Machine learning
graph theory
Convolutional Neural Nets
Optimisation
Statistics
Bayesian Statistics
Statistical Modelling
Stochastic processes
Linear Algebra
Analysis
Data Science
Vision Language Model (VLM)
CoreML
Swift
TorchScript
Edge AI

Hobbies

  • Guitar
  • Powerlifting
  • Running