photo de profil d'un membre

Gary Gary KLAJER

Résumé

Guider les projets d'IA porteurs de demain en santé, environnement et énergie.

Expériences professionnelles

Data scientist student

Paris Digital Lab , Paris - Autres

De Septembre 2024 à Aujourd'hui

Agile prototyping at the @ParisDigitalLab Incubator, developing 3 Minimum Viable Products (MVP) in 7-week constraints for real-world businesses, using scrum methodology.
- Implemented an embedded VLM model for real-time VQA, object detection and tracking on iPhone @ Oorion

Ai research intern

Université Paris-Dauphine, PSL , Paris - CDD

De Mars 2024 à Août 2024

I conducted a security audit of Medinote, a medical language model, focusing on patient data leakage risks. My research introduced a novel approach by executing targeted extraction attacks using auxiliary public data to recover private information. I explored how statistical patterns, such as distributions of patient attributes, were encoded within the model and developed sequence-based perplexity metrics to distinguish in-distribution from out-of-distribution data. This could enhance the filtering of non-member data from generated candidate samples.

This work was done at PariSanté Campus within the MILES team, under the supervision of Prof. Jamal Atif and Prof. Olivier Capet and collaboration of Danil Savine. The MILES project-team is hosted by the LAMSADE research unit at Université Paris Dauphine-PSL, France and is heavily involved in the Paris Artificial Research Intelligence Institute (PRAIRIE), a national AI research initiative funded by the 3AI Plan.

In addition to my research work, I:
• Conducted a comprehensive literature review of over 100 papers covering topics such as medical LLMs, memorization, extraction attacks, membership inference attacks, DP-preserving techniques, unlearning, in-context learning (ICL), and cutting-edge transformer architectures (e.g., KNN-LLM, Matryoshka Embedding, Mixture of Experts, State Space Models).
• Re-implemented bigram models, BPE, decoder-only transformers, and MoE from scratch using PyTorch to deepen my understanding of these architectures.
• Re-proved theorems on benign overfitting from Bartlett et al.'s work.
• Fast-tracked Differential Privacy (DP) by following the "DP for ML" course from M2 IASD @ Paris Dauphine and completing Dwork's textbook and programming notebooks on DP.
• Attended reading groups on advanced mathematical topics like higher-order optimization, optimal transport, automatic architecture search, etc.

Formations complémentaires

Polytechnique Paris

Échange - Maths Appliquées

2022 à 2023

Université Paris-Saclay

M1 Mathématiques et Applications track Maths&IA - Maths appliquées et Intelligence Artificielle

2023 à 2024

CentraleSupélec

Digital Tech Year (DTY) - Data Science, AI Research/Engineer, Software Engineer

2024 à 2025

The Digital Tech Year is a highly selective excellence program, admitting only 50 top candidates, most from CentralSupélec, with the rest from Paris-Saclay University and postdoctoral researchers. It combines intensive prototyping with real-world projects, followed by a six-month paid internship in industry.

In 2024, I was honored to be one of the four laureates of the MathTech gap year, funded by the FMJH. As I aspire to pursue a PhD, this program offers a unique opportunity to bridge cutting-edge AI applications with my research ambitions.

Associations

Recipient of the MathTech scholarship

https://www.fondation-hadamard.fr/en/

Fondation Mathématique Jacques Hadamard (FMJH)

Parcours officiels

Bachelor – Mathématiques – Mathématiques (MA) – SB – 2023

Compétences

PyTorch
Scikit-learn
numpy
Pandas
seaborn
R, Python, Matlab coding expertise
postgresql
C++
Java
Deep learning
Machine learning
graph theory
Convolutional Neural Nets
Optimisation
Statistics
Bayesian Statistics
Statistical Modelling
Stochastic processes
Linear Algebra
Analysis
Data Science
Vision Language Model (VLM)
CoreML
Swift
TorchScript
Edge AI

Centres d'intérêt

  • Guitar
  • Powerlifting
  • Running