Skip to content
View BastienDussap's full-sized avatar

Highlights

  • Pro

Block or report BastienDussap

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BastienDussap/README.md

Data Scientist

About Me

I'm a Data Scientist at Metafora Biosystems biotechnology company based at the Cochin Hospital (Paris 14), working on METAflow a novel AI-powered tool for flow cytometry analysis.
Former PhD student in Machine Learning / Statistics at University Paris-Saclay, affiliated with the Institut de Mathématiques d’Orsay and part of the Datashape team at INRIA, under the supervision of Gilles Blanchard and Marc Glisse.

Thesis Overview

My thesis focused on the comparison of cytometric datasets, particularly in the context of Metafora's software, Metaflow. Metaflow enables the automatic analysis of flow cytometry data. My work involves leveraging machine learning models to transfer analysis from one sample to a new, unanalyzed one. This process relies on Reproducing Kernel Hilbert space to embed and store high-dimensional features in Euclidean space. The goal is twofold: estimating the proportions of each population in a new sample and automatically naming the clusters obtained by the software.

Research Interests

  • Machine Learning
  • Label Shift and Quantification Learning
  • Kernel Mean Embedding and kernel methods in general

Publications

  • "Label Shift Quantification with Robustness Guarantees via Distribution Feature Matching" (with G. Blanchard and B. Chérief-Abdellatif) - ArXiv preprint. This paper was published at ECML/PKDD 2023 and obtained the Research Tracks – Best Student Paper Award. Proceedings.

Talks

Invited Talks

  • Journées de Statistique de la Société Française de Statistique, 2023.
  • DataShape Seminar, 2023.
  • Séminaire des doctorants de l'équipe Probabilité et Statistiques de l'Institut de Mathématiques d'Orsay, 2023.
  • ECML/PKDD 2023, Label Shift Quantification with Robustness Guarantees via Distribution Feature Matching (RT Track – Best Student Paper).
  • Workshop Efficient Statistical Testing for high-dimensional model (FAST-BIG).

Poster Presentations

  • ECML/PKDD 2023, Label Shift Quantification with Robustness Guarantees via Distribution Feature Matching (RT Track – Best Student Paper).

Seminar

I co-organize a seminar for master students in Statistics and Machine Learning at Université Paris-Saclay.

Popular repositories Loading

  1. ScientificProgamming ScientificProgamming Public

    PRE4 : Scientific Programming

  2. BastienDussap.github.io BastienDussap.github.io Public

    Forked from daattali/beautiful-jekyll

    ✨ Fork of https://beautifuljekyll.com for my website

    HTML

  3. qunfold qunfold Public

    Forked from mirkobunse/qunfold

    A unified implementation of quantification and unfolding algorithms

    Python

  4. BastienDussap BastienDussap Public

    My ReadMe

  5. FlowUtils FlowUtils Public

    Forked from whitews/FlowUtils

    FlowUtils is a Python package containing various utility functions related to flow cytometry analysis, primarily focused on compensation and transformation tasks commonly used within the flow commu…

    Python