cv

Here is my resume, you can also download the pdf version by clicking on the button on the top right ↗️.

Basics

Name Mathieu Laï-king
Label PhD Student
Email mathieu.lai-king@universite-paris-saclay.fr
Phone (+33) 6 89 26 86 14
Summary Last year PhD Student at Université Paris Saclay, working on biomedical papers automatic quality assessment using NLP and data selection for pretraining biomedical language models.

Work

  • 2021.02 - 2021.07

    Lyon, France

    Machine Learning / NLP Engineer
    SEGULA Technologies
    End-of-study internship in R&D team.
    • Using machine learning and NLP to find recurrent problems in customer service data (vehicles malfunction, text messages from customers).
    • Implementation of NLP pipeline (preprocessing, embedding, clustering, topic modelling) to extract similar groups of problems and identify their main topics.
  • 2019.07 - 2020.06

    Berlin, Germany

    Machine Learning R&D Intern
    Fraunhofer IPK
    Gap year internship in an applied sciences research institute. Working in different teams for 2 projects.
    • 1st project -> using Machine Learning / NLP / Computer Vision to extract and compare product requirements
    • 2nd project -> Web development using Java + JSF for internal project management application.

Education

  • 2022.02 - 2025.04

    Paris, France

    PhD
    Université Paris Saclay
    Natural Language Processing (NLP)
    • Research on biomedical papers automatic quality assessment using NLP and data selection for pretraining biomedical language models, supervised by Patrick Paroubek and Thierry Hamon
    • Participated to ALPS Winter School 2023, TALN 2023, ACL 2024, presented a poster at ACL
    • 3 Publications at n2c2 Shared Task, BioNLP@ACL2024, Revue TAL 65.2
  • 2017.09 - 2021.07

    Lyon, France

    MSc. ('Diplôme d'ingénieur')
    CPE Lyon
    Computer Science
    • Specialization in Big Data and Software conception
    • Gap year between July 2019 and June 2020
    • Relevant Courses -> Machine Learning, Deep Learning, Big Data hackathon, Information systems architecture, Design Patterns, Mobile Development, Data Mining

Certificates

Cambridge C1 Advanced
Cambridge University Press & Assessment 2020-12-20

Skills

General Skills
Statistics
Machine Learning
Deep Learning
Natural Language Processing
Computer Science
Mobile development : Android
Object Oriented Programming, Design patterns
Python,Java,Bash,C,C++
DataBases : MySQL, PostGreSQL
Web Development : React, NodeJs, SpringFramework
Git
NLP / Deep Learning / Machine Learning
Pytorch, Transformers, vLLM, W&B, accelerate, deepspeed, spacy, stanza, nltk, Sentence Transformers, datasets, scikit-learn

Languages

French
Native speaker
English
Fluent
Japanese
Beginner

Interests

Natural Language Processing
Medical-domain NLP methods to improve/assess medical text data quality
Pretraining Data Selection
Agents

References

Patrick Paroubek, PhD, HDR
Research Engineer at CNRS. Corpus linguistics, NLP evaluation, text analysis, sentiment analysis, chatbots
Thierry Hamon, PhD, HDR
Lecturer at Sorbonne Paris Nord Université.