cv
Here is my resume. You can also download the PDF version by clicking the button at the top right ↗️.
Basics
Name | Mathieu Laï-king |
Label | PhD Student |
Email | mathieu.lai-king@universite-paris-saclay.fr |
Phone | (+33) 6 89 26 86 14 |
Summary | Final-year PhD student at Université Paris Saclay, working on automatic quality assessment of biomedical papers using NLP and on data selection for pretraining biomedical language models. |
Work
-
2021.02 - 2021.07 Lyon, France
Machine Learning / NLP Engineer
SEGULA Technologies
End-of-studies internship in the R&D team.
- Used machine learning and NLP to find recurring problems in customer service data (vehicle malfunctions, text messages from customers).
- Implemented an NLP pipeline (preprocessing, embedding, clustering, topic modelling) to group similar problems and identify their main topics.
-
2019.07 - 2020.06 Berlin, Germany
Machine Learning R&D Intern
Fraunhofer IPK
Gap-year internship at an applied sciences research institute. Worked in different teams on two projects.
- 1st project -> Machine Learning / NLP / Computer Vision to extract and compare product requirements.
- 2nd project -> Web development using Java + JSF for an internal project management application.
Education
-
2022.02 - 2025.04 Paris, France
PhD
Université Paris Saclay
Natural Language Processing (NLP)
- Research on automatic quality assessment of biomedical papers using NLP and on data selection for pretraining biomedical language models, supervised by Patrick Paroubek and Thierry Hamon
- Participated in ALPS Winter School 2023, TALN 2023, and ACL 2024; presented a poster at ACL
- 3 Publications at n2c2 Shared Task, BioNLP@ACL2024, Revue TAL 65.2
-
2017.09 - 2021.07 Lyon, France
MSc. ('Diplôme d'ingénieur')
CPE Lyon
Computer Science
- Specialization in Big Data and Software Design
- Gap year between July 2019 and June 2020
- Relevant Courses -> Machine Learning, Deep Learning, Big Data hackathon, Information systems architecture, Design Patterns, Mobile Development, Data Mining
Certificates
Natural Language Processing Specialization | ||
Coursera | 2021-12-01 |
Cambridge C1 Advanced | ||
Cambridge University Press & Assessment | 2020-12-20 |
Skills
General Skills | |
Statistics | |
Machine Learning | |
Deep Learning | |
Natural Language Processing |
Computer Science | |
Mobile development: Android | |
Object-Oriented Programming, Design Patterns | |
Python, Java, Bash, C, C++ | |
Databases: MySQL, PostgreSQL | |
Web Development: React, Node.js, Spring Framework | |
Git |
NLP / Deep Learning / Machine Learning | |
PyTorch, Transformers, vLLM, W&B, accelerate, DeepSpeed, spaCy, Stanza, NLTK, Sentence Transformers, datasets, scikit-learn |
Languages
French | |
Native speaker |
English | |
Fluent |
Japanese | |
Beginner |
Interests
Natural Language Processing | |
Medical-domain NLP methods to improve/assess medical text data quality | |
Pretraining Data Selection | |
Agents |
References
Patrick Paroubek, PhD, HDR | |
Research Engineer at CNRS. Corpus linguistics, NLP evaluation, text analysis, sentiment analysis, chatbots |
Thierry Hamon, PhD, HDR | |
Lecturer at Sorbonne Paris Nord Université. |