Hello 👋

💻 I am a research scientist at NuMind, working on structured information extraction with language models. I focus on model evaluation and multimodality (text, images, PDFs...).
🎸 Before that, I worked on music generation at the Metacreation Lab and at Sorbonne University, where I got my PhD. I notably worked on the tokenization of symbolic music, created the MidiTok library, which I still maintain, and helped build and release the GigaMIDI dataset.
🌴 I can be contacted on the platforms linked above.

Research interests:

Natural language generation (NLG)
Discrete generative models
Music generation