I'm a Research Engineer/Scientist based in Paris 🇫🇷
- 🎓 I studied a MEng CentraleSupélec in Paris-Saclay 🇫🇷 and the MPhil in Machine Learning and Machine Intelligence (MLMI) at the University of Cambridge (Sidney Sussex College) 🇬🇧.
- 💼 I am currently working at H Company on ultrascaling VLMs for Computer Use agents on 2k+ H100 GPUs and on training infra engineering.
- 👀 I am one of the main co-author of ColPali, a SOTA multimodal retriever for documents. [arXiv][GitHub]
- 🔬 Research interests: LLM, Multimodal, Distributed Training, Agents, Information Retrieval, RAG, Speech.
💬 Feel free to reach out to discuss research ideas (mostly active on X)!
Contact: tonywu.ai@outlook.com




