Skip to content
View liliandoublet's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report liliandoublet

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
liliandoublet/README.md

Hi, I'm Lilian Doublet 👋

Python SQL scikit-learn Hugging Face Power BI AWS Docker FastAPI MLflow LangChain

Data Scientist & AI Engineer based in Paris.
I build production-grade NLP and ML systems, including fine-tuned French-language models, and turn technical work into clear value for business stakeholders.

🔭 Open to Data Scientist, AI Engineer and Data Consultant roles.


🛠️ Tech stack

Languages: Python, SQL, DAX

ML / NLP: scikit-learn, Hugging Face Transformers (CamemBERT), time series forecasting (ARIMA, SARIMA, LSTM)

GenAI / LLM: Retrieval-Augmented Generation (RAG), LangGraph, Anthropic Claude API, Gemini API, ChromaDB, Ollama

Data viz: Power BI (star schema modeling)

Cloud / Infra: AWS (Lambda, S3, EventBridge), Docker, FastAPI, Streamlit

MLOps: MLflow, Git, GitHub Actions (CI), pytest, pre-commit, ruff, uv


📌 Featured projects

sherlock: Production-grade French NLP pipeline. Built with uv, Pydantic, Typer CLI, loguru, MLflow, CamemBERT, pytest and GitHub Actions CI.

camembert-discours-politique: CamemBERT fine-tuned for multi-class classification of French political discourse. 15,000+ annotated texts, MLflow experiment tracking, thesis graded 87/100.

techradar: Serverless AWS agent (Lambda, S3, EventBridge) that scrapes and summarizes tech news with Google Gemini and LangGraph, delivered by email via SendGrid.

askmydocs: RAG assistant for querying your own documents. Built with ChromaDB, the Gemini API and a Streamlit / Docker stack, with an evaluation harness measuring hit rate and keyword recall.

HR-Dashboard: HR analytics dashboard built in Python during my Data Analyst internship at Crédit Mutuel Alliance Fédérale (HR Digital Transformation team, 50,000+ records).

newflights: Airfare optimization app. React / TypeScript / Vite frontend, FastAPI backend.


📫 Get in touch

LinkedIn Email Location

Pinned Loading

  1. askmydocs askmydocs Public

    RAG assistant to query your own documents, built with ChromaDB, the Gemini API/ Ollama and a Streamlit stack.

    Python 1

  2. HR-Dashboard HR-Dashboard Public

    HR analytics dashboard built in Python during my Data Analyst internship at Crédit Mutuel Alliance Fédérale (HR Digital Transformation team).

    Python

  3. newflights newflights Public

    Airfare optimization app with a React / TypeScript / Vite frontend and a FastAPI backend.

    Python

  4. sherlock sherlock Public

    Production-grade French NLP pipeline (CamemBERT) with a clean uv / Pydantic / Typer architecture, MLflow tracking and GitHub Actions CI.

    Python

  5. techradar techradar Public

    Serverless AWS agent that scrapes and summarizes tech news with the Gemini API, delivered by email.

    Python