Yousef Radwan yradwan147

👋 About

AI engineer focused on agentic systems, applied ML research, and auditable production pipelines. Currently building healthcare-triage + multi-agent workflows, fine-tuning small LLMs (GRPO + LoRA), and shipping research code that re-runs end-to-end. I care about the reproducibility-vs-velocity trade-off — every artefact in this profile re-executes from a single command.

location:   "🇸🇦 KAUST, Saudi Arabia  ←→  🇪🇬 Cairo, Egypt"
education:  "MS Tech, Innovation & Entrepreneurship — KAUST"
working_on: ["agentic LLM workflows", "small-model fine-tuning (GRPO + LoRA)",
             "multi-modal moderation", "applied research papers"]
publishing: ["NeurIPS 2026 — V-axis emotion centroids",
             "NeurIPS 2026 — Cross-Architecture Substrate",
             "INFOCOM / GSMA telecom-LLM track (×4 papers)"]
shipping:   "EdGame — K-12 stealth-assessment learning games"

🎯 Featured projects

🏥 Healthcare Triage — Capstone Synthesis

End-to-end breast-cancer screening triage workflow that integrates a tuned RandomForest classifier (P3), a smolagents-style audit layer (P6), and a vendored CNN feature hook (P4). Zero malignant cases missed in the 114-case held-out cohort at the screening threshold.

PyTorch sklearn OpenAI joblib

🪙 Beaver's Choice Multi-Agent System

Five-agent smolagents orchestrator-worker system for a paper-supply company: inventory + quoting + sales + finance workers behind a customer-facing orchestrator. Wraps 7 SQLite helpers as @tools; produces an audited cash + inventory ledger on quote_requests_sample.csv.

smolagents gpt-4o-mini SQLAlchemy pandas

🧠 UdaPlay — RAG + Web Fallback

ChromaDB-backed RAG agent over 15 video-game JSON records with LLM-as-judge retrieval evaluation and a Tavily web-search fallback. Three demo queries; the third correctly delegates to web search and cites a Wikipedia URL.

ChromaDB OpenAI Tavily pydantic

🧪 GRPO + LoRA Fine-Tuning of Qwen2.5-3B

Reinforcement-learning fine-tune of Qwen2.5-3B-Instruct on a chain-of-thought letter-counting task using GRPO with LoRA adapters. Demonstrates training-time reward shaping on a small model.

Hugging Face TRL LoRA Qwen transformers

🚀 AgentsVille Trip Planner

CoT + ReAct travel-itinerary system. A Chain-of-Thought planner emits a strict Pydantic-validated TravelPlan; an Itinerary-Revision agent runs a THINK→ACT→OBSERVE loop over four tools with a run_evals_tool-before-final_answer_tool exit invariant.

OpenAI pydantic json-repair

🛰️ NASA Apollo & Challenger RAG Chat

Retrieval-augmented chat over Apollo 11, Apollo 13, and Challenger mission documents. Vector store + reranker + cited answers. Gradio UI; FastAPI backend.

LangChain ChromaDB Gradio FastAPI

🛡️ Multimodal Content Moderation

Pipeline that moderates text, image, audio, and video with pydantic-ai, a streaming Gradio chat UI, and a FastAPI service layer. Structured outputs end-to-end.

pydantic-ai FastAPI Gradio Whisper

🎮 EdGame — Stealth-Assessment Learning Games

Production K-12 ed-tech platform: 5 KAPLAY.js games, ECD (Evidence-Centered Design) analytics, 90K+ event samples. Built for a startup; live in classrooms.

KAPLAY.js Node.js PostgreSQL React

Tip

Full project list — the Udacity AI Mastery Capstone spans 8 chapters (cd001-p1 … cd001-p8); the Agentic AI Nanodegree spans 4 (nd900-p1 … nd900-p4); research code lives under paper1_* … paper4_* and the vaxis-paper / substrate-paper NeurIPS submissions.

🧰 Tech stack

Languages

ML, deep learning, generative AI

Agentic AI & LLM tooling

Data & backend

Web, mobile, ed-tech

DevOps, infra, tooling

📊 GitHub stats

📚 Publications & research

Year	Venue	Title (repo / link)
2026	NeurIPS (under review)	Nine Emotion Centroids — A Label-Free Valence Axis Across Four Modalities
2026	NeurIPS (under review)	The Cross-Architecture Substrate
2026	Telecom-LLM track	Three Levers to Make LLMs Configure 5G Networks (catalog grounding + LoRA + RAG)
2026	INFOCOM track	Geometric V-Metric Instrumentation on Telecom Control Substrates
2026	Wireless control	Rate-Distortion Characterization of a 6-Bit VQ Codec (LLM + linear baselines)
2026	GSMA benchmarking	max_tokens × Prompt-Length Confound in Telecom-MCQ LLM Benchmarking
2025	Frontiers in Human Neuroscience	Stochasticity as a Solution for Overfitting (EEG inner-speech classification)
2025	Scientific Data	ArEEG — Arabic Inner Speech EEG Dataset
2024	EUROCAST	Symbolic Regression — Genetic Programming vs ML/DL
2023	3ICT Conference	Smart Attendance Using BLE

🤝 Connect

_{Reproducibility is a feature, not a constraint.}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly