Skip to content
View aliivaezii's full-sized avatar
✔️
Available
✔️
Available

Highlights

  • Pro

Organizations

@MAL-TO

Block or report aliivaezii

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aliivaezii/README.md

Ali Vaezi

MSc student at Politecnico di Torino, conducting thesis research at Eindhoven University of Technology. Applied ML and reinforcement learning researcher with five years of industrial analytics experience in automotive supply chain. Based in Turin, Italy.


Current Focus

  • Deep RL: sim-to-real transfer, robust policy evaluation, entropy-based exploration
  • RL for supply chain and sustainable operations decision-making under uncertainty
  • ML systems: training pipelines, REST inference, containerized deployment

Selected Projects

sim2real — Sim-to-real locomotion for MuJoCo Hopper using curriculum randomization, domain randomization, and entropy scheduling; companion paper under review at IEEE Robotics and Automation Letters.

FBWM-FTOPSIS-PPO — PPO agent benchmarked against LP/MIP and fuzzy MCDM baselines for sustainable supplier order allocation under stochastic demand.

math-llm-poc — Decoder-only Transformer trained from scratch in PyTorch on synthetic arithmetic data, served via FastAPI and Docker; 98.44% exact-match on 5,006 held-out test equations.

kpi-lens — Supply chain intelligence platform with statistical and ML anomaly detection, an LLM analyst layer, and automated Excel/PPT reporting via FastAPI and MCP.

weld-anomaly-classifier — Multimodal deep learning pipeline for weld defect classification from sensor, audio, and video streams; F1 0.957, ranked first at I3P AI Hackathon, Politecnico di Torino.


Research

My research sits at the intersection of deep reinforcement learning and operations research: learning robust policies for locomotion transfer and for supply chain decision-making under uncertainty.

Current first-author work: sim-to-real locomotion (IEEE RA-L, under review) and a hybrid RL/MCDM framework for dynamic supplier order allocation (targeting the International Journal of Production Economics). Earlier published work on blockchain-integrated sustainable supplier selection in the Journal of Cleaner Production (52 citations).


Tech Stack

Reinforcement learning: PPO, SAC, DQN, Gymnasium, MuJoCo
ML and engineering: PyTorch, FastAPI, Docker, Python, SQL
Optimization: Gurobi, MILP, fuzzy MCDM


Contact

GitHub Pages · LinkedIn · Google Scholar · ali.vaezi@studenti.polito.it

Pinned Loading

  1. sim2real sim2real Public

    Forked from josephfayyaz/RL

    Sim-to-Real reinforcement learning for MuJoCo Hopper locomotion with domain randomization, curriculum learning, and entropy scheduling

    Python 25

  2. Smart-Precision-Irrigation Smart-Precision-Irrigation Public

    IoT-based smart precision irrigation system using Raspberry Pi, MQTT microservices, and real-time weather-aware scheduling with Telegram alerts

    Python 25

  3. Turin-MicromobilityAnalysis Turin-MicromobilityAnalysis Public

    Spatial-temporal analysis of 2.5M+ e-scooter trips in Turin: operator comparison, public transport integration, and optimal mode choice

    Python 27

  4. weld-anomaly-classifier weld-anomaly-classifier Public

    WeldFusionNet: Multimodal deep learning for automated weld defect classification using sensor, audio, and video data (F1: 0.9567)

    Python 27

  5. RL-Mobility-Optimizer RL-Mobility-Optimizer Public

    MoveWise: Reinforcement Learning for Behaviorally-Aware Mobility-as-a-Service. Deep Q-Network engine with React/Three.js frontend for sustainable urban transport nudging. NEXUS 2026 — Politecnico d…

    HTML 26