Skip to content
View rehan243's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report rehan243

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
rehan243/README.md

waving hand    Typing SVG

GitHub followers  LinkedIn  Kaggle  profile visitors

divider

about me

i'm an AI/ML engineer based in the US. right now i'm building production AI systems at Reallytics.ai and Verticiti, mostly getting large language models to do useful things in the real world. not demos, actual systems with real users and real traffic.

before this i was at Afiniti and Cloud Kinetics for a few years. fraud detection, voice analytics, enterprise search. the kind of stuff that pages you at 3am when something breaks.

honestly what keeps me going is when an agent you built solves something you never explicitly told it to do. that feeling never gets old.

what i'm working on right now:

  • multi-agent systems that don't fall apart when you chain them
  • RAG pipelines that actually return relevant results
  • writing about what i learn every day, check it out here
coding

divider

featured projects  rocket

Agentic AI Workflows
8 specialized AI agents with LangChain + OpenAI function calling. multi-agent orchestration with planning loops and guardrails. the project i'm most excited about right now.

RAG Enterprise Search
production retrieval pipeline over 2TB+ data. hybrid dense+sparse search with FAISS and BM25, cross-encoder re-ranking. deployed on AWS SageMaker.

Voice AI Platform
real-time voice infrastructure handling 500+ concurrent calls. WebSockets, Kafka, VAD, streaming STT. built the sentiment analysis piece from scratch.

LLM Fine-Tuning LoRA
fine-tuning LLaMA and Mistral with LoRA/QLoRA/PEFT. 40% cheaper than hosted APIs. includes the full training loop, data pipeline, merge + quantize scripts.

RLHF LLM Optimization
full RLHF pipeline: reward model with Bradley-Terry loss, PPO trainer with KL scheduling, DPO as an alternative. 68% win rate on eval, 96% safety compliance.

Sentinel Fraud Detection
ensemble XGBoost + neural net with 650+ engineered features. Redis-backed real-time velocity scoring, SHAP explainability, Kafka alert routing.

view all repositories

divider

tech stack

not going to pretend i use everything equally. here's what i actually reach for:

tech stack

the full picture (click to expand)
daily drivers Python, PyTorch, FastAPI, Docker, Git, VS Code
LLM and GenAI LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA
data and vector FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch
cloud and MLOps AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI
ML frameworks TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX
infrastructure Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases

divider

github stats

github stats streak stats

top languages


trophies

trophies


contribution graph

contribution graph


my github contributions eating themselves

contribution snake

divider

recent writeups

i write about what i'm building and learning. nothing polished, more like notes to my future self that happen to be public.

Production Scale Fine Tuning Of Proprietary Llms

Production Scale Fine Tuning Of Proprietary Llms
2026-04-28

Automl For Time Series Forecasting

Automl For Time Series Forecasting
2026-04-28

Real Time Data Quality Monitoring For Ml

Real Time Data Quality Monitoring For Ml
2026-04-27

Automl For Hyperparameter Tuning

Automl For Hyperparameter Tuning
2026-04-27

📚 View all articles →


recent activity

💬 Commented on TypeError collecting serde allowlist for collections.abc gen in langchain-ai/langgraph (2026-04-28)

💬 Commented on RuntimeError: Cannot create non-dynamic delegate: cached_pat in Comfy-Org/ComfyUI (2026-04-28)

💬 Commented on Qodo AntiSlop scan found 46 issues across 10 recent PRs in openinterpreter/open-interpreter (2026-04-28)

💬 Commented on Faster CI suite - sub 10min in zenml-io/zenml (2026-04-28)

💬 Commented on Add AEO quality badge to README (score: 45/100) in guidance-ai/guidance (2026-04-28)

💬 Commented on [Feature] Built-in feature drift detection with alerting in feast-dev/feast (2026-04-28)

⭐ Starred tinyfish-io/agentql-mcp (2026-04-28)

⭐ Starred qiantongtech/qKnow (2026-04-28)


what i'm reading lately

stuff i've been digging into recently. mostly papers, blog posts, and rabbit holes that kept me up too late.

🔬 Retrieval-Augmented Generation (RAG) with Streaming and Real-Time Data

🔬 Production-Scale Fine-Tuning of Proprietary LLMs

🔬 AutoML for Hyperparameter Tuning

🔬 Real-time Data Quality Monitoring for ML

🔬 Fine-Tuning and Serving Small Language Models for Edge and On-Device AI

🔬 Production-Ready Retrieval-Augmented Generation (RAG) at Scale


code snippets

📌 RAG Relevance Scorer using Cross-Encoder — Production Pattern (Python) (2026-04-28)

📌 Embedding Cache with LRU Eviction — Production Pattern (Python) (2026-04-28)

📌 Kubernetes Pod Scaler for GPU Workloads — Production Pattern (Python) (2026-04-26)

🤖 Profile auto-updated on 2026-04-28 19:49 UTC

divider

if you made it this far, you should probably just say hi

connect on linkedin   follow on github

Pinned Loading

  1. Voice-AI-Platform Voice-AI-Platform Public

    Real-time voice AI infrastructure — 500+ concurrent calls, WebSockets, Apache Kafka, gRPC/C++ with CUDA. Speech-to-text, sentiment analysis, sales insights.

    Python

  2. Agentic-AI-Workflows Agentic-AI-Workflows Public

    Production AI Agents for enterprise automation — 8+ specialized agents using LangChain, OpenAI function calling, and FastAPI. Multi-agent orchestration, tool use, planning loops, guardrails.

    Python

  3. BiiView-Object-Detection BiiView-Object-Detection Public

    Real-time object detection with Meta AI Segment Anything Model (SAM) — 90% accuracy across 11M+ images and 1.1B+ segmentation masks.

    Python

  4. Digital-People-Platform Digital-People-Platform Public

    Hyper-realistic talking avatars — SadTalker lip-sync + Microsoft SpeechT5 TTS + OpenAI conversational AI. 70% realism improvement.

    Python

  5. LLM-Fine-Tuning-LoRA LLM-Fine-Tuning-LoRA Public

    Fine-tuning LLaMA-2, Mistral with LoRA, QLoRA, PEFT — 40% cost reduction vs hosted APIs. VLLM serving with CUDA optimization on AWS SageMaker.

    Python

  6. RAG-Enterprise-Search RAG-Enterprise-Search Public

    Production RAG pipeline — enterprise knowledge retrieval across 2TB+ data using LangChain, FAISS, ChromaDB, PG-Vector with cross-encoder re-ranking. Deployed on AWS SageMaker.

    Python