varuntej07

Tej varuntej07

I'm him

2 followers · 3 following

Seattle
21:23 (UTC -07:00)
https://varuntej.dev/
in/varun-tej07

Achievements

varuntej07/README.md

Founding Engineer, Full-Stack Builder, Freelancer. Aspiring Inference Engineer!

Recently deployed hot projects

Depth-wise · Pocket-Panel · Ooink-ai · BugSnap

writing

Triton Is Not CUDA in Python — It's a Tiling DSL · Why PyTorch Wastes Your GPU Memory on Purpose More at

Pinned

CUDA-Kernels Public

Cuda
vlm-inference-profiler Public

Per-component latency and VRAM profiling of Qwen2.5-VL-7B across modality-selective quantization configs on a T4 GPU

Jupyter Notebook
vllm-playground Public

Forked from micytao/vllm-playground

A web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for enterprise deployment on OpenShift…

JavaScript
ooink-ai Public

An interactive pig AI assistant that answers customer questions like a host, deployed as a tablet kiosk at Ooink Ramen Restaurant in Seattle

Dart
Aura Public

Voice-first AI accountability companion

Python
aws-nki Public

Forked from aws-neuron/nki-samples

Python