About

I work on ML systems where research ideas meet production constraints. My focus is on GenAI pipelines, inference efficiency, and first-principles ML implementations.

Selected Projects

Stacks

Python
PyTorch
Transformers
AsyncIO
Docker
FastAPI
Django
PostgreSQL
vLLM
TensorRT

Profiles