Who
EECS at UC Berkeley, graduating May 2027. I build systems at the intersection of infrastructure engineering and applied AI — LLM inference control planes, clinical NLP pipelines, distributed course management platforms, and backend systems for health tech. Recently built mini-vLLM to make paged KV-cache, continuous batching, and prefix caching tangible on a laptop. Currently researching LLM optimization for clinical data extraction at UCSF and engineering backend infrastructure for an AGI health platform at BalanX-BIO. Incoming SWE intern at ASML (Summer 2026). Biased toward things that ship, scale, and matter.
Skills
Languages
Python · Java · C/C++ · SQL · TypeScript · JavaScript
Frameworks
FastAPI · React · Node.js · Spring Boot · Next.js · LangChain
Cloud & DevOps
AWS · Kubernetes · Docker · Terraform · CI/CD
AI & Data
PyTorch · vLLM · LoRA/QLoRA · MedSpacy · HuggingFace