About

About

Hi, I'm a backend developer focused on AI engineering, Go services, and cloud cost optimization.

I write about the real problems I run into while building production systems — LLM API cost spikes, goroutine leaks, Cloud Run scaling issues, and everything in between.

No fluff, just the debugging stories and fixes that actually worked.

→ Email: wnsgh600@gmail.com

Comments

Popular posts from this blog

Optimizing LLM API Latency: Async, Streaming, and Pydantic in Production

How I Built a Semantic Cache to Reduce LLM API Costs

How I Squeezed LLM Inference onto a Raspberry Pi for Local AI