I design systems that
Backend systems, cloud architecture, and AI-powered platforms. Ask me anything about my work.
What I Do
I help teams build production-grade systems that are fast, reliable, and easy to evolve.
A modern backend, visualized.
An interactive topology of the systems I design: edge, API, events, data, workers, and storage — with the trade-offs that hold them together.
Click any node to see its role and the trade-offs that come with it. The dashed paths show event and request flow.
Where the intelligence actually comes from.
Most people see the LLM as one box. In production it's a pipeline. Here's what actually happens between a question and an answer.
Most people think of the LLM as a single step. In production it’s a pipeline: tokenize, retrieve, compose, generate, stream. Each stage has its own trade-offs.
Problems, constraints, and the trade-offs I picked.
Each write-up follows the same shape so you can compare: problem, constraints, architecture, trade-offs, outcome.
- 2025Scaling an Order Pipeline to 50k Events/Sec
The existing order pipeline was a synchronous chain of HTTP calls. At peak traffic, tail latency and failures cascaded: a slow downstream caused order timeouts, duplicate writes from naive retries, and intermittent data inconsistency. The business needed to grow 10x without growing incidents.
- 2025RAG That Actually Works in Production
Support and product teams were drowning in repetitive questions that were already answered in our docs, tickets, and runbooks. Earlier RAG attempts were 'demo-great, production-bad': plausible answers, frequent hallucinations, no trust signals.
- 2024Multi-Tenant Isolation on AWS
A single noisy customer could degrade the platform for everyone. We needed isolation that didn't explode costs, plus a story for running the biggest customers on dedicated infrastructure.
Work With Me
I help companies turn ideas into scalable, production-ready systems.
- MVP development (fast and scalable)
- System design and architecture
- Backend optimization and scaling
- AI integration and LLM applications
Things I've built and shipped.
Event-Driven Commerce Platform
A Kafka + NestJS backbone for a high-throughput commerce product.
Retrieval-Augmented Knowledge Assistant
A production RAG system over product documentation and support data.
Deep dives for the curious.
Essays about distributed systems and AI infrastructure. Focused on decisions and the trade-offs that make them.
Event-Driven Backbones with Kafka
When to reach for Kafka, what the outbox pattern really buys you, partitioning for ordering, and the things that break in year two.
Designing a RAG Pipeline for Production
Chunking, hybrid retrieval, reranking, grounding with citations, and the evals that separate demo-ware from production.
Multi-Tenant Isolation Patterns
Pooled vs siloed vs cells: picking an isolation model that matches your blast-radius budget, not the hype cycle.
Observability That People Actually Use
SLOs, burn-rate alerts, and why your dashboard graveyard is a product problem, not a tooling one.
Have a system design question?
Try the AI assistant — it explains architecture trade-offs, answers hiring questions, and points you to the right next step.
Let’s build something that scales.
If you’re hiring or planning a new backend/AI initiative, I can help with architecture, delivery, and execution.