Hassan Raza
Senior engineer · backend systems · AI infrastructure

I design systems that

Backend systems, cloud architecture, and AI-powered platforms. Ask me anything about my work.

50k events/sec peak throughput
99.9% production SLOs, monitored with burn-rate alerts
28% drop in support volume via a production RAG system
7+ years shipping distributed backends at scale

Value proposition

What I Do

I help teams build production-grade systems that are fast, reliable, and easy to evolve.

  • Design and build scalable backend systems
  • Architect cloud-native applications
  • Help startups ship MVPs fast
  • Optimize performance and infrastructure cost
  • Build AI-powered applications using LLMs

Cloud infrastructure

A modern backend, visualized.

An interactive topology of the systems I design: edge, API, events, data, workers, and storage — with the trade-offs that hold them together.

Live topology (cloud.topology.v1)

  • Client: Web / Mobile
  • Load Balancer: L7 entrypoint
  • API Gateway: Auth · Rate-limit
  • Microservices: NestJS services
  • Redis: Hot path cache
  • Message Queue: Kafka / SQS
  • PostgreSQL: System of record
  • Workers: Projections · Jobs
  • S3: Blobs · Snapshots

A modern backend topology, end to end.

Click any node to see its role and the trade-offs that come with it. The dashed paths show event and request flow.

LLM systems

Where the intelligence actually comes from.

Most people see the LLM as one box. In production it's a pipeline. Here's what actually happens between a question and an answer.

How an LLM answer is actually produced.

Most people think of the LLM as a single step. In production it’s a pipeline: tokenize, retrieve, compose, generate, stream. Each stage has its own trade-offs.
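
The five stages can be sketched in a few lines. This is a minimal illustration, not a production implementation: it assumes a toy in-memory corpus, keyword overlap in place of vector search, and a placeholder in place of a real model call. Every name here (`DOCS`, `tokenize`, `retrieve`, `compose`, `generate`, `stream`, `answer`) is hypothetical.

```python
from typing import Iterator

# Hypothetical in-memory corpus standing in for a real document store.
DOCS = [
    "Redis serves as the hot path cache.",
    "PostgreSQL is the system of record.",
    "Kafka carries events between services.",
]


def tokenize(text: str) -> list[str]:
    """Toy tokenizer: lowercase words with surrounding punctuation stripped."""
    return [w.strip(".,?!").lower() for w in text.split() if w.strip(".,?!")]


def retrieve(query_tokens: list[str], docs: list[str], k: int = 1) -> list[str]:
    """Rank documents by token overlap with the query (stand-in for vector search)."""
    query = set(query_tokens)
    ranked = sorted(docs, key=lambda d: len(query & set(tokenize(d))), reverse=True)
    return ranked[:k]


def compose(question: str, context: list[str]) -> str:
    """Assemble the prompt from retrieved context plus the user question."""
    return "Context:\n" + "\n".join(context) + f"\n\nQuestion: {question}\nAnswer:"


def generate(prompt: str) -> Iterator[str]:
    """Placeholder for a model call: echoes the first context line word by word."""
    context_line = prompt.splitlines()[1]
    yield from context_line.split()


def stream(tokens: Iterator[str]) -> Iterator[str]:
    """Emit chunks as they arrive, as a server would over SSE or WebSockets."""
    for i, token in enumerate(tokens):
        yield token if i == 0 else " " + token


def answer(question: str) -> str:
    """Run all five stages and collect the streamed chunks into one string."""
    context = retrieve(tokenize(question), DOCS)
    prompt = compose(question, context)
    return "".join(stream(generate(prompt)))


if __name__ == "__main__":
    print(answer("What is the system of record?"))
```

Keeping each stage a separate function is the point of the sketch: retrieval quality, prompt size, and streaming latency can each be measured and swapped independently.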

Case studies

Problems, constraints, and the trade-offs I picked.

Each write-up follows the same shape so you can compare: problem, constraints, architecture, trade-offs, outcome.

Consulting

Work With Me

I help companies turn ideas into scalable, production-ready systems.

  • MVP development (fast and scalable)
  • System design and architecture
  • Backend optimization and scaling
  • AI integration and LLM applications

Selected projects

Things I've built and shipped.

System design

Deep dives for the curious.

Essays about distributed systems and AI infrastructure, focused on decisions and the trade-offs behind them.

Ask anything

Have a system design question?

Try the AI assistant — it explains architecture trade-offs, answers hiring questions, and points you to the right next step.

Contact

Let’s build something that scales.

If you’re hiring or planning a new backend/AI initiative, I can help with architecture, delivery, and execution.

Book a Call