Author Archives: Bhawesh Kumar

Redis in Production Part 2: ElastiCache, MemoryDB, and the Self-Hosted Question

By | April 23, 2026

Prerequisites You should read Part 1: Redis in Production — Persistence, Replication, and HA first, or already understand RDB vs AOF, Redis Sentinel, and Redis Cluster mode. Basic AWS familiarity (VPCs, security groups, EC2 instance families) is expected. You don’t need prior hands-on experience with MemoryDB or ElastiCache Serverless. The Problem: “Just Use ElastiCache” Is… Read More »

Redis in Production: Architecture, Persistence, and High Availability

By | April 23, 2026

TL;DR: Redis is simple to start but tricky to run correctly at scale. This guide covers the production deployment decisions that matter: RDB vs AOF vs hybrid persistence, Sentinel vs Cluster topologies, memory eviction policies that won’t kill your app, and the self-hosted vs ElastiCache trade-off. Written for platform engineers who need Redis to stay… Read More »

Production Multi-Agent Systems: Architecture Patterns That Actually Work

By | April 23, 2026

TL;DR: Most production “AI agents” are actually deterministic workflows — and that’s fine, but the architecture decision you make right now determines whether your system costs $0.10 or $50 per request, whether it completes in 3 seconds or 3 minutes, and whether you can debug it when it fails at 2am. This post maps Anthropic’s… Read More »

AI Systems That Learn Without Being Trained: The Rise of Self-Evolving Architectures

By | January 3, 2026

We are entering an era where the most powerful AI systems are not trained—they evolve. The intelligence no longer lives in the model, but in the architecture surrounding it. The AI landscape has reached an inflection point. While the industry obsesses over model parameters and training datasets, a quiet revolution is underway: the emergence of… Read More »

Backend for Frontend (BFF) on AWS — AppSync/GraphQL + Lambda Resolvers

By | December 31, 2025

What you’ll learn The problem BFF solves If multiple clients (web, mobile, partner apps) call the same “generic” backend, you often end up with: Backend for Frontend (BFF) fixes this by creating a backend tailored to a specific frontend: BFF in one sentence A BFF is a client-specific backend that aggregates and shapes data exactly… Read More »

Saga Pattern on AWS — Distributed Transactions with Compensation

By | December 31, 2025

What you’ll learn The problem: distributed transactions without a 2PC In a monolith, you might wrap changes in a single database transaction. In microservices (or distributed systems), a single business action often spans multiple components: You can’t easily use a single ACID transaction across multiple independent datastores and services (and you generally shouldn’t). Saga is… Read More »

CQRS on AWS Demystified: From Commands to Queries with Serverless Architecture

By | December 31, 2025

What you’ll learn Video discussion on YouTube – https://youtu.be/2lFW5H3ZzLA The problem CQRS solves In many systems, we try to make one database schema do everything: As the system grows, that single model starts to hurt: CQRS (Command Query Responsibility Segregation) is a pattern that says: CQRS in one sentence CQRS separates the write path (commands)… Read More »

Your AWS Lambda Deployments Can Suddenly Stop — Here’s the Quota Nobody Talks About

By | December 31, 2025

AWS Lambda is widely loved for its simplicity, auto-scaling, and frictionless deployments. You deploy code, AWS handles the scaling, and everything “just works”. Beneath that convenience lies a largely unknown and rarely discussed limit that can suddenly bring your entire CI/CD pipeline to a halt: If your Lambda-heavy compute has a lot of functions deployed… Read More »

Building Scalable GenAI Apps on AWS: From Prototype to Production

By | November 8, 2025

TL;DR Project timeline snapshot Phase Day Milestone Kickoff 0 Backlog crosses 1,200 tickets; AI assistant charter approved. Prototype (Act I) 3 Bedrock-based helper drafting answers with lineage and manual QA. Reliability (Act II) 14 RAG + eval harness hits ≥85% correct with citations. Production hardening (Act III) 28 Guardrails, SLOs, CI/CD, and cost controls live… Read More »

Building Microservices on AWS: Lambda vs ECS Fargate

By | October 6, 2025

Selecting the right compute model is one of the biggest architectural decisions teams face on AWS. This guide contrasts serverless functions powered by AWS Lambda with containerized services running on Amazon ECS with Fargate. You’ll find side-by-side diagrams, architecture variants (including API Gateway front doors), and a comprehensive comparison of cost, performance, security, scaling, and… Read More »