My Blog.

Thoughts on AI engineering, backend architecture, and cloud infrastructure.

Building a Hybrid RAG Pipeline with BM25, pgvector & Cohere Reranking

March 2025

A deep dive into combining sparse and dense retrieval with cross-encoder reranking to build a production-grade RAG system.

RAG, LLM, pgvector, Cohere

Agentic Workflows with LangGraph: Tool Calling & Persistent Checkpointing

February 2025

How we architected multi-step agentic workflows using LangGraph with tool-calling, cost tracking, and state persistence.

LangGraph, AI Agents, FastAPI

Multi-Tenant SaaS on Azure: From VNet to Multi-Schema PostgreSQL

January 2025

Lessons learned deploying an EU-compliant multi-tenant AI SaaS: private VNets, managed identities, Key Vault, and Bicep IaC.

Azure, DevOps, PostgreSQL, IaC