Thoughts on AI engineering, backend architecture, and cloud infrastructure.
A deep dive into combining sparse and dense retrieval with cross-encoder reranking to build a production-grade RAG system.
How we architected multi-step agentic workflows using LangGraph with tool-calling, cost tracking, and state persistence.
Lessons learned deploying an EU-compliant multi-tenant AI SaaS — private VNets, managed identities, Key Vault, and Bicep IaC.