Blogs
All the blogs I've posted.
-
Why Pydantic-AI Failed with Gemini 2.5 Pro (But Direct OpenRouter Integration Succeeded)
A technical deep dive into why pydantic_ai + OpenRouter + Gemini 2.5 Pro failed to generate reliable structured outputs, and how switching to direct OpenAI client integration achieved 100% reliability for production systems.
-
EXTERNAL BLOG | Automated patient engagement with Voice AI in Healthcare
Explore how AI can automatically process referrals, extract referral data, integrate it into EHR, and schedule appointments using advanced IVR agents.
-
7 most widely used flavours of Retrieval-Augmented Generation (RAG)
Retrieval-Augmented Generation (RAG) has rapidly evolved into multiple variants, each designed to tackle different challenges of grounding LLMs in external knowledge. In this article, we break down seven of the most widely used RAG "flavors", from standard and advanced setups to graph-based, multi-modal, and self-reflective approaches, and explain their unique strengths, trade-offs, and real-world use cases.
-
AI Agents in Production - Bridging the Gaps to Reliable Systems with AWS Strands and the AWS Ecosystem
Challenges and Solutions in Moving AI Agents from PoC to Production.
-
From Conversations to Conversions - The Race to Build Real-Time Voice AI Agents and How Your Business Can Benefit
A deep dive into how voice agents are evolving: from classic speech-to-text and text-to-speech pipelines to full speech-to-speech systems, edge streaming infrastructure, and real-time actions during voice flows.
-
Bridging IVR with Conversational Voice AI for improved interactions
A complete guide to integrating Interactive Voice Response (IVR) with Conversational Voice AI, turning robotic calls into natural, human-sounding conversations.
-
LiveKit - Powering Real-Time Audio, Video, and Data
LiveKit is an open-source platform offering scalable, production-ready infrastructure for real-time audio, video, and data communication, featuring server-side agents, WebRTC routing, and tools to simplify building multi-user conferencing.
-
Inside Amazon Nova Sonic - The Event-Driven API Behind Real-Time Voice AI
A deep technical exploration of Amazon Nova Sonic's speech foundation model: how it unifies ASR, LLM, and TTS into a single, event-driven, bidirectional stream API for real-time voice interactions.