In Q3 2024, 72% of production RAG pipelines failed to meet p99 latency SLAs for multimodal queries, according to a Datadog survey of 1,200 engineering teams. Most blamed fragmented toolchains for text and image retrieval—until Stable Diffusion 3.0’s embedding API and Llama 4’s 1M-token context window changed the game. This is the definitive guide to building unified multimodal RAG pipelines that c
I spent the long weekend pushing Logic Apps MCP server capabilities further than I had before — and hit two bugs worth documenting. Both are filed. If you're building in this space, save yourself the debugging time. If you've been following along, the MCP server and BODMAS Agent are covered in the previous posts. This post is just about what broke when I wired them together. The Agent Loop fails w