A Haystack pipeline can be perfectly wired and still unsafe. The retriever returns documents. Every component did its job. But if untrusted text moved through the pipeline as ordinary context, the trust boundary was lost. That is the problem this post is about. Not bad Python. A valid component connection only says: this value fits the next component It does not say: this value is safe to influen
Apache Stateful Functions is one of the quietly powerful frameworks in the Flink ecosystem - durable per-key state, exactly-once messaging, polyglot remote functions, all on top of Apache Flink. It's also been functionally dormant since October 2024, and it doesn't run on Flink 2.x. We needed it on Flink 2.x. So we maintained the continuation: kzmlabs/flink-statefun. This post is the why and the h
Comparison: Haystack 2.0 vs. RAGatouille 0.3 for Building High-Accuracy RAG Pipelines for Developer Docs Retrieval-Augmented Generation (RAG) has become the standard for building LLM-powered tools that answer questions using private or domain-specific data. For developer documentation (dev docs) — which includes technical jargon, versioned APIs, code snippets, and structured reference material —