The first article on this blog explained how it was built in 30 minutes with Claude Code. Naturally, a blog needs comments. Same constraints: no database, no external dependencies, no Disqus tracking visitors. Just PHP + JSON files. Built in one session with Claude Code — the interesting part wasn't the code, it was the security audit that followed. A comment system without a database seems trivia
When building applications with large language models (LLMs), one of the most overlooked costs is how structured data is represented. Most systems use JSON. And JSON is inefficient for LLM input. KODA (Knowledge-Oriented Data Abstraction) is a schema-first data format designed to reduce token usage when sending structured data to LLMs. It works by: Defining structure once (schema-first) Encoding v
A Haystack pipeline can be perfectly wired and still unsafe. The retriever returns documents. Every component did its job. But if untrusted text moved through the pipeline as ordinary context, the trust boundary was lost. That is the problem this post is about. Not bad Python. A valid component connection only says: this value fits the next component It does not say: this value is safe to influen
Comparison: Haystack 2.0 vs. RAGatouille 0.3 for Building High-Accuracy RAG Pipelines for Developer Docs Retrieval-Augmented Generation (RAG) has become the standard for building LLM-powered tools that answer questions using private or domain-specific data. For developer documentation (dev docs) — which includes technical jargon, versioned APIs, code snippets, and structured reference material —