Fixed-length chunking requires no external services, yet semantic chunking absolutely needs an Embedding API — why? The core idea of semantic chunking is to split text at semantic boundaries. Determining whether "two pieces of text belong to the same topic" requires converting text into vectors and computing similarity — that's exactly what the Embedding API does. Dimension Fixed-Length / Recur
SQL is widely known for data querying and manipulation but systems do grow; data becomes larger; processes become repetitive and operations become sensitive. SQL has some features which enables it to be considered a fully fledged programming language. Some of the features which I discuss in this article are procedures, functions and transactions. Each of these concepts serve distinct purposes. Sto
Hi 👋, In this post we shall explore Bedrock's structured KB with this architecture: Upload CSVs to S3 > SNS Queue > Crawl data with Glue > Query with Redshift > Bedrock KB > Query with LLM. Let's do some of this with code. Let's get started. Clone the repo and switch to the project directory. git clone [email protected]:networkandcode/networkandcode.github.io.git cd structured-kb-demo/ Do a uv sync
Subqueries vs. CTEs in SQL: A Practical Guide to Writing Cleaner, Smarter Queries Whether you're just getting comfortable with SQL or leveling up your data skills, two tools will come up again and again when working with complex queries: subqueries and Common Table Expressions (CTEs). They solve similar problems — breaking a complex query into manageable pieces — but they do it in different ways
Why Does Switching Embedding Models Make Such a Huge Difference? In the first four articles, we built the RAG pipeline, tuned parameters, and mastered chunking strategies. But there's one question we haven't dived into: After your documents are chunked, how do they become vectors? This process is called Embedding. It transforms human-readable text into machine-computable vectors. The choice of E
In a previous post, I explored Codd's connection trap in PostgreSQL and MongoDB — the classic pitfall where joining two independent many-to-many relationships through a shared attribute produces spurious combinations that look like facts but aren't. The example followed Codd's 1970 suppliers–parts–projects model: we know which suppliers supply which parts, and which projects use which parts, but j
Automating Hermitage to see how transactions differ in MySQL and MariaDB
Barman – Backup and Recovery Manager for PostgreSQL