Every Laravel RAG tutorial builds the same ingestion pipeline (chunk, embed, store) and stops the moment the agent answers on screen. None of them check whether retrieval is any good. But retrieval quality is decided at ingestion, before the model runs once, and four decisions there fail with no error, no exception, no failed test:
- Chunking that severs the answer mid-sentence, so
answer@1falls whilesource hit@1still looks healthy. - An HNSW index built with
vector_l2_opswhile you query with cosine<=>. Postgres silently ignores the index and scans every row. Laravel 13’s nativewhereVectorSimilarTo()hardcodes<=>, so it’s easier to hit than ever. Shown withEXPLAIN. - The embedding dimension baked into the
vector(1536)column type, so “shrink it to save storage” is a migration plus a full re-embed that quietly drops retrieval to 47%. - Ingesting and querying with different models, which turns every distance into noise.
Each bug is real code from a working repo, proven against an eval suite. It’s the prequel to my earlier “Evaluating RAG in Laravel” post: build it, prove it, tune it. Every example verified against laravel/ai v0.7.2 and pgvector, with the full repo to clone.