3 docs tagged with "rag"

Embeddings Deep Dive

How embedding models work, the 2026 model landscape, how to choose one, dimensions and Matryoshka, vector quantization, domain adaptation, hybrid retrieval, and the production pitfalls that quietly halve recall.

Knowledge Management with LLMs

The design axis behind RAG, just-in-time retrieval, structured note-taking, the LLM-wiki pattern, and llms.txt - where synthesized knowledge lives, who maintains it, and when to use each.

RAG (Retrieval-Augmented Generation)

How retrieval-augmented generation grounds an LLM in external knowledge using embeddings and vector databases, how it compares to fine-tuning, and the production levers that make it work.