Understanding Context and Contextual Retrieval in RAGBy team_scrolltonicMarch 8, 2026 In my latest post, I how hybrid search can be utilised to significantly improve the effectiveness of a RAG pipeline. RAG, in its basic version, using…
Zero-Waste Agentic RAG: Designing Caching Architectures to Minimize Latency and LLM Costs at ScaleBy team_scrolltonicMarch 2, 2026 -Augmented Generation (RAG) has moved out of the experimental phase and firmly into enterprise production. We are no longer just building chatbots to test LLM capabilities;…