Back to Works

RAG from Scratch

2026

A minimal RAG implementation built from the ground up without framework abstractions. Follows the core loop: documents → chunk → embed → store → query → retrieve → generate. The goal is to understand each step of the RAG pipeline before reaching for tools like LangChain.

What's Inside

Repository
GitHub
Platform
Python CLI
Stack
Python, NumPy, sentence-transformers, Ollama, Gemini

Architecture

RAG Pipeline Architecture