
Module 16: RAG Basics — Chunking and Retrieval

Syllabus on RAG foundations — building a baseline RAG system, choosing embedding models and chunking strategies, and implementing hybrid retrieval with BM25, SPLADE, and ColBERT-style multi-vector retrieval.

May 28, 2026 at 12:09 PM

Topics You Will Master

  • Building a baseline (vanilla) RAG system from first principles
  • Choosing embedding models for retrieval
  • Chunking strategies and their effect on retrieval quality
  • Sparse retrieval with BM25 and learned-sparse SPLADE
  • Multi-vector late-interaction retrieval (ColBERT-style)

Best For

Engineers implementing their first production-quality retrieval pipeline.

Expected Outcome

The ability to assemble a baseline RAG system and reason about embedding, chunking, and hybrid-retrieval choices.

Module Overview

This module covers the core RAG building blocks. It starts from a vanilla retrieve-then-generate pipeline, then examines the two decisions that most affect quality: embedding model selection and chunking. It closes with hybrid retrieval, which combines lexical signals (BM25, SPLADE) with dense signals, and with ColBERT-style multi-vector late interaction.
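As a rough illustration of the retrieve-then-generate flow, the sketch below ranks chunks against a query and assembles a prompt. It is a minimal sketch under stated assumptions, not a reference implementation: `embed` is a toy bag-of-words stand-in for a real embedding model, and `retrieve` and `build_prompt` are hypothetical helper names. The generation step (the LLM call) is deliberately left out.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query: str, context: list[str]) -> str:
    """Assemble the prompt that would be sent to a generator model."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"


chunks = [
    "BM25 ranks documents by term frequency and inverse document frequency.",
    "ColBERT scores a document by late interaction over token embeddings.",
    "Chunk overlap repeats text across neighbouring chunks.",
]
query = "how does BM25 rank documents"
top = retrieve(query, chunks, k=1)
prompt = build_prompt(query, top)
print(prompt)
```

In a real pipeline the toy `embed` would be replaced by an embedding model and the sorted scan by a vector store lookup; the overall shape (embed, rank, assemble context, generate) stays the same.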

Learning Objectives

  • Describe the components of a vanilla RAG pipeline.
  • Select an embedding model appropriate to a corpus and query type.
  • Choose chunking strategies and explain their impact on context quality.
  • Compare BM25, SPLADE, and ColBERT-style retrieval.
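To make the chunking objective concrete, here is a minimal sketch of fixed-size chunking with overlap, assuming the text has already been split into words. The function name `chunk_fixed` and its parameters are illustrative choices, not part of any particular library; recursive and semantic chunking follow different splitting rules and are not shown.

```python
def chunk_fixed(words: list[str], size: int, overlap: int) -> list[list[str]]:
    """Split a word list into windows of `size` words, where each window
    repeats the last `overlap` words of the previous one."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(words[start:start + size])
        if start + size >= len(words):
            break  # the last window already reaches the end of the text
    return chunks


words = "a b c d e f g h i j".split()
chunks = chunk_fixed(words, size=4, overlap=1)
for c in chunks:
    print(" ".join(c))
```

The overlap trades index size for context continuity: a sentence that straddles a boundary appears in both neighbouring chunks, so either one can be retrieved with the relevant text intact.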

Topics Covered

Foundations of RAG

  • Vanilla RAG
  • Embedding models for retrieval
  • Chunking strategies
  • BM25 (lexical sparse retrieval)
  • SPLADE (learned sparse retrieval)
  • Multi-vector ColBERT (late-interaction retrieval)
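For the lexical side of the list above, the following is a small sketch of BM25 scoring over pre-tokenized documents. It assumes one common formulation (the smoothed IDF `ln(1 + (N - n + 0.5) / (n + 0.5))` with defaults `k1=1.5`, `b=0.75`); libraries differ in the exact IDF variant and defaults, so treat the numbers as illustrative. `bm25_scores` is a hypothetical helper name.

```python
import math
from collections import Counter


def bm25_scores(query: list[str], docs: list[list[str]],
                k1: float = 1.5, b: float = 0.75) -> list[float]:
    """Score every tokenized document against a tokenized query with BM25."""
    if not docs:
        return []
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: in how many documents each term appears.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Length normalization: longer documents need more hits to score as high.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = [
    "sparse retrieval with bm25".split(),
    "dense retrieval with embeddings".split(),
    "late interaction with colbert".split(),
]
scores = bm25_scores(["bm25", "retrieval"], docs)
print(scores)
```

SPLADE keeps the same sparse, inverted-index-friendly scoring shape but replaces the hand-designed term weights with weights predicted by a model, which is why the two are often grouped as sparse retrievers.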

Key Concepts & Terminology

Retrieve-then-generate, fixed vs recursive vs semantic chunking, chunk overlap, lexical vs dense vs hybrid retrieval, MaxSim late interaction.
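The MaxSim late-interaction score can be sketched in a few lines: each query token vector is matched to its most similar document token vector, and those maxima are summed. This is a minimal pure-Python sketch with hand-written 2-dimensional vectors standing in for model-produced token embeddings; `maxsim` is an illustrative name, and it assumes each document has at least one token vector.

```python
def dot(u: list[float], v: list[float]) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))


def maxsim(query_vecs: list[list[float]], doc_vecs: list[list[float]]) -> float:
    """Late-interaction score: for every query token, take its best-matching
    document token, then sum those best matches."""
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)


# Toy token embeddings (assumed to be unit-normalized).
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 1.0]]  # covers both query tokens
doc_b = [[1.0, 0.0], [1.0, 0.0]]  # covers only the first query token

print(maxsim(query, doc_a), maxsim(query, doc_b))
```

Unlike single-vector dense retrieval, the document is never collapsed into one embedding, so a document that matches every query token scores higher than one that matches a single token strongly.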

Tools & Frameworks Referenced

BM25, SPLADE, ColBERT-style multi-vector retrieval, vector stores.

Prerequisites

Module 14 (embeddings) and Module 15 (LangChain for orchestration).
