University-style syllabus · 32 modules

GenAI Syllabus

A structured curriculum for Production LLM Engineering — Transformer foundations, fine-tuning and alignment, multimodal and speech AI, RAG and retrieval engineering, agentic systems, and prompt/context engineering. Each module lists topics, learning objectives, and the tools and frameworks referenced — concept-first, not a coding tutorial.

Sub-category·3 modules

Transformers and Architecture

Attention, tokenization, encoder/decoder families, fast inference, and scaling laws.

Sub-category·7 modules

LLM Fine-Tuning and Alignment

Pretraining lifecycle, SFT, PEFT, preference alignment, quantization, MoE, reasoning models, and SLMs.

M04Pre-TrainingPost-Training

Module 04: LLM Lifecycle and Pre-Training

Syllabus covering the two-phase LLM lifecycle — pre-training vs post-training, why base models need adaptation, continued pre-training for domain adaptation, and multi-token prediction.

May 28, 2026 at 12:21 PMOpen module
M05Data PreparationSynthetic Data

Module 05: Datasets and Synthetic Data

Syllabus on preparing fine-tuning data — dataset formats, chat templates, loss masking, deduplication, and synthetic instruction and preference dataset generation with self-instruct and LLM-as-judge.

May 28, 2026 at 12:20 PMOpen module
M06SFTLoRA

Module 06: SFT, PEFT and Preference Alignment

Syllabus on adapting and aligning LLMs — parameter-efficient fine-tuning (LoRA, QLoRA, DoRA, AdaLoRA, LoRA+), supervised fine-tuning, and preference alignment with RLHF and DPO.

May 28, 2026 at 12:19 PMOpen module
M07EvaluationQuantization

Module 07: Evaluation, Quantization and Deployment

Syllabus covering post-fine-tuning workflows — benchmark and LLM-as-judge evaluation, quantization formats (GPTQ, AWQ, NF4, FP8, GGUF), serving with vLLM/SGLang/llama.cpp, and fine-tuning frameworks.

May 28, 2026 at 12:18 PMOpen module
M08Mixture of ExpertsMoE

Module 08: Mixture of Experts

Syllabus on Mixture of Experts — why dense models hit scaling limits, the MoE routing idea, load balancing to avoid expert collapse, training and inference trade-offs, and when MoE beats dense.

May 28, 2026 at 12:17 PMOpen module
M09Reasoning ModelsChain-of-Thought

Module 09: Reasoning Models and Chain-of-Thought

Syllabus on reasoning models — what distinguishes them from standard LLMs, chain-of-thought training, RL-only reasoning (GRPO, DeepSeek-R1-Zero), and distilling reasoning into smaller models.

May 28, 2026 at 12:16 PMOpen module
M10Small Language ModelsKnowledge Distillation

Module 10: Small Language Models and Distillation

Syllabus on Small Language Models and knowledge distillation — why SLMs matter for cost, latency, and privacy, the student-teacher paradigm, soft labels, temperature scaling, KL divergence, and attention transfer.

May 28, 2026 at 12:15 PMOpen module
Sub-category·3 modules

Vision and Speech

CNNs to ViT, visual language models, and speech-to-text with Whisper.

Sub-category·6 modules

RAG and Retrieval

Embeddings, LangChain RAG, advanced RAG patterns, vector quantization, multimodal RAG, graph RAG and security.

M14EmbeddingsMatryoshka

Module 14: Embedding Models and Matryoshka Tuning

Syllabus on embedding models — the taxonomy from dense to binary, multi-vector embeddings, Matryoshka Representation Learning for flexible dimensions, and embedding fine-tuning for domain retrieval.

May 28, 2026 at 12:11 PMOpen module
M15LangChainLCEL

Module 15: LangChain for Production RAG

Syllabus on LangChain as the orchestration framework for production RAG — LCEL, integrations, prompting and structured output, memory and retrieval, agents and multimodal, observability and security.

May 28, 2026 at 12:10 PMOpen module
M16RAGChunking

Module 16: RAG Basics — Chunking and Retrieval

Syllabus on RAG foundations — building a baseline RAG system, choosing embedding models and chunking strategies, and implementing hybrid retrieval with BM25, SPLADE, and ColBERT-style multi-vector retrieval.

May 28, 2026 at 12:09 PMOpen module
M17Advanced RAGRerankers

Module 17: Advanced RAG — Rerankers and Adaptive Retrieval

Syllabus on advanced RAG — query transformations, rerankers, self-correcting and adaptive retrieval patterns, contextual retrieval, systematic RAG evaluation, and agentic RAG.

May 28, 2026 at 12:08 PMOpen module
M18Vector QuantizationMultimodal RAG

Module 18: Vector Quantization and Multimodal RAG

Scale vector search with quantization (scalar, binary, product) and retrieve over visually rich documents with the ColPali multimodal RAG paradigm — no OCR required.

May 28, 2026 at 12:07 PMOpen module
M19Graph RAGNeo4j

Module 19: Graph RAG, Caching and RAG Security

Round out a production RAG system — knowledge-graph retrieval with Graph RAG, vectorless patterns, semantic caching, and the security layer (PII masking, input/output guardrails, prompt-injection defence).

May 28, 2026 at 12:06 PMOpen module
Sub-category·4 modules

Agents and Multi-Agent Systems

Function calling, MCP, LangGraph, A2A protocol, observability, and Bedrock AgentCore deployment.

Sub-category·3 modules

Prompting, Context and Evaluation

Prompt engineering, context engineering, and evaluation harnesses with agent CI/CD.

Sub-category·6 modules

Capstone Projects

End-to-end projects integrating fine-tuning, distillation, RAG, multi-agent systems, speech, and LLMOps.

P01CapstoneFine-Tuning

Project 01: ClinicLLM — Medical LLM Fine-Tuning Pipeline

Build a domain-specific medical language model with QLoRA SFT and DPO preference alignment, then serve it with multiple hot-swappable LoRA adapters.

May 28, 2026 at 12:00 PMOpen module
P02CapstoneKnowledge Distillation

Project 02: TinyReason — Distilling Reasoning to a CPU Model

Compress a larger reasoning teacher into a small student using KL divergence and attention transfer, then quantize to GGUF for cost-efficient CPU inference.

May 28, 2026 at 12:00 PMOpen module
P03CapstoneRAG

Project 03: LegalRAG — Hybrid Multi-Modal + Graph RAG for Contracts

Build a production legal-document intelligence system combining ColPali multimodal indexing (no OCR), a Neo4j knowledge graph, hybrid retrieval with RRF, cross-encoder reranking, and RAGAS-gated evaluation.

May 28, 2026 at 12:00 PMOpen module
P04CapstoneMulti-Agent

Project 04: DevOpsCrew — Multi-Agent DevOps with HITL and A2A

Build a production DevOps assistant with a LangGraph supervisor delegating to specialised sub-agents over MCP and A2A, with human-in-the-loop gating every write.

May 28, 2026 at 12:00 PMOpen module
P05CapstoneLLMOps

Project 05: EvalShip — Eval-Gated CI/CD with Auto-Rollback

Wrap all prior projects in a production LLMOps shell where every code or prompt change must pass eval-gated CI stages before deployment, with blue/green and auto-rollback.

May 28, 2026 at 12:00 PMOpen module
P06CapstoneWhisper

Project 06: VoiceTrack — Whisper Fine-Tuning and Production STT Pipeline

Fine-tune Whisper on domain-specific audio and ship a production STT service with streaming transcription, diarisation hooks, and an evaluation gate on WER.

May 28, 2026 at 12:00 PMOpen module