layout	title	nav_order	has_children
default	DSPy Tutorial	74	true

DSPy Tutorial: Programming Language Models

Learn to program language models declaratively with DSPy, the Stanford NLP framework for systematic prompt optimization and modular LLM pipelines.

🧠 Declarative LLM Programming Framework

🎯 What is DSPy?

DSPy is a framework for algorithmically optimizing LM prompts and weights, developed by researchers at Stanford NLP. Unlike traditional prompt engineering, DSPy lets you program LMs declaratively: you specify what you want to accomplish, and DSPy's optimizers search for prompts and model configurations that achieve it.

DSPy vs Traditional Approaches

Aspect	Traditional Prompting	DSPy
Approach	Manual prompt crafting	Algorithmic optimization
Optimization	Trial and error	Automatic prompt tuning
Composition	Hard-coded pipelines	Modular, reusable components
Evaluation	Manual testing	Systematic validation
Maintenance	Brittle, manual updates	Self-improving systems

flowchart TD
    A[Define Task] --> B[Write DSPy Program]
    B --> C[Automatic Optimization]
    C --> D[Optimized Prompts & Weights]
    D --> E[Production Deployment]

    F[Data] --> C
    G[Metrics] --> C
    H[Models] --> C

    classDef dspy fill:#e1f5fe,stroke:#01579b
    classDef optimization fill:#fff3e0,stroke:#ef6c00
    classDef result fill:#e8f5e8,stroke:#1b5e20

    class A,B dspy
    class C optimization
    class D,E result

Current Snapshot (auto-updated)

repository: stanfordnlp/dspy
stars: about 32.8k
latest release: 3.1.3 (published 2026-02-05)

Core Concepts

Signatures

Signatures define the input/output behavior of your LM calls. They specify what information goes in and what comes out.

import dspy

# Basic signature
class BasicQA(dspy.Signature):
    """Answer questions with short factoid answers."""

    question = dspy.InputField()
    answer = dspy.OutputField(desc="often between 1 and 5 words")

# Advanced signature with instructions
class GenerateAnswer(dspy.Signature):
    """Answer questions with evidence. Use the provided context to justify your answer."""

    context = dspy.InputField(desc="facts here are assumed to be true")
    question = dspy.InputField()
    answer = dspy.OutputField(desc="natural language answer, with evidence")
    evidence = dspy.OutputField(desc="supporting facts from context")
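
Besides class-based signatures, DSPy accepts an inline string shorthand such as "question -> answer". The sketch below is not DSPy's parser; it is a small standard-library illustration of the idea that a signature is a named list of inputs and a named list of outputs. The names `SignatureSpec` and `parse_signature` are hypothetical and exist only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SignatureSpec:
    """Illustrative stand-in for a signature: named inputs and named outputs."""
    inputs: tuple
    outputs: tuple


def parse_signature(shorthand: str) -> SignatureSpec:
    """Split 'a, b -> c' into input and output field names (hypothetical helper)."""
    if shorthand.count("->") != 1:
        raise ValueError("expected exactly one '->' in the signature string")
    left, right = shorthand.split("->")
    inputs = tuple(name.strip() for name in left.split(",") if name.strip())
    outputs = tuple(name.strip() for name in right.split(",") if name.strip())
    if not inputs or not outputs:
        raise ValueError("a signature needs at least one input and one output")
    return SignatureSpec(inputs, outputs)


# Same field layout as the GenerateAnswer class above
spec = parse_signature("context, question -> answer, evidence")
print(spec.inputs)   # ('context', 'question')
print(spec.outputs)  # ('answer', 'evidence')
```

The class form is usually the better choice once fields need descriptions or the task needs a docstring instruction; the shorthand suits quick experiments.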

Modules

Modules are reusable components that implement specific behaviors using LMs.

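# Built-in modules (LM-backed modules take a signature)
retriever = dspy.Retrieve(k=3)  # Retrieve top-3 passages from the configured retrieval model
generator = dspy.ChainOfThought("question -> answer")  # Generate with step-by-step reasoning
comparer = dspy.MultiChainComparison("question -> answer", M=3)  # Compare M reasoning attempts and produce a final answer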

# Custom modules
class CustomQA(dspy.Module):
    def __init__(self):
        super().__init__()
        self.retrieve = dspy.Retrieve(k=3)
        self.generate = dspy.ChainOfThought(signature=GenerateAnswer)

    def forward(self, question):
        context = self.retrieve(question).passages
        answer = self.generate(context=context, question=question)
        return answer
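
The control flow of a module like CustomQA can be exercised without any model or retrieval server by swapping the sub-modules for stubs. The following is a minimal sketch under that assumption: `MiniModule`, `StubRetrieve`, `StubGenerate`, and `MiniQA` are hypothetical stand-ins written for this example, not DSPy classes. They only mirror the pattern in which sub-modules are created in `__init__` and calling the module dispatches to `forward`.

```python
from types import SimpleNamespace


class MiniModule:
    """Hypothetical base class: calling the instance dispatches to forward()."""

    def __call__(self, *args, **kwargs):
        return self.forward(*args, **kwargs)


class StubRetrieve(MiniModule):
    """Returns the first k passages of a fixed list, ignoring the query."""

    def __init__(self, passages, k=3):
        self.passages = list(passages)
        self.k = k

    def forward(self, query):
        return SimpleNamespace(passages=self.passages[: self.k])


class StubGenerate(MiniModule):
    """Produces a deterministic string instead of calling a language model."""

    def forward(self, context, question):
        return SimpleNamespace(answer=f"{len(context)} passages consulted for: {question}")


class MiniQA(MiniModule):
    """Same shape as CustomQA: retrieve first, then generate from the context."""

    def __init__(self, passages):
        self.retrieve = StubRetrieve(passages, k=2)
        self.generate = StubGenerate()

    def forward(self, question):
        context = self.retrieve(question).passages
        return self.generate(context=context, question=question)


qa = MiniQA([
    "Paris is the capital of France.",
    "Berlin is the capital of Germany.",
    "Rome is the capital of Italy.",
])
result = qa("What is the capital of France?")
print(result.answer)  # 2 passages consulted for: What is the capital of France?
```

Stubbing this way checks only the wiring between steps; answer quality still has to be measured against a real model.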

Optimizers

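Optimizers automatically improve your DSPy programs by tuning their prompts (instructions and few-shot demonstrations) and, for some optimizers, model weights, against a metric and a set of training examples that you supply.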

# Optimizers (formerly called teleprompters) tune prompts.
# BootstrapFewShot collects few-shot demonstrations that pass the metric.
teleprompter = dspy.BootstrapFewShot(metric=my_metric, max_bootstrapped_demos=3)
optimized_program = teleprompter.compile(program, trainset=trainset)

# MIPROv2 optimizes instructions and demonstrations; auto selects a preset search budget
mipro_optimizer = dspy.MIPROv2(metric=my_metric, auto="light")
optimized_program = mipro_optimizer.compile(program, trainset=trainset)
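
The snippet above leaves `my_metric` and `trainset` undefined. DSPy metrics are plain functions called as `metric(example, pred, trace=None)`. The sketch below shows one such metric and a deliberately simplified version of the bootstrapping idea: run the program on training examples and keep the ones whose prediction passes the metric as demonstrations. It is an illustration under stated assumptions, not the library's implementation, which does considerably more. `bootstrap_demos`, `toy_program`, and the lookup table are hypothetical, and `SimpleNamespace` stands in for DSPy's example and prediction objects.

```python
from types import SimpleNamespace


def exact_match(example, pred, trace=None):
    """Metric in the (example, pred, trace=None) shape: case-insensitive exact match."""
    return example.answer.strip().lower() == pred.answer.strip().lower()


def bootstrap_demos(program, trainset, metric, max_demos=3):
    """Keep up to max_demos examples whose prediction passes the metric (simplified)."""
    demos = []
    for example in trainset:
        if len(demos) >= max_demos:
            break
        pred = program(example.question)
        if metric(example, pred):
            demos.append(SimpleNamespace(question=example.question, answer=pred.answer))
    return demos


# Toy stand-in for an LM-backed program: answers come from a lookup table.
KNOWN = {"capital of france?": "Paris", "capital of italy?": "Rome"}


def toy_program(question):
    return SimpleNamespace(answer=KNOWN.get(question.lower(), "unknown"))


trainset = [
    SimpleNamespace(question="Capital of France?", answer="paris"),
    SimpleNamespace(question="Capital of Spain?", answer="Madrid"),
    SimpleNamespace(question="Capital of Italy?", answer="Rome"),
]

demos = bootstrap_demos(toy_program, trainset, exact_match, max_demos=2)
print([d.question for d in demos])  # ['Capital of France?', 'Capital of Italy?']
```

The Spain example is dropped because the toy program answers it incorrectly, which is the point of filtering by metric: only demonstrations the program got right are reused in the prompt.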

Tutorial Chapters

Chapter 1: Getting Started - Installation, basic concepts, and your first DSPy program
Chapter 2: Signatures - Defining input/output specifications for LM calls
Chapter 3: Modules - Using built-in and creating custom DSPy modules
Chapter 4: Retrieval-Augmented Generation - Building RAG systems with DSPy
Chapter 5: Optimization - Automatic prompt and model optimization
Chapter 6: Advanced Patterns - Multi-hop reasoning, tool use, and complex workflows
Chapter 7: Evaluation & Metrics - Systematic evaluation and custom metrics
Chapter 8: Production Deployment - Scaling, monitoring, and production best practices

What You'll Learn

Declarative Programming - Specify what you want, let DSPy optimize how
Automatic Optimization - Self-improving prompts and model configurations
Modular Design - Build complex systems from reusable components
Systematic Evaluation - Rigorous testing and validation frameworks
Production Deployment - Scale DSPy systems for real-world applications

Prerequisites

Python 3.9+
Basic understanding of LLMs and prompting
Familiarity with machine learning concepts (helpful but not required)
OpenAI API key or access to other LLM providers

Quick Start

# Install DSPy (the package is also published under the older name dspy-ai)
pip install -U dspy

# Set up OpenAI API key
export OPENAI_API_KEY="your-api-key"

import dspy

# Configure LM (the API key is read from the OPENAI_API_KEY environment variable)
lm = dspy.LM("openai/gpt-3.5-turbo")
dspy.configure(lm=lm)

# Define signature
class BasicQA(dspy.Signature):
    question = dspy.InputField()
    answer = dspy.OutputField()

# Create program
qa_program = dspy.Predict(BasicQA)

# Use program
result = qa_program(question="What is the capital of France?")
print(result.answer)  # e.g. "Paris"; exact wording depends on the model

Example: RAG System

import dspy

# Configure DSPy with a language model (lm) and a retrieval model (rm)
lm = dspy.LM("openai/gpt-4")
rm = dspy.ColBERTv2(url='http://20.102.90.50:2017/wiki17_abstracts')
dspy.configure(lm=lm, rm=rm)

# Define RAG signature
class GenerateAnswer(dspy.Signature):
    context = dspy.InputField(desc="may contain relevant facts")
    question = dspy.InputField()
    answer = dspy.OutputField(desc="answer with evidence")

# Create RAG program
class RAG(dspy.Module):
    def __init__(self, num_passages=3):
        super().__init__()
        self.retrieve = dspy.Retrieve(k=num_passages)
        self.generate_answer = dspy.ChainOfThought(GenerateAnswer)

    def forward(self, question):
        context = self.retrieve(question).passages
        prediction = self.generate_answer(context=context, question=question)
        return dspy.Prediction(context=context, answer=prediction.answer)

# Use RAG system
rag = RAG()
result = rag("What are the main components of a computer?")
print(result.answer)
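
The RAG example depends on a remote ColBERTv2 endpoint. For local experiments, the retrieval step can be approximated with a small in-memory retriever that exposes the same `.passages` result shape. The sketch below is an assumption-laden stand-in, not a DSPy component: `KeywordRetrieve` and `tokenize` are hypothetical names, and ranking by shared words is far cruder than ColBERTv2.

```python
import re
from types import SimpleNamespace


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


class KeywordRetrieve:
    """Ranks passages by how many distinct words they share with the query."""

    def __init__(self, corpus, k=3):
        self.corpus = list(corpus)
        self.k = k

    def __call__(self, query):
        query_terms = tokenize(query)
        scored = [(len(query_terms & tokenize(p)), p) for p in self.corpus]
        # sorted() is stable, so ties keep their original corpus order
        ranked = sorted(scored, key=lambda pair: pair[0], reverse=True)
        passages = [p for score, p in ranked if score > 0][: self.k]
        return SimpleNamespace(passages=passages)


corpus = [
    "The CPU executes instructions in a computer.",
    "Paris is the capital of France.",
    "Main memory (RAM) stores data a computer is actively using.",
]
retrieve = KeywordRetrieve(corpus, k=2)
result = retrieve("What are the main components of a computer?")
for passage in result.passages:
    print(passage)
```

Because common words such as "the" count toward the overlap, a real setup would at least remove stop words or switch to an embedding-based retriever.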

Learning Path

🟢 Beginner Track

Chapters 1-2: Setup and basic signatures
Build simple QA systems

🟡 Intermediate Track

Chapters 3-4: Modules and RAG systems
Create retrieval-augmented applications

🔴 Advanced Track

Chapters 5-8: Optimization, evaluation, and production
Master DSPy for complex real-world applications

Ready to program language models systematically? Let's begin with Chapter 1: Getting Started!
