AI Foundations
Fine-Tuning vs Prompting vs Retrieval
Compare fine-tuning, prompting, and retrieval in plain English so software engineers know which AI improvement lever they are discussing.
After this, you will understand
This comparison turns a vague request like make the AI better into a concrete choice about behavior, context, and knowledge flow.
Prompting changes the current instructions, retrieval adds useful outside context, and fine-tuning changes model behavior through additional training.
Beginners use fine-tuning as the answer to every weakness or treat retrieval as if it teaches the model permanently.
Decide what is missing first: clearer instructions, current evidence, repeated learned behavior, or a different product workflow.
Think before readingIf an assistant needs today's internal policy to answer correctly, should fine-tuning be your first thought?
Reading in progress
This page is saved in your local study history so you can continue later.
Study path
Read these in order
Start with the mechanics, then move into the patterns that explain why the system is shaped this way.
Concepts Covered
- Prompting
- Retrieval
- Fine-tuning
- Context
- Model behavior
- Current knowledge
- Task adaptation
- Why RAG is not fine-tuning
- When the improvement lever is product design
- How teams decide what to try first
1. Plain-English Definition
Prompting is changing what you ask or instruct the model in the current request.
Retrieval is finding useful outside information and adding it to the workflow, often as context for the current request.
Fine-tuning is additional training that changes model behavior using examples or training signals.
Keep the first contrast:
prompting -> change the request
retrieval -> add relevant information
fine-tuning -> change learned behavior
These are all ways AI systems can improve. They solve different kinds of problems.
2. Why This Idea Exists
Teams often say:
The AI is not good enough.
That sentence is too broad to act on.
Maybe the instruction is vague.
Maybe the model does not have today's document.
Maybe the output format is inconsistent across thousands of calls.
Maybe the workflow needs deterministic validation instead of another model change.
Prompting, retrieval, and fine-tuning are separate vocabulary because they answer different missing pieces:
- what should the model do right now?
- what evidence should it see right now?
- what behavior should be learned into the model more persistently?
3. The Beginner Mental Model
Think of three ways to help a teammate.
Prompting is giving clearer instructions for the task on their desk.
Retrieval is handing them the right reference document before they answer.
Fine-tuning is giving repeated practice and feedback so their future behavior changes.
instruction
reference material
practice
That comparison is simple and surprisingly useful.
If the teammate needs the latest contract clause, repeated practice is not the first fix. Give them the contract.
If they keep returning the wrong output format across a stable task, clearer examples or training may become relevant.
4. What That Mental Model Misses
The teammate analogy misses engineering tradeoffs.
First, prompting can include more than one sentence. It can shape roles, constraints, examples, output formats, and tool instructions, but prompt complexity can become brittle.
Second, retrieval quality is its own system problem. Bad chunks, bad permissions, stale documents, weak ranking, and irrelevant context can all hurt output.
Third, fine-tuning is not a knowledge upload button. It is a training choice. It costs data work, evaluation work, and ongoing maintenance.
Fourth, these levers can be combined. A RAG product still needs prompts. A fine-tuned model may still need retrieval for fresh facts.
Fifth, sometimes none of the three is the main fix. The product may need a tool call, a database lookup, validation logic, a human approval step, or a narrower user promise.
5. A Concrete Example
Imagine an assistant that drafts support replies.
Problem A:
Replies are too long.
A better prompt may say:
Draft a reply under 120 words with one next step.
Problem B:
Replies use the old refund policy.
Retrieval or a policy tool should fetch the current policy and add it to the request.
Problem C:
Across a stable, high-volume workflow, the model repeatedly misses the company's response style even with good prompts and examples.
Fine-tuning may be worth evaluating if you have quality examples, a measurable target, and a model path that supports it.
The same product can use all three. The right first move depends on what is missing.
6. How It Works At A Practical Level
Prompting usually affects inference:
instructions + user input + context -> model -> output
Retrieval adds a selection step:
question -> find relevant information -> include context -> model -> output
Fine-tuning adds training work:
examples + training signal -> adjusted model behavior -> later inference
When deciding between them, ask:
- Is the model missing instructions?
- Is the model missing current, private, or source-backed information?
- Is the task stable enough that learned behavior is worth changing?
- Can we measure whether the change worked?
That last question matters for every lever.
7. Where You See This In Real AI Products
In a document Q&A product, retrieval is central because user documents need to reach the request.
In a coding assistant, prompting may constrain the edit task while retrieval or repository context surfaces the relevant files.
In a support product, retrieval can bring policy and account evidence while prompts shape tone and output structure.
In a specialized extraction workflow, fine-tuning may be considered when the task is stable, examples are available, and general prompting is not reliable enough.
In an agent workflow, prompts, retrieval, tools, and possibly specialized models can all participate. The architecture still needs boundaries and evaluation.
8. Common Confusions
RAG is not fine-tuning.
RAG retrieves context for generation. Fine-tuning changes model behavior through training.
Prompting is not training.
A prompt shapes the current request. It does not by itself rewrite model parameters.
Retrieval is not model memory.
Retrieval fetches information. The product decides whether and when it becomes context.
Fine-tuning is not the first answer to fresh facts.
Current facts usually belong in context, retrieval, tools, or databases.
9. What This Does Not Mean
This does not mean fine-tuning is bad.
It can be valuable when the problem really is learned behavior and the evaluation case is strong.
This does not mean prompts are toy work.
Prompt design is part of the model interface. It just cannot replace evidence and system design.
This does not mean retrieval guarantees truth.
The product still has to retrieve well, ground answers, and test failure cases.
10. What To Learn Next
Move next to Multimodal AI In Plain English so AI input and output stops meaning text only in your head.
Then learn the multi-step workflow vocabulary in What Is An AI Agent?.
What to study next
These links keep the session moving: read prerequisites first, then open the systems, concepts, and patterns that deepen this page.
Prerequisites
Read these first if the mechanics feel unfamiliar.
More Links
Additional references connected to this page.