AI Foundations

Fine-Tuning vs Prompting vs Retrieval

Compare fine-tuning, prompting, and retrieval in plain English so software engineers know which AI improvement lever they are discussing.

foundation5 min readUpdated 2026-05-22FoundationsVocabularyProducts

Fine-TuningPromptingRetrievalContextModel BehaviorProduct Knowledge

After this, you will understand

This comparison turns a vague request like make the AI better into a concrete choice about behavior, context, and knowledge flow.

Beginner version

Prompting changes the current instructions, retrieval adds useful outside context, and fine-tuning changes model behavior through additional training.

Confusion point

Beginners use fine-tuning as the answer to every weakness or treat retrieval as if it teaches the model permanently.

Better mental model

Decide what is missing first: clearer instructions, current evidence, repeated learned behavior, or a different product workflow.

Think before readingIf an assistant needs today's internal policy to answer correctly, should fine-tuning be your first thought?

Usually no. A current policy is better fetched and supplied as context through retrieval or tools than baked into model behavior.

Reading in progress

This page is saved in your local study history so you can continue later.

Next: Multimodal AI In Plain English

Study path

Read these in order

Start with the mechanics, then move into the patterns that explain why the system is shaped this way.

Concepts Covered

Prompting
Retrieval
Fine-tuning
Context
Model behavior
Current knowledge
Task adaptation
Why RAG is not fine-tuning
When the improvement lever is product design
How teams decide what to try first

1. Plain-English Definition

Prompting is changing what you ask or instruct the model in the current request.

Retrieval is finding useful outside information and adding it to the workflow, often as context for the current request.

Fine-tuning is additional training that changes model behavior using examples or training signals.

Keep the first contrast:

prompting -> change the request
retrieval -> add relevant information
fine-tuning -> change learned behavior

These are all ways AI systems can improve. They solve different kinds of problems.

2. Why This Idea Exists

Teams often say:

The AI is not good enough.

That sentence is too broad to act on.

Maybe the instruction is vague.

Maybe the model does not have today's document.

Maybe the output format is inconsistent across thousands of calls.

Maybe the workflow needs deterministic validation instead of another model change.

Prompting, retrieval, and fine-tuning are separate vocabulary because they answer different missing pieces:

what should the model do right now?
what evidence should it see right now?
what behavior should be learned into the model more persistently?

3. The Beginner Mental Model

Think of three ways to help a teammate.

Prompting is giving clearer instructions for the task on their desk.

Retrieval is handing them the right reference document before they answer.

Fine-tuning is giving repeated practice and feedback so their future behavior changes.

instruction
reference material
practice

That comparison is simple and surprisingly useful.

If the teammate needs the latest contract clause, repeated practice is not the first fix. Give them the contract.

If they keep returning the wrong output format across a stable task, clearer examples or training may become relevant.

4. What That Mental Model Misses

The teammate analogy misses engineering tradeoffs.

First, prompting can include more than one sentence. It can shape roles, constraints, examples, output formats, and tool instructions, but prompt complexity can become brittle.

Second, retrieval quality is its own system problem. Bad chunks, bad permissions, stale documents, weak ranking, and irrelevant context can all hurt output.

Third, fine-tuning is not a knowledge upload button. It is a training choice. It costs data work, evaluation work, and ongoing maintenance.

Fourth, these levers can be combined. A RAG product still needs prompts. A fine-tuned model may still need retrieval for fresh facts.

Fifth, sometimes none of the three is the main fix. The product may need a tool call, a database lookup, validation logic, a human approval step, or a narrower user promise.

5. A Concrete Example

Imagine an assistant that drafts support replies.

Problem A:

Replies are too long.

A better prompt may say:

Draft a reply under 120 words with one next step.

Problem B:

Replies use the old refund policy.

Retrieval or a policy tool should fetch the current policy and add it to the request.

Problem C:

Across a stable, high-volume workflow, the model repeatedly misses the company's response style even with good prompts and examples.

Fine-tuning may be worth evaluating if you have quality examples, a measurable target, and a model path that supports it.

The same product can use all three. The right first move depends on what is missing.

6. How It Works At A Practical Level

Prompting usually affects inference:

instructions + user input + context -> model -> output

Retrieval adds a selection step:

question -> find relevant information -> include context -> model -> output

Fine-tuning adds training work:

examples + training signal -> adjusted model behavior -> later inference

When deciding between them, ask:

Is the model missing instructions?
Is the model missing current, private, or source-backed information?
Is the task stable enough that learned behavior is worth changing?
Can we measure whether the change worked?

That last question matters for every lever.

7. Where You See This In Real AI Products

In a document Q&A product, retrieval is central because user documents need to reach the request.

In a coding assistant, prompting may constrain the edit task while retrieval or repository context surfaces the relevant files.

In a support product, retrieval can bring policy and account evidence while prompts shape tone and output structure.

In a specialized extraction workflow, fine-tuning may be considered when the task is stable, examples are available, and general prompting is not reliable enough.

In an agent workflow, prompts, retrieval, tools, and possibly specialized models can all participate. The architecture still needs boundaries and evaluation.

8. Common Confusions

RAG is not fine-tuning.

RAG retrieves context for generation. Fine-tuning changes model behavior through training.

Prompting is not training.

A prompt shapes the current request. It does not by itself rewrite model parameters.

Retrieval is not model memory.

Retrieval fetches information. The product decides whether and when it becomes context.

Fine-tuning is not the first answer to fresh facts.

Current facts usually belong in context, retrieval, tools, or databases.

9. What This Does Not Mean

This does not mean fine-tuning is bad.

It can be valuable when the problem really is learned behavior and the evaluation case is strong.

This does not mean prompts are toy work.

Prompt design is part of the model interface. It just cannot replace evidence and system design.

This does not mean retrieval guarantees truth.

The product still has to retrieve well, ground answers, and test failure cases.

10. What To Learn Next

Move next to Multimodal AI In Plain English so AI input and output stops meaning text only in your head.

Then learn the multi-step workflow vocabulary in What Is An AI Agent?.

What to study next

These links keep the session moving: read prerequisites first, then open the systems, concepts, and patterns that deepen this page.

Prerequisites

Read these first if the mechanics feel unfamiliar.

Training Data vs Context vs MemoryStart here if Training Data vs Context vs Memory is still fuzzy.RAG In Plain EnglishStart here if RAG In Plain English is still fuzzy.

Read these in order

What to study next

Prerequisites

More Links