AI Foundations

Parameters And Weights

Explain parameters and weights in plain English so software engineers understand where learned model behavior lives before transformer internals.

foundation5 min readUpdated 2026-05-22FoundationsVocabularyMechanics

ParametersWeightsTrainingLearned BehaviorModel SizeInference

After this, you will understand

Parameters explain how a model can learn behavior without engineers writing every rule by hand.

Beginner version

Parameters are learned numbers inside a model. Weights are the parameters that control how strongly signals influence later computation.

Confusion point

Beginners imagine a trained model stores neat English rules or a table of answers instead of many learned numeric settings.

Better mental model

Reason about model behavior as learned numeric state shaped during training, then add product context and checks during inference.

Think before readingWhen a model is trained, where does the learned behavior go?

Training adjusts numeric parameters inside the model. During inference, those learned parameters shape how the model transforms input into output.

Reading in progress

This page is saved in your local study history so you can continue later.

Next: What Is A Large Language Model?

Study path

Read these in order

Start with the mechanics, then move into the patterns that explain why the system is shaped this way.

Concepts Covered

Parameters
Weights
Learned numbers
Training
Inference
Model size
Bias terms
Why parameters are not facts
Why more parameters are not automatic truth
How parameters differ from prompts and context

1. Plain-English Definition

Parameters are learned numeric settings inside a model.

Weights are the parameters people mention most often because they control how strongly signals inside the model influence later computation.

For a beginner, keep this picture:

training adjusts parameters
inference uses parameters

When people say a model has millions or billions of parameters, they are talking about how many learned numbers help define that model's behavior.

Those numbers are not written by an engineer one by one. They are adjusted during training.

2. Why This Idea Exists

A model needs somewhere to store what training changed.

Normal code stores behavior in rules written by humans.

if order.total > limit, require review

A learned model gets behavior from examples and an optimization process. During training, it makes outputs, measures error or preference signals, and adjusts internal numbers so future outputs improve for the training objective.

Parameters are those adjustable internal numbers.

Without parameters, the model would have no learned state to carry from training into inference.

Parameters are why a model can be trained once and then used later on new input.

3. The Beginner Mental Model

Think of parameters as knobs inside a huge machine.

Training turns the knobs.

Inference runs the machine with those knob settings.

examples + training process -> adjusted knobs -> trained model

This analogy explains the basic role of parameters without requiring math first.

It also explains why two models with the same high-level architecture can behave differently after different training. Their learned parameter values are different.

4. What That Mental Model Misses

The knob analogy is useful, but it can make the model sound more human-readable than it is.

First, parameters are not usually neat rules you can inspect like:

refund policy = fourteen days

Knowledge and behavior are distributed across many learned values.

Second, weights are not the only kind of parameter in every model. People often use "weights" casually to refer to learned model state, but the broader word is parameters.

Third, parameters are not the same thing as prompts. Parameters were shaped during training. Prompts and context are supplied during a current inference request.

Fourth, a larger parameter count does not guarantee the product is better for your use case. Training quality, data, architecture, latency, cost, context, evaluation, and product design still matter.

Fifth, parameter learning does not make the model a fact database. It stores learned behavior in numeric form, not a clean row-per-fact source of truth.

5. A Concrete Example

Imagine a tiny classifier that decides whether a support message sounds like a billing issue.

During training, it sees examples such as:

"I was charged twice" -> billing
"My upload is stuck" -> not billing

The training process adjusts internal numeric values until signals related to billing-like language influence the output in useful ways.

Later, during inference, a new message arrives:

"The payment went through but my plan is still free."

The trained parameters help turn that message into an output such as:

billing issue: likely

The engineer did not write a full rulebook for every sentence. Training changed the model's parameters so the model can generalize patterns.

6. How It Works At A Practical Level

At a practical level, model training repeatedly does a loop like this:

input -> current model -> output -> compare signal -> adjust parameters

The exact signal depends on the training setup. It may compare a prediction with a target, reward a preferred output, or optimize another objective.

After training, the chosen parameter values are saved as part of the model.

During inference:

new input + learned parameters -> model computation -> output

For large language models, those parameters shape how token sequences are processed and which next tokens become more or less likely under the visible context.

This is why model files can be large and inference can be expensive. The system has to load and compute with many learned values.

7. Where You See This In Real AI Products

When a team chooses between models, parameter scale may come up as one clue about capacity, cost, and deployment needs.

When an inference server loads a model, it needs the learned model state that includes parameters.

When a smaller model is distilled or quantized later, engineers are often changing how model capability and parameter representation trade against latency, memory, and cost.

When a product improves through better retrieval or better prompt context, it may improve without changing model parameters at all.

That contrast matters:

change model behavior through training -> parameters change
change current request through context -> parameters stay the same

8. Common Confusions

Parameters are not prompt parameters.

In normal API talk, "parameter" can mean a request option. In model talk, parameters often mean learned numeric values inside the model.

Weights are not human-written rules.

They are numeric values adjusted by training.

More parameters do not automatically mean better answers.

Capability can improve with scale, but quality is still task-dependent and product-dependent.

Parameters are not context.

Context is what the model can see during the current request. Parameters are learned state already inside the model.

9. What This Does Not Mean

This does not mean you need to inspect billions of weights to use a model responsibly.

At the beginner layer, you need the boundary: training shapes parameters; inference uses them.

This does not mean model learning is identical to a human remembering facts.

Model behavior is encoded numerically and may be incomplete, stale, or wrong for a task.

This does not mean product teams can ignore data and evaluation after choosing a large model.

The surrounding system still decides whether outputs are useful and safe.

10. What To Learn Next

Learn how the current request reaches a language model in Prompts, Context, And Completions.

Then study Hallucinations so fluent model output does not get mistaken for guaranteed truth.

What to study next

These links keep the session moving: read prerequisites first, then open the systems, concepts, and patterns that deepen this page.

Prerequisites

Read these first if the mechanics feel unfamiliar.

What Is A Model?Start here if What Is A Model? is still fuzzy.Training vs InferenceStart here if Training vs Inference is still fuzzy.What Is A Neural Network?Start here if What Is A Neural Network? is still fuzzy.

Read these in order

What to study next

Prerequisites

More Links