AI Foundations
Parameters And Weights
Explain parameters and weights in plain English so software engineers understand where learned model behavior lives before transformer internals.
After this, you will understand
Parameters explain how a model can learn behavior without engineers writing every rule by hand.
Parameters are learned numbers inside a model. Weights are the parameters that control how strongly signals influence later computation.
Beginners imagine a trained model stores neat English rules or a table of answers instead of many learned numeric settings.
Reason about model behavior as learned numeric state shaped during training, then add product context and checks during inference.
Think before readingWhen a model is trained, where does the learned behavior go?
Reading in progress
This page is saved in your local study history so you can continue later.
Study path
Read these in order
Start with the mechanics, then move into the patterns that explain why the system is shaped this way.
Concepts Covered
- Parameters
- Weights
- Learned numbers
- Training
- Inference
- Model size
- Bias terms
- Why parameters are not facts
- Why more parameters are not automatic truth
- How parameters differ from prompts and context
1. Plain-English Definition
Parameters are learned numeric settings inside a model.
Weights are the parameters people mention most often because they control how strongly signals inside the model influence later computation.
For a beginner, keep this picture:
training adjusts parameters
inference uses parameters
When people say a model has millions or billions of parameters, they are talking about how many learned numbers help define that model's behavior.
Those numbers are not written by an engineer one by one. They are adjusted during training.
2. Why This Idea Exists
A model needs somewhere to store what training changed.
Normal code stores behavior in rules written by humans.
if order.total > limit, require review
A learned model gets behavior from examples and an optimization process. During training, it makes outputs, measures error or preference signals, and adjusts internal numbers so future outputs improve for the training objective.
Parameters are those adjustable internal numbers.
Without parameters, the model would have no learned state to carry from training into inference.
Parameters are why a model can be trained once and then used later on new input.
3. The Beginner Mental Model
Think of parameters as knobs inside a huge machine.
Training turns the knobs.
Inference runs the machine with those knob settings.
examples + training process -> adjusted knobs -> trained model
This analogy explains the basic role of parameters without requiring math first.
It also explains why two models with the same high-level architecture can behave differently after different training. Their learned parameter values are different.
4. What That Mental Model Misses
The knob analogy is useful, but it can make the model sound more human-readable than it is.
First, parameters are not usually neat rules you can inspect like:
refund policy = fourteen days
Knowledge and behavior are distributed across many learned values.
Second, weights are not the only kind of parameter in every model. People often use "weights" casually to refer to learned model state, but the broader word is parameters.
Third, parameters are not the same thing as prompts. Parameters were shaped during training. Prompts and context are supplied during a current inference request.
Fourth, a larger parameter count does not guarantee the product is better for your use case. Training quality, data, architecture, latency, cost, context, evaluation, and product design still matter.
Fifth, parameter learning does not make the model a fact database. It stores learned behavior in numeric form, not a clean row-per-fact source of truth.
5. A Concrete Example
Imagine a tiny classifier that decides whether a support message sounds like a billing issue.
During training, it sees examples such as:
"I was charged twice" -> billing
"My upload is stuck" -> not billing
The training process adjusts internal numeric values until signals related to billing-like language influence the output in useful ways.
Later, during inference, a new message arrives:
"The payment went through but my plan is still free."
The trained parameters help turn that message into an output such as:
billing issue: likely
The engineer did not write a full rulebook for every sentence. Training changed the model's parameters so the model can generalize patterns.
6. How It Works At A Practical Level
At a practical level, model training repeatedly does a loop like this:
input -> current model -> output -> compare signal -> adjust parameters
The exact signal depends on the training setup. It may compare a prediction with a target, reward a preferred output, or optimize another objective.
After training, the chosen parameter values are saved as part of the model.
During inference:
new input + learned parameters -> model computation -> output
For large language models, those parameters shape how token sequences are processed and which next tokens become more or less likely under the visible context.
This is why model files can be large and inference can be expensive. The system has to load and compute with many learned values.
7. Where You See This In Real AI Products
When a team chooses between models, parameter scale may come up as one clue about capacity, cost, and deployment needs.
When an inference server loads a model, it needs the learned model state that includes parameters.
When a smaller model is distilled or quantized later, engineers are often changing how model capability and parameter representation trade against latency, memory, and cost.
When a product improves through better retrieval or better prompt context, it may improve without changing model parameters at all.
That contrast matters:
change model behavior through training -> parameters change
change current request through context -> parameters stay the same
8. Common Confusions
Parameters are not prompt parameters.
In normal API talk, "parameter" can mean a request option. In model talk, parameters often mean learned numeric values inside the model.
Weights are not human-written rules.
They are numeric values adjusted by training.
More parameters do not automatically mean better answers.
Capability can improve with scale, but quality is still task-dependent and product-dependent.
Parameters are not context.
Context is what the model can see during the current request. Parameters are learned state already inside the model.
9. What This Does Not Mean
This does not mean you need to inspect billions of weights to use a model responsibly.
At the beginner layer, you need the boundary: training shapes parameters; inference uses them.
This does not mean model learning is identical to a human remembering facts.
Model behavior is encoded numerically and may be incomplete, stale, or wrong for a task.
This does not mean product teams can ignore data and evaluation after choosing a large model.
The surrounding system still decides whether outputs are useful and safe.
10. What To Learn Next
Learn how the current request reaches a language model in Prompts, Context, And Completions.
Then study Hallucinations so fluent model output does not get mistaken for guaranteed truth.
What to study next
These links keep the session moving: read prerequisites first, then open the systems, concepts, and patterns that deepen this page.
Prerequisites
Read these first if the mechanics feel unfamiliar.
More Links
Additional references connected to this page.