AI Engineering

AI Concepts

Core AI engineering mechanisms behind models, embeddings, retrieval, agents, inference, and evaluation.

intermediate

Vector Embeddings

Turn text, images, code, users, or products into learned vectors that preserve useful relationships for search, ranking, retrieval, and modeling.

ai-conceptsembeddingsretrieval

intermediate

Semantic Space

Reason about the learned representation space where embeddings are compared, clustered, ranked, and searched by relative position.

ai-conceptsembeddingssimilarity

intermediate

Vector Search

Retrieve nearby vectors for a query representation so AI systems can find semantically related candidates under latency and scale pressure.

ai-conceptsvector-searchretrieval

intermediate

Vector Databases

Store, index, filter, and retrieve vector embeddings as a production data service for semantic search and AI retrieval workloads.

ai-conceptsvector-databasesretrieval

intermediate

ANN Indexes

Trade exact nearest-neighbor scans for approximate vector indexes that keep similarity retrieval fast enough at larger corpus sizes.

ai-conceptsannvector-search

intermediate

Indexing Techniques For Vector Search

Compare exact, partitioned, graph-based, and compressed vector index techniques by the retrieval work they save and the tradeoffs they introduce.

ai-conceptsvector-searchindexing

intermediate

Search Execution Flow

Follow a vector retrieval request from query embedding through filters, ANN candidates, payload hydration, reranking, and downstream context use.

ai-conceptsvector-searchretrieval

intermediate

Supervised vs Unsupervised vs Self-Supervised Learning

Compare the learning signals behind supervised, unsupervised, and self-supervised training before moving into modern model objectives.

ai-conceptstraininglearning-signals

intermediate

Loss, Optimization, And Gradient Descent

Connect the training objective, loss signal, parameter updates, and gradient descent loop that make model learning concrete.

ai-conceptstrainingoptimization

intermediate

Transformer Architecture

See the transformer as the model shape that turns token representations, attention, feed-forward layers, and repeated blocks into modern language-model computation.

ai-conceptstransformersllms

intermediate

Attention

Understand attention as the mechanism that lets token positions choose which context signals matter when their representations are updated.

ai-conceptstransformersattention

intermediate

Multi-Head Attention

Learn why transformers run several attention heads in parallel so token representations can mix different learned context signals.

ai-conceptstransformersattention

intermediate

Masked Attention

Understand how attention masks control which token positions are allowed to influence each other, especially during next-token generation.

ai-conceptstransformersattention

intermediate

Positional Embeddings

Learn why transformers need position information so token order can influence attention and language-model behavior.

ai-conceptstransformersembeddings

intermediate

KV Cache

Understand how key-value caching makes autoregressive LLM inference faster by reusing attention work from previous tokens.

ai-conceptsinferencetransformers

intermediate

Quantization

Learn how quantization reduces model memory and serving cost by representing weights or activations with lower precision.

ai-conceptsinferenceoptimization

intermediate

Distillation

Understand how knowledge distillation trains a smaller or cheaper model to imitate useful behavior from a larger teacher model.

ai-conceptstrainingoptimization

intermediate

Mixture Of Experts

Learn how mixture-of-experts models increase capacity by routing inputs through selected expert subnetworks instead of activating every parameter.

ai-conceptsmodel-architecturetransformers