"> Foundation Models Foundation Model A large-scale AI model trained on broad data that can be adapted to a wide range of downstream tasks. Examples: GPT-4, Claude, Gemini, LLaMA. Large Language Model (LLM) A neural network trained on vast text corpora to understand and generate human language using

"> Computer Vision Computer Vision dd>The field of AI focused on enabling computers to interpret and understand visual information from images and video. CNN dd>Convolutional Neural Network — uses convolutional filters to detect visual features at different scales and locations. Object Detecti

"> Agent Systems AI Agent dd>An autonomous system that perceives its environment, makes decisions, and takes actions to achieve goals. Autonomous Agent dd>An AI agent that operates independently, making and executing decisions without continuous human oversight. Multi-Agent System dd>A syst

"> AI Hardware GPU dd>Graphics Processing Unit — parallel processors originally designed for graphics, now essential for AI training and inference. TPU dd>Tensor Processing Unit — Google's custom chip optimized specifically for machine learning workloads. NPU dd>Neural Processing Unit — spe

AI Glossary: 100+ Terms Every Practitioner Should Know (2026)

Q: Natural Language Processing

"> Natural Language Processing NLP Natural Language Processing — the field focused on enabling computers to understand, interpret, and generate human language. Named Entity Recognition (NER) Identifying and classifying entities (people, organizations, locations, dates) in unstructured text. Sentimen

Q: Reinforcement Learning

from Human Feedback) Training approach that uses human preference data to align model outputs with human values and preferences. DPO (Direct Preference Optimization) An alignment technique that directly optimizes a model on preference data without training a separate reward model. LoRA (Low-Rank Ada

.glossary-page { max-width: 900px; margin: 0 auto; font-family: -apple-system, BlinkMacSystemFont, ‚Segoe UI‘, Roboto, sans-serif; }
.glossary-page h1 { color: #1a1a2e; border-bottom: 3px solid #16213e; padding-bottom: 10px; }
.glossary-page h2 { color: #16213e; margin-top: 30px; background: #f0f4ff; padding: 10px 15px; border-left: 4px solid #16213e; }
.glossary-page dl { margin: 0 0 20px 0; }
.glossary-page dt { font-weight: 700; color: #0f3460; margin-top: 14px; font-size: 1.05em; }
.glossary-page dd { margin: 4px 0 0 20px; color: #333; line-height: 1.6; }
#glossarySearch { width: 100%; padding: 12px 16px; font-size: 16px; border: 2px solid #16213e; border-radius: 8px; margin-bottom: 24px; box-sizing: border-box; }

AI Glossary: 100+ Terms Every Practitioner Should Know

Reviewed: June 4, 2026

Comprehensive reference for AI, ML, and data science terminology — organized by category. Last updated: May 2026.

Foundation Models

Foundation Model: A large-scale AI model trained on broad data that can be adapted to a wide range of downstream tasks. Examples: GPT-4, Claude, Gemini, LLaMA.
Large Language Model (LLM): A neural network trained on vast text corpora to understand and generate human language using transformer architectures.
Transformer: The neural network architecture powering modern LLMs, introduced in „Attention Is All You Need“ (2017). Uses self-attention to process sequences in parallel.
Attention Mechanism: A component that weighs the importance of different parts of the input when producing output, enabling the model to focus on relevant context.
Self-Attention: Attention where the query, key, and value all come from the same sequence, allowing each position to attend to all other positions.
Multi-Head Attention: Running multiple attention mechanisms in parallel, each learning different types of relationships in the data.
Token: The basic unit of text processed by an LLM — a word, subword, or character. English averages roughly 1.3 tokens per word.
Context Window: The maximum tokens an LLM can consider at once when generating a response. Ranges from 4K to 1M+ in modern models.
Temperature: A parameter controlling output randomness. Lower values (0.1-0.3) produce focused, deterministic responses; higher values (0.7-1.5) produce creative, diverse outputs.
Top-p (Nucleus Sampling): Sampling from the smallest set of tokens whose cumulative probability exceeds p, balancing diversity and quality.
Top-k Sampling: Sampling from only the k most likely next tokens, filtering out low-probability options.
Prompt Engineering: The practice of crafting input text to elicit desired LLM outputs. Includes few-shot examples, role specification, and structured formatting.
Chain of Thought (CoT): A prompting technique that asks the model to reason step-by-step before giving a final answer, improving performance on complex reasoning tasks.
Few-Shot Learning: Teaching a model to perform a task by providing a small number of examples in the prompt, without updating model weights.
In-Context Learning: The ability of LLMs to learn patterns from examples provided within the prompt itself, without any weight updates.
Fine-tuning: Further training a pre-trained model on a specific dataset to adapt it for particular tasks or domains.
Pre-training: The initial phase where a model learns general patterns from a large, diverse dataset before being fine-tuned.
RLHF (Reinforcement Learning from Human Feedback): Training approach that uses human preference data to align model outputs with human values and preferences.
DPO (Direct Preference Optimization): An alignment technique that directly optimizes a model on preference data without training a separate reward model.
LoRA (Low-Rank Adaptation): A parameter-efficient fine-tuning method that adds small trainable matrices to frozen pre-trained weights.
QLoRA: Quantized LoRA — combines 4-bit quantization with LoRA for efficient fine-tuning on consumer hardware.
Mixture of Experts (MoE): An architecture where different parts of the network (experts) handle different types of inputs, enabling large models with lower compute costs.
Emergent Ability: Capabilities that unpredictably arise in large models that were absent in smaller versions, such as multi-step reasoning.

Natural Language Processing

NLP: Natural Language Processing — the field focused on enabling computers to understand, interpret, and generate human language.
Named Entity Recognition (NER): Identifying and classifying entities (people, organizations, locations, dates) in unstructured text.
Sentiment Analysis: Determining the emotional tone expressed in text: positive, negative, or neutral.
Text Classification
Machine Translation: Automatically translating text from one language to another using AI models.
Text Summarization
Question Answering: Systems that automatically answer questions posed in natural language from knowledge bases or open-domain sources.
RAG (Retrieval-Augmented Generation): Enhancing LLM responses by retrieving relevant information from external knowledge sources before generating an answer.
Word Embedding
Word2Vec
BERT
GPT
Seq2Seq
BLEU Score
ROUGE Score
Perplexity

Computer Vision

Computer Vision
CNN
Object Detection
Image Segmentation
Semantic Segmentation
Instance Segmentation
OCR
GAN
Diffusion Model: A generative model that learns to reverse a gradual noise-adding process to create realistic data from random noise.
Vision Transformer (ViT)
CLIP
Stable Diffusion
Image Classification

Reinforcement Learning

Reinforcement Learning (RL)
Agent (RL)
Environment (RL)
State
Action
Reward
Policy
Q-Learning
Deep Q-Network (DQN)
Policy Gradient
Actor-Critic
PPO (Proximal Policy Optimization)
Reward Modeling
Reward Hacking

MLOps

MLOps
Model Registry
Feature Store
Model Drift
Data Drift
Concept Drift
A/B Testing
Shadow Deployment
Canary Deployment
Blue-Green Deployment
Model Serving
Model Monitoring
CI/CD for ML

AI Safety

AI Alignment
AI Safety
Red Teaming
Jailbreak
Prompt Injection
Hallucination
Bias (Algorithmic)
Fairness
Explainability (XAI)
Interpretability
Robustness

Agent Systems

AI Agent
Autonomous Agent
Multi-Agent System
Agent Orchestration
Tool Use
Function Calling
MCP (Model Context Protocol)
A2A (Agent-to-Agent)
ReAct
Planning (Agent)
Short-Term Memory (Agent)
Long-Term Memory (Agent)
Embedding
Vector Database
Semantic Search

AI Hardware

GPU
TPU
NPU
CUDA
Tensor Core
VRAM
Memory Bandwidth
NVLink
InfiniBand
Quantization
GGUF
GGML

📚 Related Posts

DataGate AI Content Intelligence Dashboard — DataGate AI Content Intelligence Dashboard *{box-sizing:border-box;margin:0;padding:0} :root{--bg:#0f172a;--card:#1e293b;--accent:#3b82f6;--accent2:#8b5cf6;--green:#10b981;--yellow:#f59e0b;--red:#ef4444;--text:#e2e8f0;--muted:#94a3b8} body{font-family:'Segoe UI',system-ui,sans-serif;background:var(--bg);color:var(--text);padding:16px;line-height:1.6} .header{display:flex;align-items:center;justify-content:space-between;flex-wrap:wrap;gap:12px;margin-bottom:16px} .header h1{font-size:1.5rem;background:linear-gradient(90deg,var(--accent),var(--accent2));-webkit-background-clip:text;-webkit-text-fill-color:transparent} .header .badge{background:linear-gradient(135deg,var(--accent),var(--accent2));color:#fff;padding:4px 12px;border-radius:20px;font-size:.75rem;font-weight:600}…
Topic Trend Tracker — Topic Trend Tracker *{box-sizing:border-box;margin:0;padding:0} :root{--bg:#0f172a;--card:#1e293b;--accent:#3b82f6;--accent2:#8b5cf6;--green:#10b981;--yellow:#f59e0b;--red:#ef4444;--text:#e2e8f0;--muted:#94a3b8} body{font-family:'Segoe UI',system-ui,sans-serif;background:var(--bg);color:var(--text);padding:20px;line-height:1.6} .wrap{max-width:1100px;margin:0 auto} h1{font-size:1.6rem;margin:4px 0 16px;background:linear-gradient(90deg,var(--accent),var(--accent2));-webkit-background-clip:text;-webkit-text-fill-color:transparent} .sub{color:var(--muted);margin-bottom:20px;font-size:.9rem} .grid{display:grid;grid-template-columns:1fr 1fr;gap:16px}…
Audience Segmentation Explorer — Audience Segmentation Explorer *{box-sizing:border-box;margin:0;padding:0} :root{--bg:#0f172a;--card:#1e293b;--accent:#3b82f6;--accent2:#8b5cf6;--green:#10b981;--yellow:#f59e0b;--red:#ef4444;--text:#e2e8f0;--muted:#94a3b8} body{font-family:'Segoe UI',system-ui,sans-serif;background:var(--bg);color:var(--text);padding:20px;line-height:1.6} .wrap{max-width:1100px;margin:0 auto} h1{font-size:1.6rem;margin:4px 0 16px;background:linear-gradient(90deg,var(--accent),var(--accent2));-webkit-background-clip:text;-webkit-text-fill-color:transparent} .sub{color:var(--muted);margin-bottom:20px;font-size:.9rem} .grid{display:grid;grid-template-columns:1fr 1fr;gap:16px}…
AI Content Performance Analyzer — AI Content Performance Analyzer *{box-sizing:border-box;margin:0;padding:0} :root{--bg:#0f172a;--card:#1e293b;--accent:#3b82f6;--accent2:#8b5cf6;--green:#10b981;--yellow:#f59e0b;--red:#ef4444;--text:#e2e8f0;--muted:#94a3b8} body{font-family:'Segoe UI',system-ui,sans-serif;background:var(--bg);color:var(--text);padding:20px;line-height:1.6} .wrap{max-width:1100px;margin:0 auto} h1{font-size:1.6rem;margin:4px 0 16px;background:linear-gradient(90deg,var(--accent),var(--accent2));-webkit-background-clip:text;-webkit-text-fill-color:transparent} .sub{color:var(--muted);margin-bottom:20px;font-size:.9rem} .stats{display:grid;grid-template-columns:repeat(auto-fit,minmax(140px,1fr));gap:12px;margin-bottom:20px}…
Wave 151 Hub: AI Agent Engineering — 🌊 Wave 151: AI Agent Engineering The definitive guide to building production-grade AI agents —…