The Agent Glossary

Plain definitions for the dialect AI coding shipped before writing the docs.

A

Agent

Contested. The reinforcement-learning sense (an entity that observes, decides, acts, observes again) predates the LLM era by decades. In current AI-coding usage it almost always means a language model wrapped in a loop that can call tools, read their results, and decide what to do next. Whether a given system qualifies is mostly an argument about how much autonomy it has between human turns.

Agentic

Adjective for "of or pertaining to an agent." Often hand-wavy in marketing. Useful in context (an "agentic workflow" is one driven by an agent rather than a fixed script).

AGI

Artificial General Intelligence. The point at which a system can do most cognitive work a human can do. Definitions vary widely; some labs define it operationally (a benchmark threshold), others philosophically. Treat any specific date claim with suspicion.

Alignment

The problem of making a model do what its operator (or, more ambitiously, humanity) actually wants, including in cases the operator did not specify. A research field, not a feature.

Attention

The mechanism inside a transformer that lets each token "look at" every other token in the context window when computing its next representation. The reason transformers replaced recurrent networks in 2017.

Autoregressive

Generating output one token at a time, where each new token is conditioned on all previous tokens. Most chat LLMs are autoregressive; image diffusion models are not.

B

Benchmark

A standardised test for comparing models. Useful in aggregate, gameable individually. Common ones include MMLU, GSM8K, HumanEval, SWE-bench. "Benchmark contamination" is the worry that test questions leaked into training data.

Bias

Two unrelated meanings collide here. In statistics, a model's systematic deviation from the truth. In ethics, a model's tendency to produce outputs that are unfair across groups. Both matter; conflating them causes most public arguments about AI bias.

C

Chain of Thought (CoT)

A prompting pattern (and now a training target) in which the model writes out intermediate reasoning steps before giving an answer. Empirically improves accuracy on multi-step problems. Visible CoT is not the same as the model's actual internal computation.

Claude

Family of LLMs from Anthropic. Named after Claude Shannon. Successive versions (Opus, Sonnet, Haiku) trade off capability and speed.

Constitutional AI

A training method, introduced by Anthropic, in which a model is fine-tuned against a written set of principles ("a constitution") rather than only against human-labelled preferences. Reduces the volume of human labelling needed.

Context Window

The total span of tokens a model can consider in one pass — both the input you supply and the output it generates. Measured in tokens, not words or characters. Once the window is full, older tokens fall out (truncated or summarised) and the model loses access to them.

D

Diffusion

A generative-model family that learns to reverse a gradual noising process. Dominant for image and video; rare for text, though research is active. Not autoregressive.

Distillation

Training a smaller "student" model to imitate a larger "teacher" model's outputs. Used to compress capability into cheaper-to-run weights. The student rarely matches the teacher, but often comes close enough for production use.

E

Embedding

A fixed-length vector of numbers representing the meaning of a piece of text (or image, audio, etc.) such that similar inputs produce nearby vectors. The lookup primitive behind RAG and vector databases.

Emergent capability

An ability that appears (or appears to appear) only above a certain model scale — absent in small models, present in large ones. Some claimed emergence vanishes under better measurement; the term is now used more cautiously than circa-2022.

Eval

Short for evaluation. A test or suite of tests used to measure a model or agent's behaviour on a specific task. The plural ("evals") often refers to a team's internal test set, distinct from public benchmarks.

F

Few-shot

Prompting a model with a small number of examples (typically 2–10) of the desired input-output pattern before asking it to solve a new instance. Contrast with zero-shot.

Fine-tuning

Further training a pre-trained model on a smaller, more specific dataset to adapt its behaviour. Cheaper than training from scratch; more invasive than prompting. Often done with LoRA to avoid updating every weight.

Foundation Model

A model trained on broad data at scale, intended to be adapted (via prompting, fine-tuning, etc.) to many downstream tasks. Coined at Stanford in 2021; useful as an umbrella term covering large LLMs and large vision models alike.

Function Calling

A protocol convention in which the model emits a structured request (typically JSON) naming a function and its arguments, the host runs that function, and the result is fed back into the model's context. The plumbing under most tool use.

G

GPT

Generative Pre-trained Transformer. Originally the name of OpenAI's series of models (GPT-2, GPT-3, …); now also used generically for any autoregressive transformer language model.

Grounding

Tying a model's output to external truth — a retrieved document, a tool result, a sensor reading — rather than relying on what the weights happen to encode. The opposite failure mode is hallucination.

Guardrail

A check, filter, or constraint placed around a model's input or output to prevent specific failure modes (PII leakage, unsafe code, off-topic answers). Distinct from alignment: a guardrail intercepts behaviour; alignment changes it.

H

Hallucination

An LLM output that is fluent and confident but factually wrong — invented citations, non-existent APIs, plausible-looking but fabricated quotes. Some practitioners prefer "confabulation" as more accurate to the mechanism (the model is not perceiving anything).

Harness

The surrounding code that turns a raw model into a working agent: the loop, the tool-call dispatcher, the context manager, the retry logic, the permission system. The model is the engine; the harness is the rest of the car. Different harnesses on the same model can produce wildly different behaviour.

HITL (Human in the Loop)

A workflow design where a human reviews, approves, or corrects the model's output before it has effect. Often required for high-stakes actions (sending money, deleting files, posting publicly).

I

Inference

Running a trained model to produce output, as opposed to training it. Most user-facing costs (latency, dollars) are inference costs.

Instruction Tuning

A fine-tuning stage in which a pre-trained base model is taught to follow natural-language instructions rather than just continue text. Turns a "completion" model into a "chat" model.

J

Jailbreak

An input crafted to make a model violate the rules its trainers or operators imposed — produce disallowed content, ignore its system prompt, leak credentials. Related to but distinct from prompt injection, where the attacker is not the model's user but content the model is reading.

K

KV Cache

Key-Value cache. The intermediate attention state kept around during generation so the model doesn't have to recompute earlier tokens for each new one. Memory-intensive: a major reason long contexts are expensive to serve.

L

Latency

Wall-clock time from request to response. Usually broken into "time to first token" (how long before the user sees anything) and "tokens per second" (how fast the rest streams). Both matter; UX guidance differs by use case.

LLM

Large Language Model. The current dominant model family — autoregressive transformers trained on massive text corpora, scaled to billions or trillions of parameters. "Large" is relative and keeps shifting.

LoRA

Low-Rank Adaptation. A fine-tuning technique that learns small additive matrices on top of a frozen base model, instead of updating every parameter. Cheap to train, cheap to ship (the adapter is megabytes, not gigabytes), composable.

M

MCP (Model Context Protocol)

An open protocol for connecting LLM harnesses to external tools, resources, and prompts via a uniform server interface. Introduced by Anthropic in late 2024 and now implemented across multiple model vendors and editor integrations. Aims to standardise the "USB-C of tool use."

MoE (Mixture of Experts)

An architecture where many specialised sub-networks ("experts") share the load, with a router picking which few to activate for each token. Total parameters can be huge while compute per token stays moderate. Common in large frontier models.

Multimodal

A model that handles more than one type of input or output — text, images, audio, video. The boundaries are fuzzy; "vision-language" is a common subset.

N

N-shot

Umbrella term for prompting with N worked examples. Zero-shot, one-shot, few-shot, many-shot — same idea, different N.

O

Open weights

A model whose trained parameters are publicly downloadable. Not the same as open source: the training code, training data, and full reproduction recipe are usually still closed. "Open source AI" remains contested terminology — the OSI's definition is recent and not universally accepted.

Orchestration

Coordinating multiple model calls, tools, or agents into a larger workflow. The job a framework like LangChain, LlamaIndex, or a custom harness does.

P

Parameter

A single learned number inside a neural network. Modern LLMs have billions to trillions of them. Parameter count is a rough proxy for capability and a precise proxy for memory footprint.

Pre-training

The initial, expensive training stage on enormous unlabelled corpora, before any task-specific tuning. What produces a foundation model. Costs are in the millions to billions of dollars at frontier scale.

Prompt

The text (and sometimes images, tool definitions, etc.) given to a model as input. Usually split into a system prompt (set by the operator) and user messages (set by the end user).

Prompt injection

An attack in which untrusted content the model reads (an email, a webpage, a tool result) contains instructions that hijack its behaviour. Distinct from jailbreak because the victim isn't typing the attack — they're just asking the agent to summarise something.

Q

Quantization

Storing model weights at lower numerical precision (e.g. 4-bit integers instead of 16-bit floats) to shrink memory and speed up inference, accepting some loss in quality. Common levels: Q8, Q5, Q4, Q3, Q2. Lower bits, smaller file, dumber model.

R

RAG

Retrieval-Augmented Generation. At query time, fetch relevant documents from an external store (usually via embedding similarity), splice them into the context, and let the model answer with them in view. The most common way to give an LLM access to information it wasn't trained on.

ReAct

Reason + Act. A prompting pattern that interleaves the model's reasoning steps with tool calls, observed results, and follow-up reasoning. The conceptual ancestor of most current agent loops. Unrelated to the JavaScript library.

Reasoning model

A model trained to spend more inference compute on internal deliberation (typically by generating long, often hidden, chains of thought) before answering. Better on multi-step problems; slower and pricier per response.

RLHF

Reinforcement Learning from Human Feedback. A fine-tuning method in which human raters compare model outputs, a reward model is trained on their preferences, and the LLM is then optimised against that reward model. The technique that made GPT-3.5/4-class models pleasant to talk to.

S

SFT

Supervised Fine-Tuning. Training a model on (input, desired output) pairs. Usually the step that comes before RLHF in the post-training pipeline.

Subagent

An agent spawned by another agent to handle a delimited sub-task, typically with its own context window and its own tool set. Returns a result to the parent. A way to parallelise work and to keep the parent's context from bloating.

System prompt

The instructions the operator (not the end user) gives the model to set its role, constraints, and tone. Usually placed before the user's first message and treated by the model as higher-priority than user input — though not infinitely so; see prompt injection.

T

Temperature

A sampling parameter that controls how random the model's next-token choice is. 0 means "always pick the most likely token" (deterministic-ish); higher values flatten the distribution. Common values: 0 for code and structured output, 0.7–1.0 for prose.

Token

The atomic unit a language model reads and writes — typically a sub-word fragment, somewhere between a character and a word. "Hello, world!" is roughly 4 tokens. Pricing, context windows, and latency are all measured in tokens, not words.

Tool

Anything the model can call from inside its loop: a shell command, an HTTP API, a database query, another model. The model proposes a tool call; the harness runs it; the result returns to the model. "Tool use" is what turns an LLM into an agent.

Transformer

The neural-network architecture introduced in the 2017 paper "Attention Is All You Need." Replaced recurrent networks for most language tasks and underpins essentially every current LLM. The "T" in GPT.

U

Underfit / Overfit

A model is underfit when it hasn't learned enough — high error on both training and test data. Overfit when it has memorised the training data without generalising — low error on training, high error on new examples. The whole training process is a fight to land between the two.

V

Vector database

A database optimised for nearest-neighbour search over high-dimensional embeddings. The storage layer under most RAG systems. Examples: pgvector, Pinecone, Qdrant, Weaviate, Milvus.

VLM

Vision-Language Model. A model that takes images (or video) as input alongside text. A subset of multimodal.

W

Weights

The trained parameters of a neural network — the file you download when you "download a model." See open weights for the licensing nuance.

Workflow

A fixed sequence of model calls and tool calls, scripted ahead of time. Distinct from an agent, which decides its own next step. The line is blurry — many "agents" are mostly workflows with occasional branches; many "workflows" wrap a single agent step.

X

X-risk

Short for existential risk. In AI discourse, the concern that sufficiently powerful systems could cause permanent, civilisation-scale harm. The seriousness assigned to it varies by speaker more than by evidence.

Z

Zero-shot

Asking a model to perform a task with no worked examples, only the instruction. Contrast with few-shot. Most everyday chat use is zero-shot.