Foundation Model

A very large, general-purpose model trained on broad data. The base layer that specific applications build on.

Explained simply.

Foundation models are the big ones: GPT-5, Claude Opus, Gemini, Llama. They cost hundreds of millions to train. They're designed to be broadly capable - handling writing, code, math, reasoning, and more. Everything downstream (fine-tuned models, agents, RAG pipelines, ChatGPT itself) is built on top of a foundation model. The term 'foundation' is literal: without them, nothing else in the AI product layer exists.

An example.

Claude Sonnet is a foundation model. When you use Claude.ai, or an app built with the Claude API, you're hitting that same foundation model underneath. Cursor, Perplexity, and thousands of products are just interfaces on top of foundation models they don't own.

Why it matters.

Picking the right foundation model is the single biggest product decision you'll make. The rest - prompts, tools, UI - can be rebuilt. The model choice determines what's even possible.

Foundation Model

Explained simply.

An example.

Why it matters.

Related terms

Further reading

Watch