MCP transports

📖 4 min read · Updated 2026-04-18

"Transport" just means how the client and server talk to each other. Are they on the same computer passing messages through a pipe, or on different machines passing messages over the internet? MCP supports three ways to connect, and the right choice depends entirely on where your server runs and who's using it. This page walks through all three.

Why there are three, not one.

If MCP had only one transport, it would be limited to one kind of server. In practice, MCP servers come in very different shapes:

Some run as little programs on your own laptop (a Git MCP, a filesystem MCP).
Some run on a company's own servers and are shared by a whole team (an internal knowledge-base MCP).
Some run as cloud APIs that anyone can plug into (the official Notion or GitHub MCPs).

Each situation has different needs. A laptop-local server doesn't need HTTPS or authentication; talking over a simple pipe is fastest. A shared team server needs to handle many clients at once, which means real network infrastructure. The three transports, stdio, SSE, and HTTP, map to those three worlds.

[Figure: three transports, three use cases]

1. stdio. Same-machine, fastest, simplest.

"stdio" stands for standard input/output. It's the oldest and most boring way to make two programs talk: one launches the other as a child process and sends messages through a pipe. No network, no ports, no HTTPS. Just two programs on the same computer passing newline-delimited JSON back and forth.

How it feels: you add a server to your client's config with a command (like npx @notionhq/notion-mcp-server), and the client spawns that command each time it starts. The server process dies when the client does.
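To make the pipe idea concrete, here is a minimal sketch in Python using only the standard library. The child program is a hypothetical stand-in for an MCP server: it echoes the method name back and skips the initialization handshake a real server expects. The function name call_over_stdio is also made up for illustration. What it does show is the mechanics: the client writes one JSON message per line to the child's standard input and reads one line back from its standard output.

```python
import json
import subprocess
import sys

# Hypothetical stand-in for an MCP server: reads one JSON-RPC request per line
# on stdin and writes one JSON-RPC response per line on stdout.
CHILD_SOURCE = r'''
import json, sys
for line in sys.stdin:
    request = json.loads(line)
    response = {"jsonrpc": "2.0", "id": request["id"],
                "result": {"echo": request["method"]}}
    sys.stdout.write(json.dumps(response) + "\n")
    sys.stdout.flush()
'''


def call_over_stdio(method: str, request_id: int = 1) -> dict:
    """Spawn the child, send one request down the pipe, read one response back."""
    child = subprocess.Popen(
        [sys.executable, "-c", CHILD_SOURCE],
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        text=True,
    )
    try:
        request = {"jsonrpc": "2.0", "id": request_id, "method": method}
        child.stdin.write(json.dumps(request) + "\n")  # newline-delimited JSON
        child.stdin.flush()
        return json.loads(child.stdout.readline())
    finally:
        child.stdin.close()  # EOF on stdin ends the child's read loop
        child.wait()         # the server process goes away with the client


if __name__ == "__main__":
    print(call_over_stdio("tools/list"))
```

Notice there is no port, no URL, and no auth anywhere in that code. The operating system's pipe is the whole transport.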

Use stdio when: the server needs access to your local file system, your local git repo, or any resource on your machine. Also: if you're doing private work and don't want data leaving the device.

What you trade off: it only works when the client and server are on the same machine. No sharing with teammates, no remote deployment. If your laptop is the server, your laptop has to be running. But within that constraint, it's the fastest transport and has the least operational overhead. This is what Claude Code uses by default for most MCPs.

2. SSE. Server-Sent Events. Remote with streaming.

SSE is a well-established web standard where the server keeps a long-lived HTTP connection open and streams events back to the client. It long predates today's AI tooling. MCP uses it for remote servers that want streaming behavior, for example, a server that returns partial results as they come in.

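How it feels: the server is a URL somewhere on the internet. The client connects once, authenticates (usually OAuth), and keeps the connection open for the duration of the session. Server messages stream down that connection; SSE itself is one-way, so the client sends its own messages as ordinary HTTP POST requests.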
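What actually travels down that open connection is plain text in the text/event-stream format: "event:" and "data:" lines, with a blank line marking the end of each event. The sketch below parses that format by hand, assuming the lines have already been read off the connection. The sample payloads are illustrative, not captured from a real server, and parse_sse is a hypothetical helper name.

```python
import json
from typing import Iterable, Iterator, Tuple


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event_name, data) pairs from text/event-stream lines."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":                  # blank line: the event is complete
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):      # comment line, often used as keep-alive
            continue
        else:
            field, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]       # a single leading space is dropped
            if field == "event":
                event = value
            elif field == "data":
                data.append(value)
            # other fields (id, retry) are ignored in this sketch


# Illustrative stream: one progress notification, then the final result.
SAMPLE_STREAM = [
    ": keep-alive\n",
    "event: message\n",
    'data: {"jsonrpc": "2.0", "method": "notifications/progress", '
    '"params": {"progressToken": "t1", "progress": 1}}\n',
    "\n",
    "event: message\n",
    'data: {"jsonrpc": "2.0", "id": 1, "result": {"done": true}}\n',
    "\n",
]

if __name__ == "__main__":
    for name, payload in parse_sse(SAMPLE_STREAM):
        print(name, json.loads(payload))
```

In practice an SSE client library handles this parsing for you; the point is only that each streamed chunk is a small, self-delimiting text record.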

Use SSE when: multiple people need to share one server (a team knowledge base, a company's internal tools MCP), or when the server does work that's nice to stream back incrementally (long-running searches, data generation, reports).

What you trade off: operational cost. You need a real server somewhere, TLS certificates, auth handling, and a process that stays alive. Not hard, but not nothing. Also, long-lived connections don't play well with serverless platforms like AWS Lambda, which are designed for short request-response cycles.

3. HTTP. Plain request/response. Boring and reliable.

Plain stateless HTTP. Client sends a JSON-RPC request, server sends a JSON-RPC response. That's the whole story. No streaming and no long-lived connection; each call is independent.

How it feels: the server is an endpoint. The client calls it over HTTPS the same way your browser calls any API. Each tool call is one request. Done.
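Here is a rough sketch of what "each tool call is one request" looks like on the server side: a pure function that turns one JSON-RPC request body into one response body. The TOOLS registry and the add tool are hypothetical, and the HTTP layer that would wrap this function is left out on purpose. Any web framework or serverless handler could call it.

```python
import json

# Hypothetical tool registry; a real server would expose its own tools.
TOOLS = {"add": lambda a, b: a + b}


def error(request_id, code: int, message: str) -> dict:
    """Build a JSON-RPC 2.0 error response."""
    return {"jsonrpc": "2.0", "id": request_id,
            "error": {"code": code, "message": message}}


def handle_request(body: bytes) -> bytes:
    """Turn one JSON-RPC request body into one JSON-RPC response body.

    Nothing is remembered between calls, so any server instance can
    answer any request.
    """
    request = json.loads(body)
    request_id = request.get("id")
    params = request.get("params", {})

    if request.get("method") != "tools/call":
        response = error(request_id, -32601, "Method not found")
    elif params.get("name") not in TOOLS:
        response = error(request_id, -32602, "Unknown tool")
    else:
        value = TOOLS[params["name"]](**params.get("arguments", {}))
        response = {
            "jsonrpc": "2.0",
            "id": request_id,
            "result": {"content": [{"type": "text", "text": str(value)}]},
        }
    return json.dumps(response).encode("utf-8")


if __name__ == "__main__":
    call = {"jsonrpc": "2.0", "id": 1, "method": "tools/call",
            "params": {"name": "add", "arguments": {"a": 2, "b": 3}}}
    print(handle_request(json.dumps(call).encode("utf-8")).decode("utf-8"))
```

Because the function keeps no state, it drops into a Lambda-style handler without changes, which is exactly why this transport suits serverless hosting.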

Use HTTP when: you're hosting on something serverless (AWS Lambda, Cloudflare Workers, Vercel Functions), when your server is simple enough to not need streaming, or when you just want the dumbest-possible deployment story. This is also the right choice for public APIs that many unrelated users will hit: stateless requests can be spread across as many server instances as you like, while every SSE client holds a connection open on one particular instance.

What you trade off: responsiveness on long operations. If a tool takes 30 seconds to produce a result, HTTP makes the client wait for the whole thing with no feedback. SSE would stream partial progress. For most tools this doesn't matter; for a few (long reports, bulk imports) it does.

Pros and cons, side by side.

[Figure: which transport wins on what]

A decision tree you can actually follow.

1. Is the server just for you, running on your laptop? → stdio. Done. Move on.
2. Is the server hosted in the cloud, serving other people too? → go to 3.
3. Do you need streaming results (long tool calls, progressive output)? → SSE. Otherwise → 4.
4. Is it on a serverless platform (Lambda, Workers)? → HTTP.
5. Default for "hosted, request-response" → HTTP. Easier to build and operate than SSE. Start here; upgrade to SSE only if you discover you need streaming.
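If it helps to see the tree as code, here is one way to write it down. The function and parameter names are made up for this sketch, and it follows the steps in the order given, so a streaming requirement is checked before the serverless question.

```python
from typing import Tuple


def choose_transport(
    local_only: bool,
    needs_streaming: bool = False,
    serverless: bool = False,
) -> Tuple[str, str]:
    """Return (transport, reason), following the decision tree step by step."""
    if local_only:
        return "stdio", "server runs on your own machine"
    if needs_streaming:
        return "sse", "hosted server with progressive output"
    if serverless:
        return "http", "serverless platforms suit short request/response calls"
    return "http", "default for hosted request/response servers"


if __name__ == "__main__":
    print(choose_transport(local_only=True))
    print(choose_transport(local_only=False, needs_streaming=True))
    print(choose_transport(local_only=False, serverless=True))
```

The last two branches both land on HTTP, which matches the advice to start there and move to SSE only once streaming turns out to matter.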

Can you mix them? Yes, and you should.

A single client can talk to many servers at once, each over a different transport. A realistic Claude Code setup might have:

A local filesystem MCP over stdio (fast access to your files)
A local git MCP over stdio (fast repo operations)
A remote Notion MCP over HTTP (hosted by Notion)
A remote team-docs MCP over SSE (your company's internal server, streaming search results)

The client manages all four transparently. Your agent can call tools from any of them in the same conversation. The transport is completely invisible to the model, which is as it should be. Pick the right one per server, and the rest takes care of itself.
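As a sketch of what that four-server setup could look like in a client config, the snippet below builds one in Python and prints it as JSON. Treat the details as assumptions: the mcpServers layout and the type, command, args, and url keys follow a convention several MCP clients use, but your client's documentation is the authority. Both URLs are placeholders, and the launch commands may differ for the servers you actually install.

```python
import json

# Assumed config layout; key names and values are illustrative, not authoritative.
config = {
    "mcpServers": {
        # Local servers: the client spawns these commands and talks over a pipe.
        "filesystem": {
            "type": "stdio",
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/project"],
        },
        "git": {
            "type": "stdio",
            "command": "uvx",
            "args": ["mcp-server-git"],
        },
        # Remote servers: the client only needs a URL (placeholders here).
        "notion": {"type": "http", "url": "https://notion-mcp.example.com/mcp"},
        "team-docs": {"type": "sse", "url": "https://docs-mcp.example.internal/sse"},
    }
}


def transports_in_use(cfg: dict) -> list:
    """List the distinct transports this config relies on, sorted by name."""
    return sorted({server["type"] for server in cfg["mcpServers"].values()})


if __name__ == "__main__":
    print(json.dumps(config, indent=2))
    print(transports_in_use(config))
```

Each entry picks its transport independently, so swapping one server from SSE to HTTP later means editing one entry and nothing else.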