Inference Logoinference.sh

MCP Server

Connect Claude Code, Cursor, or any MCP client to inference.sh and use all platform capabilities as tools.


What You Get

When you connect an MCP client to inference.sh, it gets access to:

Tool GroupTools
Appsapp_list, app_get, app_run, task_get, task_list, task_logs, task_cancel
Knowledgeknowledge_list, knowledge_search, knowledge_get, knowledge_create, knowledge_delete
Skillsskill_use, skill_files, skill_view, skill_upload, skill_store, skill_search
MCP Proxymcp_list, mcp_search, mcp_get, mcp_tools, mcp_run, mcp_connect, mcp_disconnect

The MCP proxy tools are especially powerful — they let your coding agent discover, connect to, and call tools on any MCP server through inference.sh, without running those servers locally.


Setup

Claude Code

Add to your MCP settings (~/.claude/settings.json or project .mcp.json):

json
1{2  "mcpServers": {3    "inference": {4      "type": "streamable-http",5      "url": "https://api.inference.sh/mcp",6      "headers": {7        "Authorization": "Bearer inf_your_api_key"8      }9    }10  }11}

Cursor

Add to your Cursor MCP settings:

json
1{2  "mcpServers": {3    "inference": {4      "url": "https://api.inference.sh/mcp",5      "headers": {6        "Authorization": "Bearer inf_your_api_key"7      }8    }9  }10}

Any MCP Client

The server card is at:

code
1GET https://api.inference.sh/.well-known/mcp-server-card
json
1{2  "name": "sh.inference/mcp-server",3  "description": "inference.sh MCP server",4  "remotes": [5    {6      "type": "streamable-http",7      "url": "https://api.inference.sh/mcp",8      "supportedProtocolVersions": ["2025-11-25", "2025-06-18", "2025-03-26"]9    }10  ]11}

Authentication

All requests to the MCP server require a Bearer token:

code
1Authorization: Bearer inf_your_api_key

Get your API key from team settings or via CLI:

bash
1belt login

Example: Running Apps from Claude Code

Once connected, Claude Code can run inference.sh apps directly:

code
1You: Generate an image of a sunset over mountains23Claude: I'll use inference.sh to generate that image.4        [Calling app_run with app="infsh/flux-schnell", input={prompt: "sunset over mountains"}]56        Here's your image: https://files.inference.sh/...

Example: Using Connectors from Claude Code

Your connected MCP servers are available through the proxy tools:

code
1You: Check my Linear backlog for high-priority items23Claude: [Calling mcp_run with slug="linear", tool="list_issues", input={priority: 1}]45        You have 3 high-priority issues:6        - INF-201: Fix auth token refresh7        - INF-198: Billing webhook timeout8        - INF-195: Rate limit on /agents/run

Protocol

The server implements the Model Context Protocol specification:

  • Transport: Streamable HTTP (JSON-RPC 2.0)
  • Endpoint: POST https://api.inference.sh/mcp
  • Supported versions: 2025-11-25, 2025-06-18, 2025-03-26
  • Methods: initialize, tools/list, tools/call

Next

we use cookies

we use cookies to ensure you get the best experience on our website. for more information on how we use cookies, please see our cookie policy.

by clicking "accept", you agree to our use of cookies.
learn more.