tools

one api for everything.

ai models, video rendering, email, search, project management. 250+ tools. built-in, connected, or composed.

three sources of tools

built-in: 250+ apps
connected: MCP servers
composed: flows → tools

one api: same interface for everything
image: FLUX · SDXL · Recraft · Midjourney
video: Seedance · Veo · Remotion
text: Claude · Gemini · GPT
audio: Whisper · TTS · Bark
communication: Gmail · Slack · Twitter
dev tools: Linear · GitHub · Notion
search: Tavily · Exa
code: Sandbox · Browser
javascript
import Inference from 'inference'

const client = new Inference()

const result = await client.run('flux-schnell', {
  prompt: 'a cat on mars'
})
output: 🐱 (generated in 3.2s)

sdks: javascript · python · go · cli

one api

same interface for every tool, built-in or connected: input in, output out. no per-provider SDKs.
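a sketch of that uniform shape, assuming only the run(slug, input) call from the example above. the stub client here just echoes its arguments; a real client would send them to the platform:

```javascript
// stub client for illustration -- a real client POSTs { slug, input }
// to the platform and returns the tool's output
const client = {
  async run(slug, input) {
    return { tool: slug, output: input };
  }
};

async function demo() {
  // generating an image and sending an email use the identical shape
  const image = await client.run('flux-schnell', { prompt: 'a cat on mars' });
  const email = await client.run('gmail-send', { to: 'team@example.com', subject: 'hi' });
  return [image.tool, email.tool];
}
```

the point is the call shape, not the stub: swapping which tool runs changes only the slug and the input object, never the interface.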

byok

bring your own keys. route model runs through fal, google, or your own GPUs. you're not locked in.

durable

every tool call retries on failure, persists state, and tracks execution. built in, not bolted on.

how we compare

replicate · fal · inference.sh
features compared: AI models · non-AI tools · MCP connections · flows → tools · BYOK · durable execution
belt
run any tool from your terminal: belt app run flux-schnell. one command, same api.

frequently asked questions

what is inference.sh?

a platform with 250+ tools, including AI models, dev tools, and integrations, callable through one API. connect more via MCP servers, or compose tools into new tools with flows.

how is inference.sh different from Replicate?

Replicate has AI models. inference.sh has AI models plus video rendering, email, search, project management, and MCP servers. all composable. plus BYOK: bring your own keys to route through Fal, Google, or your own GPUs.

what is MCP?

Model Context Protocol, an open standard for connecting tools to AI agents. browse MCP servers on inference.sh, or use inference.sh as an MCP server from Claude Code or Cursor.

do I need to build agents to use tools?

no. call any tool with a single HTTP request. agents and skills are separate products on the same platform. use what you need.
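as a hedged sketch of what that single HTTP request could look like: the endpoint path, auth header, and payload field names below are assumptions for illustration, not the documented API.

```javascript
// hypothetical URL and payload shape -- consult the real API reference before use
const req = new Request('https://api.inference.sh/v1/run', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer YOUR_API_KEY',
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    tool: 'flux-schnell',
    input: { prompt: 'a cat on mars' }
  })
});
// `await fetch(req)` would then return the tool's output
```

no SDK, agent, or flow is required: any HTTP client that can send a POST with a JSON body can call a tool.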

what is BYOK?

bring your own keys. route model runs through Fal, Google, or your own GPUs. you're not locked in to any single compute provider.

ready to ship?

start with the hosted platform. deploy your own when you're ready.
