tools

one api for everything.

250+ serverless tools. built-in, connected, or composed.

three sources of tools

built-in
250+ apps
connected
MCP servers
composed
flows → tools
one api
same interface for everything
image
FLUX Dev
Seedream
Recraft
GPT Image
video
Seedance 2
Veo 3
Remotion
Wan 2.7
chat
Claude Opus
Gemini 3
GPT-4o
Grok
audio
ElevenLabs TTS
Kokoro
Dia TTS
social
X / Twitter
Slack
Gmail
dev tools
Linear
GitHub
Notion
search
Tavily
Tavily Extract
utilities
Shell
Browser
Image Resize
javascriptjavascript
import Inference from '@inference/sdk'

const client = new Inference()

const result = await client.run('pruna/flux-dev', {
  prompt: 'a minimal geometric logo, white background'
})

console.log(result.image) // → https://cloud.inference.sh/...
output
Generated image: a minimal geometric logo

pruna/flux-dev · 7.9s · $0.005

sdks: javascript · python · go · cli

one api

same interface for every tool. built-in or connected, same shape: input in, output out. no per-provider SDKs.

byok

bring your own keys. route model runs through fal, google, or your own GPUs. you're not locked in.

durable

every tool call retries on failure, persists state, and tracks execution. built in, not bolted on.

how we compare

replicatefalinference.sh
AI models
non-AI tools
MCP connections
flows → tools
BYOK
durable execution

frequently asked questions

what is inference.sh?

a platform with 250+ tools, including AI models, dev tools, and integrations, callable through one API. connect more via MCP servers, or compose tools into new tools with flows.

how is inference.sh different from Replicate?

Replicate has AI models. inference.sh has AI models plus video rendering, email, search, project management, and MCP servers. all composable. plus BYOK: bring your own keys to route through Fal, Google, or your own GPUs.

what is MCP?

Model Context Protocol, an open standard for connecting tools to software. browse MCP servers on inference.sh, or use inference.sh as an MCP server from Claude Code, Cursor, or any MCP client.

can I use tools without building agents?

yes. call any tool with a single HTTP request or compose them into workflows with flows. no agents required.

what is BYOK?

bring your own keys. route model runs through Fal, Google, or your own GPUs. you're not locked in to any single compute provider.

ready to ship?

start with the hosted platform. deploy your own when you're ready.

we use cookies

we use cookies to ensure you get the best experience on our website. for more information on how we use cookies, please see our cookie policy.

by clicking "accept", you agree to our use of cookies.
learn more.