Inference Logoinference.sh

tools

any ai model. one api call.

image, video, audio, text, search, 3D — one key, pay per run, zero vendor lock-in.

three sources of tools

built-in
250+ apps
connected
connectors
composed
flows → tools
one api
same interface for everything
image
FLUX Dev
Seedream
Recraft
GPT Image
video
Seedance 2
Veo 3
Remotion
Wan 2.7
chat
Claude Opus
Gemini 3
GPT-4o
Grok
audio
ElevenLabs TTS
Kokoro
Dia TTS
social
X / Twitter
Slack
Gmail
dev tools
Linear
GitHub
Notion
search
Tavily
Tavily Extract
utilities
Shell
Browser
Image Resize
javascriptjavascript
import Inference from '@inference/sdk'

const client = new Inference()

const result = await client.run('pruna/flux-dev', {
  prompt: 'a minimal geometric logo, white background'
})

console.log(result.image) // → https://cloud.inference.sh/...
output
Generated image: a minimal geometric logo

pruna/flux-dev · 7.9s · $0.005

sdks: javascript · python · go · cli

one api

same interface for every tool. built-in or connected, same shape: input in, output out. no per-provider SDKs.

byok

bring your own keys. route model runs through fal, google, or your own GPUs. you're not locked in.

durable

every tool call retries on failure, persists state, and tracks execution. built in, not bolted on.

how we compare

replicatefalinference.sh
AI models
non-AI tools
connectors
flows → tools
BYOK
durable execution

frequently asked questions

an ai runtime where everything compounds. run any model, compose agents, stack knowledge. also includes a skill registry, connectors, and a team workspace.

ready to ship?

start with the hosted platform. deploy your own when you're ready.

we use cookies

we use cookies to ensure you get the best experience on our website. for more information on how we use cookies, please see our cookie policy.

by clicking "accept", you agree to our use of cookies.
learn more.