tools
any ai model. one api call.
image, video, audio, text, search, 3D — one key, pay per run, zero vendor lock-in.
three sources of tools
    import Inference from '@inference/sdk'

    const client = new Inference()

    const result = await client.run('pruna/flux-dev', {
      prompt: 'a minimal geometric logo, white background'
    })

    console.log(result.image) // → https://cloud.inference.sh/...

pruna/flux-dev · 7.9s · $0.005
one api
same interface for every tool. built-in or connected, same shape: input in, output out. no per-provider SDKs.
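to make the "same shape" idea concrete, here is a minimal sketch of a uniform tool interface. it is not the inference.sh SDK: `ToolRegistry`, `ToolInput`, `ToolOutput`, and the `demo/...` tool names are all hypothetical, and the two tools are local stand-ins for a built-in and a connected tool.

```typescript
// Hypothetical sketch: none of these names come from the inference.sh SDK.
type ToolInput = Record<string, unknown>;
type ToolOutput = Record<string, unknown>;
type Tool = (input: ToolInput) => Promise<ToolOutput>;

class ToolRegistry {
  private tools = new Map<string, Tool>();

  register(name: string, tool: Tool): void {
    this.tools.set(name, tool);
  }

  // Every tool is invoked the same way: a name plus a plain input object.
  async run(name: string, input: ToolInput): Promise<ToolOutput> {
    const tool = this.tools.get(name);
    if (!tool) throw new Error(`unknown tool: ${name}`);
    return tool(input);
  }
}

const registry = new ToolRegistry();

// Stand-in for a "built-in" tool.
registry.register('demo/uppercase', async (input) => ({
  text: String(input['text']).toUpperCase(),
}));

// Stand-in for a "connected" tool; the caller cannot tell the difference.
registry.register('demo/word-count', async (input) => ({
  count: String(input['text']).trim().split(/\s+/).length,
}));
```

calling `registry.run('demo/uppercase', { text: 'hi' })` and `registry.run('demo/word-count', { text: 'one two three' })` goes through the same code path; only the name and the input object change.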
byok
bring your own keys. route model runs through fal, google, or your own GPUs. you're not locked in.
durable
every tool call retries on failure, persists state, and tracks execution. built in, not bolted on.
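as a rough illustration of what "retries on failure" and "tracks execution" can mean, the sketch below wraps any async call in a retry loop with exponential backoff and returns a small run record. `runWithRetry`, `RetryOptions`, and `RunRecord` are hypothetical names, not the platform's API, and the sketch does not cover state persistence; how inference.sh implements durability is not described on this page.

```typescript
// Hypothetical helper, not part of any SDK.
interface RetryOptions {
  attempts: number;    // total tries, including the first
  baseDelayMs: number; // wait before the second try; doubles after each failure
}

interface RunRecord<T> {
  status: 'succeeded' | 'failed';
  tries: number;
  result?: T;
  error?: string;
}

async function runWithRetry<T>(
  call: () => Promise<T>,
  opts: RetryOptions,
): Promise<RunRecord<T>> {
  let lastError = '';
  for (let attempt = 1; attempt <= opts.attempts; attempt++) {
    try {
      const result = await call();
      return { status: 'succeeded', tries: attempt, result };
    } catch (err) {
      // Remember the most recent failure so it can be reported if all tries fail.
      lastError = err instanceof Error ? err.message : String(err);
      if (attempt < opts.attempts) {
        // Exponential backoff: baseDelayMs, then 2x, 4x, ...
        const delay = opts.baseDelayMs * Math.pow(2, attempt - 1);
        await new Promise<void>((resolve) => setTimeout(resolve, delay));
      }
    }
  }
  return { status: 'failed', tries: opts.attempts, error: lastError };
}
```

the returned record carries the final status and the number of tries, which is the minimum a caller needs to log or display an execution trace.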
how we compare
| | replicate | fal | inference.sh |
|---|---|---|---|
| AI models | | | |
| non-AI tools | | | |
| connectors | | | |
| flows → tools | | | |
| BYOK | | | |
| durable execution | | | |
frequently asked questions
an ai runtime where everything compounds. run any model, compose agents, stack knowledge. also includes a skill registry, connectors, and a team workspace.
ready to ship?
start with the hosted platform. deploy your own when you're ready.