inference.sh

Workers

Workers are computers that run your tasks.


Two types

Type      Description
Cloud     Managed by inference.sh
Private   Your own hardware

Cloud workers

  • Pay-per-use
  • Auto-scaling
  • No setup required

Good for getting started and variable workloads.


Private workers

  • Run on your hardware
  • Managed through the inference.sh Engine
  • Data stays on your network

Good for data privacy, dedicated resources, or cost control.


How tasks find workers

  1. You run an app
  2. Task goes to the queue
  3. A worker picks it up
  4. Worker runs it and returns results

When you run an app, you can choose whether the task goes to a cloud worker or a private worker.
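
The four steps above can be pictured with a small local simulation. This is only a sketch of the concept, not the inference.sh API: `Task`, `Worker`, `run_app`, and `work_once` are hypothetical names, and keeping one in-memory queue per worker type is an assumption made here to model the cloud vs private choice.

```python
import queue
from dataclasses import dataclass


@dataclass
class Task:
    app: str        # name of the app being run (illustrative)
    payload: dict   # input for the app
    target: str     # "cloud" or "private", chosen when running


class Worker:
    """Hypothetical worker; a real one would run the app on its hardware."""

    def __init__(self, name, kind):
        self.name = name
        self.kind = kind  # "cloud" or "private"

    def run(self, task):
        # Stand-in for real work: double the input and report who ran it.
        return {
            "app": task.app,
            "worker": self.name,
            "output": task.payload["x"] * 2,
        }


def run_app(app, payload, target, queues):
    """Steps 1-2: running an app creates a task and puts it on a queue."""
    task = Task(app, payload, target)
    queues[target].put(task)
    return task


def work_once(worker, queues):
    """Steps 3-4: a worker picks up one task, runs it, and returns the result."""
    task = queues[worker.kind].get_nowait()
    return worker.run(task)


# Assumption: one queue per worker type, so the choice made at run time
# decides which kind of worker can pick the task up.
queues = {"cloud": queue.Queue(), "private": queue.Queue()}

run_app("doubler", {"x": 21}, "private", queues)
result = work_once(Worker("my-server", "private"), queues)
print(result)  # {'app': 'doubler', 'worker': 'my-server', 'output': 42}
```

In this sketch a cloud worker never sees the task, because it was queued for private workers only.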


Engines

An engine manages workers on your hardware.

Install it on your server, and your GPUs become available for tasks.


Next

Now you know the concepts! Let's build.

Creating an Agent
