
Configuration

The inf.yml file defines app settings and resource requirements.


Project Structure

```
my-app/
  inf.yml           # Configuration
  inference.py      # App logic
  requirements.txt  # Python packages (pip)
  packages.txt      # System packages (apt), optional
```

Basic structure

```yaml
name: my-app
description: What my app does
category: image
kernel: python-3.11

resources:
  gpu:
    count: 1
    vram: 24    # 24GB (auto-converted to bytes)
    type: any
  ram: 32       # 32GB
```

Fields

| Field | Required | Description |
|---|---|---|
| name | Yes | App identifier (slug format) |
| description | Yes | What it does |
| category | Yes | App category |
| kernel | Yes | Runtime: python-3.10, python-3.11, python-3.12 |
| resources | Yes | Hardware requirements |

Resources

The CLI automatically converts human-friendly values to bytes:

- < 1000 → treated as GB (e.g., 80 = 80GB)
- 1000 to 1 billion → treated as MB (e.g., 80000 = 80GB)
```yaml
resources:
  gpu:
    count: 1        # Number of GPUs
    vram: 24        # 24GB
    type: any       # GPU type
  ram: 32           # 32GB
```
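The conversion rule above can be sketched in Python. This is an illustrative reconstruction, not the CLI's actual code; decimal units (1 GB = 10⁹ bytes) are assumed because the docs equate 80000 MB with 80GB:

```python
def to_bytes(value: int) -> int:
    """Sketch of the CLI's unit heuristic (assumed; decimal units).

    < 1000            -> interpreted as GB
    1000 .. 1 billion -> interpreted as MB
    anything larger   -> taken as bytes
    """
    if value < 1000:
        return value * 10**9   # GB -> bytes
    if value < 10**9:
        return value * 10**6   # MB -> bytes
    return value               # already bytes

# Both spellings of 80GB resolve to the same byte count:
print(to_bytes(80) == to_bytes(80000))  # True
```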

GPU Types

| Value | Description |
|---|---|
| any | Any GPU will work |
| nvidia | Requires NVIDIA GPU |
| amd | Requires AMD GPU |
| apple | Requires Apple Silicon |
| none | No GPU needed (CPU only) |

Note: Currently only NVIDIA CUDA GPUs are supported.

For CPU-only apps:

```yaml
resources:
  gpu:
    count: 0
    type: none
  ram: 4
```

Categories

| Category | Use For |
|---|---|
| image | Image generation, editing |
| video | Video generation, processing |
| audio | Audio generation, TTS |
| text | Text generation |
| chat | Conversational AI |
| 3d | 3D model generation |
| other | Everything else |

Dependencies

Python Packages (requirements.txt)

```
torch>=2.0
transformers
accelerate
```

System Packages (packages.txt)

For apt-installable system dependencies:

```
ffmpeg
libgl1-mesa-glx
```

Base Images

Apps run in containers with these base images:

| Type | Image |
|---|---|
| GPU | docker.inference.sh/gpu:latest-cuda |
| CPU | docker.inference.sh/cpu:latest |

Environment Variables

```yaml
env:
  MODEL_NAME: gpt-4
  MAX_TOKENS: "2000"
  HF_HUB_ENABLE_HF_TRANSFER: "1"
```

Access in code:

```python
import os

model = os.environ["MODEL_NAME"]
```
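Environment values always arrive as strings, so numeric settings like MAX_TOKENS need explicit conversion. A minimal sketch, using `os.environ.get` to fall back to a default when a variable is unset:

```python
import os

# env values are always strings; convert numeric settings explicitly
model = os.environ.get("MODEL_NAME", "gpt-4")
max_tokens = int(os.environ.get("MAX_TOKENS", "2000"))
```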

Secrets and Integrations

Declare required secrets and OAuth integrations:

```yaml
secrets:
  - key: HF_TOKEN
    description: HuggingFace token for gated models
    optional: false

integrations:
  - key: google.sheets
    description: Access to Google Sheets
    optional: true
```

See Secrets and Integrations for details.
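Assuming declared secrets are injected as environment variables at runtime (an assumption — see the Secrets docs for how delivery actually works), a defensive lookup might look like:

```python
import os

def require_secret(key: str) -> str:
    """Fetch a declared secret, assuming it is injected as an env var."""
    value = os.environ.get(key)
    if value is None:
        raise RuntimeError(f"required secret {key} is not configured")
    return value

# HF_TOKEN was declared with optional: false in inf.yml,
# so treat its absence as a configuration error:
# hf_token = require_secret("HF_TOKEN")
```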


Next

Deploying
