blog

insights on building production AI agents with durable execution

all articles

Qwen-Image-2.0: Professional Infographics, Exquisite Photorealism

Alibaba just released Qwen-Image-2.0, and it redefines what image generation models can do with text. This is not another incremental improvement to text rendering - Qwen-Image-2.0 can generate comple...

Mar 3

Seedream 5 Lite: ByteDance's Smartest Image Generator

ByteDance just released Seedream 5.0 Lite, and it represents a significant leap in controllable image generation. This is not an incremental update to the Seedream line - it introduces web-connected r...

Mar 3

Nano Banana 2: Pro Quality at Flash Speed

Google just released Nano Banana 2, internally codenamed Gemini 3.1 Flash Image, and the AI image generation landscape shifted overnight. This is not a minor upgrade. Nano Banana 2 combines the advanc...

Feb 26

Seedance 2.0 Is Coming to inference.sh

ByteDance just dropped Seedance 2.0 and the internet lost its mind. Within hours of launch, clips of Superman fighting Darkseid, Tom Cruise trading punches with John Wick, and Stranger Things fan edit...

Feb 16

Agent Skills: The Open Standard for AI Capabilities

AI agents are increasingly powerful, but they often lack the context and procedural knowledge to do real work reliably. Anthropic recognized this gap and introduced Agent Skills - a simple, open forma...

Feb 2

Introducing ui.inference.sh

Search for "AI chat UI" and you'll find dozens of component libraries. They look promising - sleek message bubbles, typing indicators, file upload buttons. Install one and the reality sets in. These a...

Jan 23

Agent UX Patterns That Work

Users interacting with agents have different needs than users interacting with traditional software. Agents think, which takes time. Agents take actions, which carry consequences. Agents make mistakes...

Jan 7

Agents That Generate UI

The standard agent interface is text in, text out. Users type messages, agents respond with text. This works for many cases but ignores that some information is better conveyed through structured inte...

Jan 7

Client-Side Tools

Most agent tools run on servers. The agent requests an action, the server executes it, results return to the agent. But some operations need to happen where the user is - accessing local files, using ...

Jan 7

Building Custom Apps for Your Agents

Pre-built tools cover the common cases - web search, document processing, image generation, standard API integrations. But every organization has unique systems, proprietary APIs, and domain-specific ...

Jan 7

Workflows vs Agents: When to Use Each

Workflows are predetermined sequences; agents make runtime decisions. The distinction matters because most production AI systems need both. Explore the inference.sh runtime →

Jan 7

Building a Research Agent

Research tasks are among the best applications for AI agents. They involve gathering information from multiple sources, synthesizing findings, and producing structured output - exactly the kind of mul...

blog

all articles

Qwen-Image-2.0: Professional Infographics, Exquisite Photorealism

Seedream 5 Lite: ByteDance's Smartest Image Generator

Nano Banana 2: Pro Quality at Flash Speed

Seedance 2.0 Is Coming to inference.sh

Agent Skills: The Open Standard for AI Capabilities

Introducing ui.inference.sh

Agent UX Patterns That Work

Agents That Generate UI

Client-Side Tools

Building Custom Apps for Your Agents

Workflows vs Agents: When to Use Each

Building a Research Agent

Tool Approval Gates

Sandboxed Code Execution for AI Agents

Real-Time Agent Streaming

Debugging AI Agents in Production

Concurrent Agent Execution

When to Use Multi-Agent Systems

The Real Cost of Agent Infrastructure

Agent Memory That Actually Works

From Demo to Production

The Tool Integration Tax

Built-In Agent Observability

Hierarchical Agent Delegation

Human-in-the-Loop for AI Agents

Durable Execution for AI Agents

Why Agent Runtimes Matter