Agentex Overview¶

Understanding how Agentex works will help you make better decisions when building and deploying agents.

Agentex System Diagram

A Helpful Analogy (Formula 1)¶

Think of Agentex like Formula 1 racing. As an agent developer, you're like an F1 team. You want to build the fastest possible car—that's your agent (on the right). But no team wants to build the track. You just need a high-performance foundation to race on.

The Agentex Server (in the center) is that track. The dotted lines in the diagram demonstrate how the track connects everything—all agents are on the same track, so they can race together and be invoked the same way. Agentex handles all the routing, async execution, and infrastructure that keeps agents reliable at scale.

Because of Agentex, agent developers don't have to (and shouldn't) spend time thinking about communication, streaming, credential management, deployment, hosting, or how to move from simple sync agents (L1-L3) to complex async agents (L4+).

Clients (on the left)—like chat UIs, API callers, or other systems—are the entities that trigger your agents to perform. They send requests through our unified API. The ACP (Agent-to-Client Protocol) is the standardized interface we implemented to ensures every agent communicates the same way, regardless of how it's built internally.

And just like F1 doesn't dictate how teams build their cars, Agentex doesn't dictate how you build agents. Use Scale's SDKs, MCP, OpenAI Agents, LangChain, or any tools you choose. Agentex doesn't replace your tools—it connects them.

The result: you focus on your business logic while Agentex handles everything else.

Focus on Business logic

How Agentex Works¶

In Agentex, agents are "just code". In production, we package up this code as docker images and deploy these agents on Kubernetes using Helm charts (more on this later).

To allow for complete code flexibility, but also make it easy to invoke all agents the same way, we abstract code written by agent developers behind standard entrypoints. We call this abstraction "Agent-to-Client Protocol" (ACP).

Here is an example of how simple this makes it for Agent developers to write arbitrary code behind a standard entrypoint. Here is a Sync Agent's ACP definition:

acp = FastACP.create(acp_type="sync")

@acp.on_message_send
async def handle_message_send(params: SendMessageParams) -> Union[TaskMessageContent, list[TaskMessageContent], AsyncGenerator[TaskMessageUpdate, None]]:
    # Write arbitrary agent logic here
    pass

The developer just needs to implement this function; calling upon any SDK or resources they'd like and returning the type hinted response shapes. The Agentex server handles data persistence and sending messages back to the client so you don't have to, just focus on your internal logic.

This allows all agent callers (web UIs, APIs, other agents, etc.) to call any Agentex agent the exact same way. Callers only need to know these 3 things:

Request payload (standardized by the ACP)
API route (domain of the hosted Agentex Server)
The name or ID of the Agent to call

If you're developing a frontend, we recommend you start with our open source Agentex UI. It takes care of the message sending above across all different types of Agents. It also handles stream aggregation logic, and a bunch of other complexity that makes it hard to "get off the ground" with most agent libraries.

Message Flow¶

sequenceDiagram
    participant Client
    participant Agentex Server
    participant Agent Code

    Client->>Agentex Server: Send Tasks and Messages via unified API
    Agentex Server->>Agent Code: Route request via ACP
    Agent Code->>Agent Code: Process using any logic, models, or tools
    Agent Code-->>Agentex Server: Return response via ACP
    Agentex Server-->>Client: Stream response to client
    Note over Client,Agent Code: All agents expose the same API regardless of implementation

Components¶

Agentex provides three key components, each with their own open source package:

Component	Description	Source
Agentic Infrastructure	Production platform for running agents at scale • Agent-to-Client Protocol (ACP) - Standardized communication • Asynchronous Architecture - Sync and async execution • Real-Time Streaming - Updates to clients • Scalable Deployments - Independent service scaling	scaleapi/scale-agentex/agentex
Agentex Python SDK	Developer toolkit for building agents locally • FastACP - Define handlers with decorators to serve agent code via FastAPI. • SDK - Tools for agent-builders to handle messages, state, tasks, streaming, etc. • CLI - Local development and deployment tools	scaleapi/scale-agentex-python
Agentex UI	Frontend components for chat interfaces • Agentex UI Kit - React components for customer apps • Agentex Web - Hosted UI for local development and demos	scaleapi/scale-agentex/agentex-ui

Design Principles¶

For Developers¶

We believe the most advanced applications are made code-first. With AI advancements increasingly accessible via SDKs, Agentex prioritizes developer flexibility and rapid iteration without sacrificing capability.

Agents are just code - Build with complete code flexibility using any programming patterns
Code is unopinionated and usable with any library - Use Scale's SDKs, MCP, OpenAI Agents, LangChain, or any tools you choose
Local development is fast and easy - Iterate quickly without complex setup or infrastructure
Both simple sync and complex async use cases are supported - From basic chatbots (L1-L3) to advanced autonomous agents (L4+)
All agents can be called with a unified communication protocol - Standardized ACP ensures consistent invocation across all agent types

For Customers¶

Hybrid development creates win-win opportunities for Scale and our partners. Enterprises need the path of least resistance to AI-ify their companies as rapidly as possible. Agentex enables this by opening doors—providing production-grade infrastructure that teams can own and operate independently, rather than creating dependencies on Scale as the sole-developers of enterprise AI.

Deployment process must be acceptable for every enterprise - Production-grade deployment with Kubernetes and Helm charts
Agents are modifiable by customers without Scale assistance - Full code ownership and customization
Agentic Infrastructure can be used self-service without Scale assistance - Open-source and independently deployable
Agentic Infrastructure is well-documented and easy to use for the public - Comprehensive documentation and examples