Where Does it Fit?¶

Xians is an Agent Control Plane (ACP) that provides centralized governance, orchestration, and monitoring for your entire fleet of AI agents. It integrates into your architecture as a dedicated Agent Microservice, sitting between your web server and AI agents to coordinate multi-tenant operations, orchestrate business processes, and scale agent workloads across your infrastructure.

Architecture Overview¶

Xians Architecture

The diagram above illustrates how Xians operates as a control plane in a typical microservices architecture:

Your Product UI - Your frontend application where users interact with your product
Your Web Server - Your backend/BFF (Backend for Frontend) layer that handles UI logic
Agent Control Plane (Xians Server) - The centralized platform governing all AI agents
Your Agents - The AI agents registered with and controlled by Xians

Centralized Governance & Orchestration¶

As a control plane, Xians provides a single point of governance and coordination for your entire agent fleet:

Multi-Tenant Governance: Complete tenant isolation with centralized management - deploy agents across tenants from a single control point
Business Process Orchestration: Coordinate complex, long-running workflows with fault tolerance and automatic retries
Unified Visibility: Monitor health, performance, costs, and operations across all agents and tenants from one platform
Lifecycle Management: Register, deploy, version, configure, and decommission agents centrally with template-based rollouts
Knowledge Management: Centralize prompts, configurations, and agent knowledge accessible to both code and UI

Direct Integration Option¶

The Xians Server can use OIDC to connect directly with your web UI in scenarios where a BFF/Web Server layer is not mediating the communication. Developers may use the Xians TypeScript SDK (Optional) to easily integrate with the server's APIs and leverage the control plane's capabilities.

Control Plane Components¶

Xians is built with a modular architecture comprising the following major components:

Xians Components

Xians Server - The core control plane engine that provides APIs for policy enforcement, monitoring, and agent orchestration. Includes both Web and Temporal servers.
Xians UI - A comprehensive management console for governing agents, monitoring operations, viewing metrics, configuring policies, and managing lifecycles across your entire fleet
Xians Lib - A .NET Core class library that agents use to register with the control plane and communicate securely with Xians Server
Xians Agent Studio – An optional, standalone UI that allows you to quickly onboard human users and enable collaboration with AI agents through chat conversations and human-in-the-loop tasks, right out of the box.

Layered Architecture View¶

Understanding where an Agent Control Plane fits in your architecture helps clarify its role and value. The diagram below illustrates Xians' position in a layered view:

Layered Architecture View

The ACP sits between your agents and the external world - mediating all interactions with users, systems, and external services. This architectural position enables the control plane to:

Orchestrate External Interactions:

Route messages between users and agents through conversation APIs
Handle webhook events from external systems to trigger workflows
Coordinate multi-agent collaboration for A2A
Manage integrations with third-party platforms and services

Provide Agent Utilities:

Tenant-scoped data storage for state and memory
Centralized prompt and knowledge management
Scheduling infrastructure for time-based automation
Conversation history with hierarchical organization

Governance & Observability:

Multi-tenant isolation with centralized policy enforcement
Real-time monitoring with logs, metrics, and distributed tracing
Cost tracking across agents and tenants
Audit trails for compliance and debugging

Infrastructure Abstraction:

Distributed networking and load balancing across agent workers
Fault tolerance with automatic retries and state recovery
Horizontal scalability with dynamic worker pools
Network isolation (agents run in private subnets with no incoming ports)

By handling these cross-cutting concerns, the control plane allows your agents to focus purely on business logic while the platform manages multi-tenancy, orchestration, scalability, monitoring, and infrastructure complexity.

Built on Temporal for Durable Orchestration¶

Xians leverages the industry-proven open source Temporal workflow server as its orchestration foundation. This enables the control plane to reliably manage business processes and agent workflows at scale:

Distributed Execution - Agents run as independent worker processes focused on business logic, while the control plane handles orchestration and coordination
Fault Tolerance - Built-in timeout and retry policies ensure resilient execution even when agents fail or infrastructure restarts
Long-running Processes - Orchestrate durable, multi-step business workflows that can span minutes, hours, days, or even years with state managed reliably
Scheduled Workflows - Time-based automation with cron expressions, intervals, and calendar schedules for recurring business processes
Horizontal Scalability - Simply add more agent worker containers and watch Xians automatically distribute the workload
Audit Trail - Every workflow execution is recorded with complete history for debugging and compliance tracking

Scalable & Secure Worker Architecture¶

Agent workers operate in a pull-based architecture that enables massive scalability and secure deployment:

graph LR
    CP[Xians Control Plane<br/>Public Endpoint] -->|Task Queue| TQ[Temporal Queue]

    subgraph Private Subnet - No Incoming Ports
        TQ -.->|Pull Tasks| W1[Agent Worker 1]
        TQ -.->|Pull Tasks| W2[Agent Worker 2]
        TQ -.->|Pull Tasks| W3[Agent Worker N]
    end

    style CP fill:#538cfc,stroke:#333,stroke-width:2px,color:#fff
    style TQ fill:#41c18a,stroke:#333,stroke-width:2px,color:#fff
    style W1 fill:#eea52d,stroke:#333,stroke-width:2px,color:#1b1f2f
    style W2 fill:#eea52d,stroke:#333,stroke-width:2px,color:#1b1f2f
    style W3 fill:#eea52d,stroke:#333,stroke-width:2px,color:#1b1f2f

Key benefits:

Subnet Isolation: Agent workers run in private subnets with no incoming ports - they only make outbound connections to pull tasks
Horizontal Scaling: Add or remove worker containers dynamically based on load - Temporal automatically balances work distribution
Network Security: Workers never accept inbound connections, reducing attack surface and simplifying firewall rules
Multi-Tenant Isolation: Different worker pools can serve different tenants or security zones
Auto-Discovery: Workers automatically register with the control plane when they start - no manual configuration needed

Asynchronous Processing & Scalability¶

The control plane manages agent interactions through a fully asynchronous message queue architecture, ensuring massive scalability and resilience:

sequenceDiagram
    participant External as External System
    participant XServer as Xians Control Plane
    participant Queue as Temporal Queue
    participant Agent as Agent Worker (Xians Lib)

    External->>XServer: HTTP Request (e.g., Webhook)
    Note over XServer: Route & Queue
    XServer->>Queue: Enqueue Task
    Note over Queue: Work Distribution
    Queue->>Agent: Pull Task (Outbound Only)
    Note over Agent: Execute Business Logic
    Agent->>XServer: Return Response
    Note over XServer: Log & Metrics
    XServer->>External: HTTP Response

Even when you send a synchronous HTTP webhook request, the control plane processes it asynchronously using queues with configurable timeouts. This architecture enables:

Massive Scalability - Handle high loads by distributing work across hundreds or thousands of agent workers
Stateless Execution - Each task is processed independently, making workers easy to scale horizontally without shared state
Resilient Operations - Automatic retries, timeouts, and failure handling ensure reliable execution even during outages
Load Balancing - Temporal automatically distributes tasks to available workers based on capacity
Complete Observability - Every interaction is logged with distributed tracing for debugging and performance analysis