Skip to main content

Platform Overview

Agent Red Customer Experience automates customer support for e-commerce businesses using a pipeline of six specialized AI agents. Each agent handles a distinct responsibility — from classifying the customer's intent to generating a response and validating it for safety — so that routine conversations are resolved accurately with minimal human intervention.

Agent Red admin dashboard

The agent pipeline

When a customer sends a message, it flows through the agent pipeline:

  1. Intent Classification — Determines what the customer needs (order status, return request, product question, and others) across 18 total intent categories (17 customer-facing plus an admin-assistance route for admin-authenticated traffic) using GPT-4o-mini.
  2. Knowledge Retrieval — Searches your product catalog, FAQ database, and policy documents using hybrid semantic vector + keyword search to find relevant context.
  3. Response Generation — Composes a natural-language reply using the retrieved context and conversation history, personalized to your brand voice.
  4. Critic / Supervisor — Validates the response for factual accuracy, policy compliance, and content safety before it reaches the customer.
  5. Escalation Detection — Evaluates whether the conversation requires human attention (angry customer, complex issue, VIP account) and routes accordingly.
  6. Analytics — Records conversation metrics for quality monitoring, reporting, and continuous improvement.

Architecture

Agent Red runs as a unified API Gateway on Azure Container Apps (East US) with native auto-scaling. The six AI agents run in-process within the gateway, communicating via synchronous HTTP endpoints. Customer data is stored in Cosmos DB (Serverless) with tenant-level partition isolation. A dedicated Customer Memory layer stores customer profiles and vectorized conversation transcripts — enabling the response generator to personalize replies based on each customer's interaction history.

ComponentTechnology
Agent runtimeAzure Container Apps (native auto-scaling)
Agent communicationHTTP (in-process, synchronous pipeline)
DatabaseAzure Cosmos DB (Serverless, DiskANN vector index)
CacheAzure Cache for Redis
AI modelsAzure OpenAI Service (GPT-4o, GPT-4o-mini)
Embeddingstext-embedding-3-large
SecretsAzure Key Vault (Managed Identity)
Customer MemoryCosmos DB (profiles + vectorized conversation transcripts)
MonitoringOpenTelemetry (Application Insights)

Design targets

These are the performance targets Agent Red is designed to achieve:

MetricTarget
Response latency (P95)< 2 seconds
Uptime SLA (Enterprise)99.95%
Concurrent tenants at launch680
Rate limit (all tiers)300 requests per minute per tenant

Next steps

  • How It Works — Deep dive into the six-agent pipeline, communication protocols, and data flow.
  • Initial Setup — What you need to get Agent Red running for your store.

© 2026 Remaker Digital, a DBA of VanDusen & Palmeter, LLC. All rights reserved.

Was this page helpful?