Production-grade agentic e-commerce CX platform — AI agents handle product discovery, cart management, checkout, order tracking, and support via natural language + Generative UI components.
| Principle | Description | Implementation |
|---|---|---|
| AGENT-FIRST | Every user action is a conversation turn, not page navigation | LangGraph supervisor routes 14 intent types to specialized agents |
| FAIL-SAFE | Every write requires confirmation, every tool is idempotent | Idempotency keys in Redis, human-in-the-loop for refunds |
| STATELESS COMPUTE | Container Apps scale to zero, state in PostgreSQL + Redis only | Azure Container Apps, Prisma migrations, Redis checkpoints |
| OBSERVABLE BY DEFAULT | Every agent turn traced in Langfuse | Per-span tracing, LLM-as-Judge scoring, RAGAS metrics |
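The FAIL-SAFE principle is easiest to see in code. Below is a minimal sketch of the idempotency-key pattern, using a dict-like store standing in for Redis (the real system uses `SET NX` with a TTL so all replicas share replay protection); the function names are illustrative:

```python
import hashlib
import json

def idempotent_write(store, request, write_fn):
    """Derive a stable key from the request payload and execute the write
    at most once. `store` is dict-like here; production uses Redis SET NX
    with a TTL so every replica sees the same replay protection."""
    digest = hashlib.sha256(
        json.dumps(request, sort_keys=True).encode()
    ).hexdigest()
    key = f"idem:{digest}"
    if key in store:
        return store[key]   # replay: return the first result, skip the write
    result = write_fn()     # e.g. charge a card, create an order
    store[key] = result
    return result
```

Duplicate submissions (client retries, double clicks) return the original result instead of executing the write twice.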
┌─────────────────────────────────────────────────────────────────────────────┐
│ CLIENT LAYER │
│ ┌─────────────────────────────────────────────────────────────────────┐ │
│ │ apps/web/ → Next.js 15 Chat-First Canvas (no navigation tabs) │ │
│ │ - Shell using 100dvh CSS Grid (Rail + Chat Column) │ │
│ │ - Virtualized anchor scroll for high-performance streaming │ │
│ │ - Generative UI components (ProductGrid, CartCanvas, OrderTimeline)│ │
│ └─────────────────────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────────────────┐
│ API GATEWAY LAYER │
│ ┌──────────────────┐ ┌──────────────────┐ ┌──────────────────────────┐ │
│ │ /api/chat │ │ /api/agent │ │ /api/copilotkit │ │
│ │ (OpenAI SDK) │ │ (LangGraph) │ │ (GenUI actions) │ │
│ └──────────────────┘ └──────────────────┘ └──────────────────────────┘ │
└─────────────────────────────────────────────────────────────────────────────┘
│
┌───────────────┴───────────────┐
▼ ▼
┌─────────────────────────────┐ ┌─────────────────────────────────────────┐
│ apps/commerce-api/ │ │ apps/agent-core/ │
│ Hono + Bun │ │ FastAPI + Python │
│ GraphQL Yoga │ │ LangGraph agents │
│ MCP HTTP endpoints │◄────│ GraphQL tool → commerce-api │
│ Prisma JS │ │ asyncpg (notifications only) │
└─────────────────────────────┘ └─────────────────────────────────────────┘
│ │
└───────────────┬───────────────┘
▼
┌─────────────────────────────────────────────────────────────────────────────┐
│ SHARED LAYER │
│ ┌─────────────────────────────────────────────────────────────────────┐ │
│ │ prisma/ → Single schema.prisma (Prisma JS only) │ │
│ │ packages/types/ → Shared TS interfaces │ │
│ │ packages/errors/ → CommerceError (TS + Python mirror) │ │
│ └─────────────────────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────────────────┐
│ DATA LAYER │
│ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ ┌─────────────────┐ │
│ │ PostgreSQL │ │ Redis 7 │ │ Langfuse │ │ Azure AI │ │
│ │ 16 + pgvector│ │ (Cache) │ │ (Tracing) │ │ Foundry │ │
│ └─────────────┘ └─────────────┘ └─────────────┘ └─────────────────┘ │
└─────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────┐
│ RESILIENCE PATTERNS │
├──────────────────────────────────────────────────────────────┤
│ ┌────────────────┐ ┌────────────────┐ ┌────────────┐ │
│ │ Circuit Breaker│ │ Retry (3x) │ │ Fallback │ │
│ │ (Redis state) │ │ Exponential │ │ (Keyword │ │
│ │ │ │ Backoff │ │ classify) │ │
│ └────────────────┘ └────────────────┘ └────────────┘ │
│ │
│ ┌────────────────┐ ┌────────────────┐ ┌────────────┐ │
│ │ Idempotency │ │ Timeout │ │ Dead Letter│ │
│ │ (Redis keys) │ │ (30s LLM) │ │ Queue │ │
│ └────────────────┘ └────────────────┘ └────────────┘ │
└──────────────────────────────────────────────────────────────┘
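The circuit breaker above can be sketched as a small state machine. In the real system the state lives in Redis so all replicas share it; this sketch keeps it in memory with an injectable clock so the transitions are easy to follow, and the threshold and cooldown values are illustrative:

```python
import time

class CircuitBreaker:
    """Minimal CLOSED -> OPEN -> HALF_OPEN state machine. Production keeps
    this state in Redis; here it is in-memory with an injectable clock."""

    def __init__(self, failure_threshold=5, cooldown_s=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.cooldown_s = cooldown_s
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means CLOSED

    def allow(self):
        if self.opened_at is None:
            return True  # CLOSED: pass traffic through
        if self.clock() - self.opened_at >= self.cooldown_s:
            return True  # HALF_OPEN: let one probe request through
        return False     # OPEN: fail fast, fall back to keyword classify

    def record_success(self):
        self.failures = 0
        self.opened_at = None

    def record_failure(self):
        self.failures += 1
        if self.failures >= self.failure_threshold:
            self.opened_at = self.clock()
```

When the breaker is OPEN, callers skip the LLM entirely and use the keyword-classify fallback from the diagram.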
┌──────────────────────────────────────────────────────────────┐
│ ASYNC PROCESSING FLOW │
├──────────────────────────────────────────────────────────────┤
│ │
│ User Request → API Route → LangGraph Graph │
│ │ │ │ │
│ │ │ ├─→ classify_intent │
│ │ │ ├─→ generate_tool_calls │
│ │ │ ├─→ tools (parallel) │
│ │ │ └─→ generate_response │
│ │ │ │
│ │ └─→ SSE Stream ←───────────────────────│
│ │ │
│ └─→ GenUI Component Render │
│ │
│ Background Jobs (Celery): │
│ - Email notifications │
│ - Inventory sync │
│ - Vector embedding generation │
└──────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────┐
│ POSTGRESQL SCHEMA │
├──────────────────────────────────────────────────────────────┤
│ │
│ ┌──────────────┐ ┌──────────────┐ ┌────────────┐│
│ │ Customer │◄─────│ Order │─────►│ OrderItem ││
│ │ - id │ │ - id │ │ - id ││
│ │ - email │ │ - customerId │ │ - orderId ││
│ │ - name │ │ - total │ │ - productId││
│ └──────────────┘ │ - status │ │ - quantity ││
│ │ │ - embeddings │ └────────────┘│
│ │ └──────────────┘ │
│ │ │ │
│ ▼ ▼ │
│ ┌──────────────┐ ┌──────────────┐ ┌────────────┐│
│ │ SupportTicket│ │ Product │ │ Cart ││
│ │ - id │ │ - id │ │ - id ││
│ │ - customerId │ │ - name │ │ - customerId││
│ │ - status │ │ - price │ │ - items ││
│ │ - sentiment │ │ - embeddings │ │ - total ││
│ └──────────────┘ │ - stock │ └────────────┘│
│ └──────────────┘ │
│ │
│ Vector Tables (pgvector 1536-dim): │
│ - product_embeddings (HNSW index) │
│ - document_chunks (GIN + vector) │
└──────────────────────────────────────────────────────────────┘
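In practice these vector queries go through commerce-api (Prisma); the parameterized SQL below, with table and column names assumed from the diagram, shows what the HNSW-backed search amounts to. `<=>` is pgvector's cosine-distance operator, and `ORDER BY embedding <=> $1` becomes an approximate index scan when an HNSW index exists on the column:

```python
def product_knn_query(limit: int = 12) -> str:
    """Build a parameterized k-NN query over product_embeddings (names
    assumed from the schema diagram). $1 is the 1536-dim query vector."""
    limit = int(limit)
    assert 1 <= limit <= 100, "keep candidate sets small (HNSW candidate limit)"
    return (
        "SELECT product_id, embedding <=> $1 AS distance "
        "FROM product_embeddings "
        f"ORDER BY embedding <=> $1 LIMIT {limit}"
    )
```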
┌──────────────────────────────────────────────────────────────────────────┐
│ CONVERSATION TURN FLOW │
├──────────────────────────────────────────────────────────────────────────┤
│ │
│ 1. User Message │
│ │ │
│ ▼ │
│ 2. classify_intent (LLM) ──→ Intent: product_search │
│ │ Entities: ["wireless headphones"] │
│ │ Sentiment: neutral │
│ ▼ │
│ 3. shouldUseTools? ──→ YES │
│ │ │
│ ▼ │
│ 4. generate_tool_calls ──→ [catalog.search(query="wireless headphones")]│
│ │ │
│ ▼ │
│ 5. tools (ToolNode) ──→ Prisma query → 12 products │
│ │ Redis cache set (5min) │
│ ▼ │
│ 6. generate_response (LLM) ──→ "Found 12 wireless headphones..." │
│ │ + GenUI: <ProductGrid items={12}/> │
│ ▼ │
│ 7. Stream via SSE ──→ Client renders message + component │
│ │ │
│ ▼ │
│ 8. Langfuse Trace ──→ Span: classify (234ms) │
│ Span: tools (89ms) │
│ Span: generate (1.2s) │
│ Score: faithfulness=0.92 │
│ │
└──────────────────────────────────────────────────────────────────────────┘
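The turn above can be sketched as a plain-Python pipeline. The real orchestration is a LangGraph graph; here the LLM nodes and the tool registry are injected callables, and all names are illustrative:

```python
def run_turn(message, classify, generate_tool_calls, tools, generate_response):
    """Mirror of the graph nodes: classify -> (optional) tool calls -> respond.
    Each argument is a callable stand-in for an LLM node or a tool registry."""
    intent = classify(message)                    # step 2: intent + entities
    if not intent["use_tools"]:                   # step 3: shouldUseTools?
        return generate_response(message, intent, [])
    calls = generate_tool_calls(message, intent)  # step 4: tool call plan
    results = [tools[c["tool"]](**c["args"]) for c in calls]  # step 5: execute
    return generate_response(message, intent, results)        # step 6: respond
```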
┌──────────────────────────────────────────────────────────────┐
│ HORIZONTAL SCALING APPROACH │
├──────────────────────────────────────────────────────────────┤
│ │
│ Tier Component Scale Strategy │
│ ───────────────────────────────────────────────────────── │
│ Frontend Next.js Vercel (auto-scale) │
│ API Gateway Hono Azure Container Apps │
│ Agent Core FastAPI Azure Container Apps │
│ Database PostgreSQL Azure DB (read replicas) │
│ Cache Redis Azure Cache (cluster) │
│ Vector pgvector Horizontal partitioning │
│ LLM Azure AI Rate limit + queue │
│ │
│ Bottleneck Mitigation: │
│ - LLM calls: Semantic cache (Redis, 5min TTL) │
│ - DB queries: Connection pool (PgBouncer) │
│ - Vector search: HNSW index + candidate limit │
│ - Agent state: Redis checkpoints (shared) │
└──────────────────────────────────────────────────────────────┘
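The semantic cache in the bottleneck list embeds the incoming query and reuses a cached answer when a previous query is close enough. A sketch with an injected embedding function and an in-memory store (Redis with a 5-minute TTL in the real system; the 0.92 similarity threshold is illustrative):

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    """Embedding-similarity cache sketch. `embed` is a toy stand-in for
    text-embedding-3-small; entries live in a list instead of Redis."""

    def __init__(self, embed, threshold=0.92):
        self.embed = embed
        self.threshold = threshold
        self.entries = []  # (vector, cached response)

    def get(self, query):
        qv = self.embed(query)
        best, best_sim = None, 0.0
        for vec, resp in self.entries:
            sim = cosine(qv, vec)
            if sim > best_sim:
                best, best_sim = resp, sim
        return best if best_sim >= self.threshold else None

    def set(self, query, response):
        self.entries.append((self.embed(query), response))
```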
┌──────────────────────────────────────────────────────────────┐
│ SECURITY LAYERS │
├──────────────────────────────────────────────────────────────┤
│ │
│ ┌────────────────────────────────────────────────────────┐ │
│ │ Layer 1: Authentication │ │
│ │ - Custom JWT (HS256, 1h expiry) │ │
│ │ - Refresh tokens (7d, Redis blacklist) │ │
│ │ - OAuth 2.0 (Google, GitHub) │ │
│ └────────────────────────────────────────────────────────┘ │
│ │ │
│ ▼ │
│ ┌────────────────────────────────────────────────────────┐ │
│ │ Layer 2: Authorization │ │
│ │ - Role-based (customer, merchant, support, admin) │ │
│ │ - Resource-level (userId checks on every write) │ │
│ │ - Idempotency keys (prevent replay) │ │
│ └────────────────────────────────────────────────────────┘ │
│ │ │
│ ▼ │
│ ┌────────────────────────────────────────────────────────┐ │
│ │ Layer 3: Input Validation │ │
│ │ - Zod schemas (frontend) │ │
│ │ - Pydantic models (backend) │ │
│ │ - SQL injection prevention (Prisma params) │ │
│ └────────────────────────────────────────────────────────┘ │
│ │ │
│ ▼ │
│ ┌────────────────────────────────────────────────────────┐ │
│ │ Layer 4: Content Safety │ │
│ │ - PII detection (email, phone, CC, SSN) │ │
│ │ - Toxicity filtering │ │
│ │ - Jailbreak prevention │ │
│ └────────────────────────────────────────────────────────┘ │
│ │ │
│ ▼ │
│ ┌────────────────────────────────────────────────────────┐ │
│ │ Layer 5: Infrastructure │ │
│ │ - CORS (whitelisted origins) │ │
│ │ - Rate limiting (Redis, sliding window) │ │
│ │ - CSP headers │ │
│ │ - TLS 1.3 (enforced) │ │
│ └────────────────────────────────────────────────────────┘ │
└──────────────────────────────────────────────────────────────┘
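Layer 5's rate limiting can be sketched as a sliding window. Production keeps per-user timestamps in a Redis sorted set; this sketch uses an in-memory deque and an injectable clock, with illustrative limits:

```python
from collections import deque
import time

class SlidingWindowLimiter:
    """Allow at most `limit` requests per `window_s` seconds per user."""

    def __init__(self, limit=100, window_s=60.0, clock=time.monotonic):
        self.limit = limit
        self.window_s = window_s
        self.clock = clock
        self.hits = {}  # user_id -> deque of request timestamps

    def allow(self, user_id):
        now = self.clock()
        q = self.hits.setdefault(user_id, deque())
        while q and now - q[0] >= self.window_s:
            q.popleft()        # drop hits that fell out of the window
        if len(q) >= self.limit:
            return False       # over budget: reject (HTTP 429)
        q.append(now)
        return True
```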
| Tool | Version | Purpose |
|---|---|---|
| Node.js | 20+ | TypeScript runtime |
| pnpm | 8+ | Package manager |
| Bun | 1.0+ | Hono runtime |
| Python | 3.12+ | LangGraph agents |
| uv | 0.1+ | Python package manager |
| Docker | 24+ | Infrastructure containers |
# Clone repository
git clone https://github.com/your-org/vercel-ai-sdk.git
cd vercel-ai-sdk
# Install dependencies
pnpm install
# Install Python dependencies (agent-core)
cd apps/agent-core && uv sync && cd ../..

Create `.env.local` at project root:
# ── Database ─────────────────────────────────────────────────
DATABASE_URL=postgresql://postgres:postgres@localhost:5432/smart_commerce
# ── Redis ────────────────────────────────────────────────────
REDIS_URL=redis://localhost:6379
# ── Authentication ───────────────────────────────────────────
JWT_SECRET=your-super-secret-jwt-key-min-32-chars
# ── Service URLs ─────────────────────────────────────────────
AGENT_CORE_URL=http://localhost:8000
COMMERCE_API_URL=http://localhost:3001
# ── LLM Provider (Azure AI Foundry) ─────────────────────────
# Swap providers by changing these — zero code changes
OPENAI_BASE_URL=https://your-resource.openai.azure.com/openai/v1
OPENAI_API_KEY=your-azure-api-key
OPENAI_MODEL=gpt-oss-120b
OPENAI_API_VERSION=2024-10-21
# ── Embeddings ───────────────────────────────────────────────
EMBEDDING_MODEL=text-embedding-3-small
# ── Langfuse (Observability) ────────────────────────────────
LANGFUSE_PUBLIC_KEY=pk-...
LANGFUSE_SECRET_KEY=sk-...
LANGFUSE_BASE_URL=http://localhost:3003
# ── Langfuse Docker (optional) ──────────────────────────────
LANGFUSE_NEXTAUTH_SECRET=dev-secret

Why Sequential? To keep the development machine from overheating and to limit RAM usage, containers are started one at a time, tested, then kept running. Only one container runs during setup.
# Start PostgreSQL first (required for all services)
make start-postgres
# Test PostgreSQL (starts → tests → stops)
make test-postgres
# Expected: PostgreSQL 16.x
# Start Redis (with PostgreSQL still running)
make start-redis
# Test Redis (starts → tests → stops)
make test-redis
# Expected: PONG
# Start Langfuse (optional — high memory usage, creates DB first)
make start-langfuse
# Test Langfuse (starts → tests → stops)
make test-langfuse
# Expected: Langfuse health check passed

# Run migrations (requires PostgreSQL running)
make db-migrate
# Seed database (20 products)
make db-seed
# Open Prisma Studio (optional)
make db-studio

Local development (recommended for coding):
# Terminal 1: Commerce API
make dev-api
# Terminal 2: Agent Core
make dev-agent
# Terminal 3: Web Frontend
make dev-web

# Check service health (all must be running)
make health
# Expected output:
# ✅ commerce-api :3001
# ✅ agent-core :8000
# Run agent briefing (full system status)
make agent-briefing

To keep the development machine from overheating, containers are started one at a time, tested, then stopped before starting the next.
make start-postgres
# Or: ./scripts/start-postgres.sh
# Test PostgreSQL
make test-postgres
# Runs: SELECT version(); then stops container
# Keep PostgreSQL running for development:
# (Don't run stop-postgres yet)

make start-redis
# Or: ./scripts/start-redis.sh
# Test Redis
make test-redis
# Runs: redis-cli ping; then stops container
# Keep Redis running for development

make start-langfuse
# This will:
# 1. Start PostgreSQL temporarily
# 2. Create langfuse database
# 3. Stop PostgreSQL
# 4. Start Langfuse
# Test Langfuse
make test-langfuse

# Morning: Start all needed containers (sequentially)
make start-postgres
make start-redis
# (Optional) make start-langfuse
# Start Next.js app
pnpm dev
# Check individual container health
make health-postgres
make health-redis
make health-langfuse
# Evening: Stop containers one by one (or all at once)
make stop-postgres
make stop-redis
make stop-langfuse
# Or: make clean-all (stops all)

# This will:
# 1. Start PostgreSQL → Test → Stop
# 2. Start Redis → Test → Stop
# 3. Start Langfuse → Test → Stop
make test-all-sequential

| Command | Description |
|---|---|
| `make start-postgres` | Start PostgreSQL + pgvector |
| `make stop-postgres` | Stop and remove PostgreSQL |
| `make test-postgres` | Start → Test `version()` → Stop |
| `make start-redis` | Start Redis |
| `make stop-redis` | Stop and remove Redis |
| `make test-redis` | Start → Test `ping` → Stop |
| `make start-langfuse` | Start Langfuse (creates DB first) |
| `make stop-langfuse` | Stop and remove Langfuse |
| `make test-langfuse` | Start → Test health → Stop |
| `make clean-all` | Stop all containers |
| `make health` | Check health of all (shows which are running) |
Only ONE container runs at a time:
| Container | RAM Usage |
|---|---|
| PostgreSQL | ~500MB |
| Redis | ~50MB |
| Langfuse | ~800MB |
Typical development setup:

- PostgreSQL + Redis running: ~550MB RAM
- Add Langfuse (optional): ~1.35GB RAM total

To minimize heating:

- Only run containers you need
- Stop containers when not coding: `make clean-all`
- Use `make test-all-sequential` for testing (runs one at a time)
# Infrastructure (Sequential)
make start-postgres # Start PostgreSQL + pgvector
make stop-postgres # Stop PostgreSQL
make start-redis # Start Redis
make stop-redis # Stop Redis
make start-langfuse # Start Langfuse (creates DB)
make stop-langfuse # Stop Langfuse
make clean-all # Stop all containers
# Testing (Sequential, One at a Time)
make test-postgres # Start → Test → Stop
make test-redis # Start → Test → Stop
make test-langfuse # Start → Test → Stop
make test-all-sequential # Run all tests sequentially
# Development
make dev-web # Start Next.js frontend (:3000)
make dev-api # Start Hono commerce-api (:3001)
make dev-agent # Start FastAPI agent-core (:8000)
# Full Test Suite
make test-all # Run all unit tests
make e2e-test # Run E2E tests (requires all services)
make test-web # Web tests only
make test-api # API tests only
make test-agent # Agent tests only
# Database
make db-migrate # Prisma migrate dev
make db-seed # Seed database
make db-reset # Reset + seed
make db-studio # Prisma Studio GUI
# Health Checks
make health # Check all services
make health-postgres # Check PostgreSQL
make health-redis # Check Redis
make health-langfuse # Check Langfuse
# Cleanup
make clean # Remove __pycache__, .next
make prune             # Docker system prune

              ┌─────────────┐
│ E2E │ ← Playwright (Full flows)
│ Tests │ Target: 10+ scenarios
└──────┬──────┘
│
┌───────────┴───────────┐
│ Integration Tests │ ← Real Docker containers
│ (API + Tool) │ Target: 50+ tests
└───────────┬───────────┘
│
┌───────────────────────┴───────────────────────┐
│ Unit Tests │ ← Pure functions
│ (Vitest + pytest + Mocks) │ Target: 200+ tests
└───────────────────────────────────────────────┘
Purpose: Test pure functions, tools, and utilities in isolation.
| Framework | Location | Command | Coverage |
|---|---|---|---|
| Vitest | `apps/web/tests/unit/` | `pnpm vitest run` | TypeScript |
| pytest | `apps/agent-core/tests/` | `pytest` | Python |
Test Categories:
- Agent State (`state.test.ts`): Reducer logic, state transitions
- Intent Classification (`classify.test.ts`): 14 intent types, entity extraction
- MCP Tools (`server.test.ts`): Auth, rate limiting, validation
- RAG Core (`semantic-chunker.test.ts`, `reranker.test.ts`): Chunking, scoring
- Guardrails (`guardrails.test.ts`): PII, toxicity, jailbreak detection
Target Metrics:
- ✅ >90% code coverage on new code
- ✅ Zero `any` types in TypeScript tests
- ✅ All tests pass with `mypy --strict` and `tsc --noEmit`
Purpose: Test API endpoints, database operations, and tool execution with real infrastructure.
| Framework | Location | Command | Infrastructure |
|---|---|---|---|
| Vitest | `apps/web/tests/integration/` | `pnpm vitest run tests/integration/` | Docker postgres, redis |
| pytest | `apps/agent-core/tests/integration/` | `pytest tests/integration/` | Docker postgres, redis |
Test Scenarios:
- Database CRUD operations (Prisma)
- Redis caching and checkpoints
- GraphQL queries and mutations
- MCP tool execution with auth
- Vector search with pgvector
Target Metrics:
- ✅ All tests pass with real Docker containers
- ✅ No mocked database calls
- ✅ Idempotency verified (duplicate requests handled)
Purpose: Test full user flows from frontend to database.
| Framework | Location | Command | Output |
|---|---|---|---|
| Playwright | `apps/web/tests/e2e/` | `pnpm test:e2e` | Screenshots, videos |
Test Scenarios:
- Product Discovery: Search → View results → Add to cart
- Cart Management: Add → Update quantity → Remove → Clear
- Checkout Flow: Start checkout → Payment → Order confirmation
- Order Tracking: View orders → Track status
- Support Ticket: Create ticket → Agent response → Resolution
Target Metrics:
- ✅ All critical paths covered
- ✅ Video recording on failure
- ✅ Visual regression testing
Purpose: Evaluate agent response quality, tool selection accuracy, and context retention.
| Metric | Target | Measurement |
|---|---|---|
| Tool Selection Accuracy | >90% | Correct tool chosen for intent |
| Parameter Extraction | >95% | Entities extracted correctly |
| Context Retention | >90% | Previous turns remembered |
| Hallucination Rate | 0% | No fabricated information |
| Response Relevance | >70% | RAGAS relevancy score |
| Faithfulness | >75% | RAGAS faithfulness score |
Evaluation Pipeline:
User Query → Agent Response → LLM-as-Judge → Score (0-1)
↓
Langfuse Dashboard
Tools:
- Langfuse: Per-span tracing, scoring dashboards
- RAGAS: Industry-standard RAG metrics
- LLM-as-Judge: Custom evaluation prompts
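A minimal sketch of the LLM-as-Judge step, assuming `judge_llm` is any prompt-to-text callable (the actual grading prompts and parsing live in the evaluation scripts); parsing is deliberately strict so a malformed judge reply fails loudly instead of recording a bogus score:

```python
def judge_faithfulness(judge_llm, question, context, answer):
    """Ask a grading model for a 0-1 faithfulness score and validate it."""
    prompt = (
        "Rate from 0.0 to 1.0 how faithful the ANSWER is to the CONTEXT.\n"
        f"QUESTION: {question}\nCONTEXT: {context}\nANSWER: {answer}\n"
        "Reply with only the number."
    )
    score = float(judge_llm(prompt).strip())
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"judge returned out-of-range score: {score}")
    return score  # pushed to Langfuse as a trace score
```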
# Full test suite
make test-all
# With coverage
pnpm vitest run --coverage
pytest --cov=apps/agent-core
# Specific test file
pnpm vitest run tests/unit/rag/semantic-chunker.test.ts
pytest apps/agent-core/tests/test_classify.py -v
# E2E tests (requires all services running)
make e2e-test
# LLM evaluation script
python scripts/llm_eval.py

Per our Gatekeeper Protocol, all test claims must include evidence:
# Example: Running unit tests
$ pnpm vitest run tests/unit/
RUN v1.6.0 /home/aparna/Desktop/vercel-ai-sdk
✓ tests/unit/agents/state.test.ts (7)
✓ tests/unit/agents/classify.test.ts (11)
✓ tests/unit/agents/supervisor.test.ts (10)
✓ tests/unit/mcp/server.test.ts (17)
✓ tests/unit/rag/semantic-chunker.test.ts (22)
✓ tests/unit/rag/reranker.test.ts (15)
✓ tests/unit/guardrails.test.ts (24)
Test Files 7 passed (7)
Tests 106 passed (106)
Start at 10:30:45
Duration 2.34s (transform: 890ms, setup: 120ms, collect: 1.8s, tests: 234ms)
✅ 106/106 tests passing (100%)Before deploying to production, verify all items below:
- PostgreSQL 16 + pgvector deployed (Azure DB or Neon)
- Redis 7 cluster configured (Azure Cache or Upstash)
- Container Apps configured (Azure or Fly.io)
- Connection pooling enabled (PgBouncer)
- Health checks passing (all services)
- Auto-scaling rules defined (CPU >70%, Memory >80%)
- JWT secrets rotated (min 32 chars, cryptographically random)
- TLS 1.3 enforced (no HTTP fallback)
- CORS whitelist configured (production domains only)
- Rate limiting active (100 req/min per user)
- PII detection enabled (guardrails active)
- SQL injection prevention verified (Prisma params only)
- CSP headers configured
- Security headers (HSTS, X-Frame-Options, X-Content-Type-Options)
- Langfuse connected (traces visible)
- Per-span tracing enabled (classify, tools, generate)
- LLM-as-Judge scoring active
- RAGAS metrics tracked (relevancy, faithfulness)
- Error alerts configured (Slack/PagerDuty)
- Latency dashboards (P50, P95, P99)
- Cost tracking enabled (LLM token usage)
- Agent turn latency P95 ≤ 4s
- Semantic cache hit rate >50%
- Database query time <100ms (P95)
- Vector search <200ms (HNSW index)
- Frontend LCP <2.5s
- Frontend INP <200ms
- Frontend CLS <0.1
- Circuit breaker configured (LLM calls)
- Retry logic active (3x, exponential backoff)
- Fallback classification (keyword-based)
- Idempotency keys enforced (all writes)
- Dead letter queue configured (failed jobs)
- Backup strategy (daily DB snapshots)
- Disaster recovery plan documented
- Unit tests: >90% coverage
- Integration tests: All passing with real infra
- E2E tests: Critical paths covered
- Load tests: 100 concurrent users
- Chaos tests: Service recovery verified
- LLM evals: Tool selection >90%, Hallucination 0%
- API documentation (OpenAPI 3.1)
- Runbook (incident response)
- On-call rotation defined
- Architecture diagrams updated
- Environment variables documented
- Deployment guide (step-by-step)
- Rollback procedure documented
# 1. Build production images (individual, not compose)
docker build -t commerce-api-prod -f apps/commerce-api/Dockerfile .
docker build -t agent-core-prod -f apps/agent-core/Dockerfile .
# 2. Run migrations
pnpm prisma migrate deploy
# 3. Deploy to Azure Container Apps
az containerapp up --name commerce-api --source ./apps/commerce-api
az containerapp up --name agent-core --source ./apps/agent-core
# 4. Verify deployment
curl https://commerce-api.azurewebsites.net/health
curl https://agent-core.azurewebsites.net/health
# 5. Monitor Langfuse dashboard
open https://cloud.langfuse.com

Minute 0-1: System Overview
1. Show README.md architecture diagrams
2. Explain 4 core design principles
3. Highlight tech stack (Next.js, Hono, LangGraph, Azure AI)
Minute 1-2: Local Development
# Show container health (sequential startup)
make health
# Show agent briefing
make agent-briefing
# Show database seeded
make db-studio          # Open Prisma Studio

Minute 2-4: Live Agent Demo
1. Open http://localhost:3000
2. User query: "Find wireless headphones under $200"
3. Show agent response:
- Intent classification (product_search)
- Entity extraction (category: headphones, maxPrice: 200)
- Tool execution (catalog.search)
- GenUI render (<ProductGrid />)
4. Add product to cart via chat
5. Show cart update in real-time
Minute 4-5: Observability
1. Open Langfuse dashboard (http://localhost:3003)
2. Show trace for last conversation turn
3. Highlight:
- classify_intent span (234ms)
- tools span (89ms)
- generate_response span (1.2s)
- LLM-as-Judge score (faithfulness: 0.92)
Minute 5-6: Testing Evidence
# Run unit tests
make test-all
# Show test output
# ✅ 106/106 tests passing (100%)
# Run E2E test
make e2e-test
# Show Playwright report

| Decision | Rationale | Trade-offs |
|---|---|---|
| LangGraph for orchestration | Explicit workflow control, built-in checkpointers, human-in-the-loop | Additional dependency, learning curve |
| Azure AI Foundry only | Production-ready, industry-standard, compliance | No local Ollama fallback |
| pgvector over Qdrant | Single database, simpler ops, familiar SQL | Slightly slower than dedicated vector DB |
| Prisma owns all migrations | Type-safe, single source of truth | No Alembic for Python |
| Agent-core never queries DB | Clean separation, commerce-api owns data | Extra network hop |
| Stripe MCP Toolkit | Official, maintained, secure | Less customization |
| Custom JWT auth | Full control, no vendor lock-in | More code to maintain |
| Idempotency keys on all writes | Prevents duplicate charges, safe retries | Redis dependency |
| Layer | Technology | Purpose |
|---|---|---|
| Frontend | Next.js 15, React 19, shadcn/ui, CopilotKit | Chat-first canvas, GenUI components |
| Commerce API | Hono + Bun, GraphQL Yoga, Prisma JS | Data access layer, MCP endpoints |
| Agent Core | FastAPI + Python, LangGraph, Azure OpenAI | Multi-agent orchestration |
| LLM | Azure AI Foundry (gpt-oss-120b) | Intent classification, response generation |
| Embeddings | text-embedding-3-small (1536-dim) | Semantic search, RAG |
| Database | PostgreSQL 16 + pgvector | Primary data store, vector search |
| Cache | Redis 7 | Session storage, checkpoints, rate limiting |
| Observability | Langfuse | Tracing, scoring, dashboards |
vercel-ai-sdk/
├── apps/
│ ├── web/ # Next.js 15 frontend
│ │ ├── app/ # App Router pages
│ │ ├── lib/ # Shared utilities
│ │ │ ├── agents/ # LangGraph client
│ │ │ ├── mcp/ # MCP tools
│ │ │ ├── rag/ # RAG pipeline
│ │ │ └── observability/ # Langfuse integration
│ │ └── tests/ # Vitest + Playwright
│ ├── commerce-api/ # Hono + Bun backend
│ │ ├── src/
│ │ │ ├── graphql/ # GraphQL schema
│ │ │ ├── mcp/ # MCP endpoints
│ │ │ └── prisma/ # Database client
│ │ └── tests/ # Bun test
│ └── agent-core/ # FastAPI + Python
│ ├── main.py # FastAPI app
│ ├── agents/ # LangGraph graphs
│ ├── tools/ # MCP tools
│ └── tests/ # pytest
├── prisma/
│ ├── schema.prisma # Database schema
│ ├── migrations/ # SQL migrations
│ └── seed.ts # Seed data
├── packages/
│ ├── types/ # Shared TypeScript types
│ └── errors/ # Error classes (TS + Python)
├── docs/
│ ├── ARCHITECTURE.md # Detailed architecture
│ ├── PRD.md # Product requirements
│ └── adr/ # Architecture Decision Records
├── scripts/
│ ├── start-postgres.sh # Start PostgreSQL container
│ ├── stop-postgres.sh # Stop PostgreSQL container
│ ├── start-redis.sh # Start Redis container
│ ├── stop-redis.sh # Stop Redis container
│ └── start-langfuse.sh # Start Langfuse container
├── docker-compose.yml # Infrastructure reference (not used directly)
├── docker-compose.langfuse.yml # Optional Langfuse reference
├── Makefile # Development commands (sequential containers)
└── README.md # This file
- PRD.md - Product Requirements Document
- TASKS.md - Living task board (what's done, what's next)
- ARCHITECTURE.md - Detailed architecture documentation
- TESTING_STRATEGY.md - Comprehensive testing guide
- CLAUDE.md - AI agent coding instructions
- AGENTS.md - Multi-agent system documentation
MIT © 2026 TechTrend Agentic Commerce Platform
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit changes using Conventional Commits (`git commit -m 'feat: add amazing feature'`)
- Test thoroughly (unit + integration + E2E)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
Remember: Show the logs or it didn't happen. All PRs must include test evidence.