feat(openspec): propose phase-5-scale-ecosystem change

6 workstreams, 119 tasks. Scale & Ecosystem:

- WS1: Rust SDK
- WS2: Agent-to-Agent (A2A) Authorization
- WS3: Advanced Analytics Dashboard
- WS4: Public API Gateway & Rate Limiting SaaS
- WS5: Developer Experience (DX) improvements
- WS6: AGNTCY Compliance Certification Package

Awaiting CEO approval to begin implementation.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-02 15:33:08 +00:00
parent 831e91c467
commit 389a764e8d
10 changed files with 2230 additions and 0 deletions
--- a/openspec/changes/phase-5-scale-ecosystem/specs/a2a-authorization/spec.md
+++ b/openspec/changes/phase-5-scale-ecosystem/specs/a2a-authorization/spec.md
@@ -0,0 +1,254 @@
+## WS2: Agent-to-Agent (A2A) Authorization
+
+### Purpose
+
+Enable AI agents to delegate authority to other AI agents via verifiable, auditable, revocable delegation chains. This is a first-class authorization primitive aligned with the AGNTCY multi-agent orchestration model: an orchestrator agent issues sub-tasks to worker agents and must grant those workers scoped authority to act on its behalf.
+
+A delegation chain works as follows: Agent A (the delegator) issues a delegation token that grants Agent B (the delegatee) a subset of A's own scopes for a bounded time period. Agent B presents this token to prove its delegated authority. Each chain is stored in PostgreSQL, signed with HMAC-SHA256, and recorded in the existing audit log.
+
+### New Endpoints
+
+#### `POST /oauth2/token/delegate`
+
+**Summary:** Delegate authority from one agent to another.
+
+**Authentication:** Bearer token (the delegating agent's access token).
+
+**Request Body** (`application/json`):
+```json
+{
+  "delegateeAgentId": "string",
+  "scopes": ["string"],
+  "ttlSeconds": 3600
+}
+```
+
+| Field | Type | Required | Constraints |
+|---|---|---|---|
+| `delegateeAgentId` | string | yes | Must be an existing, active agent in the same tenant |
+| `scopes` | string[] | yes | Min 1 item. Every requested scope must be one of the delegator's own scopes |
+| `ttlSeconds` | integer | yes | Min: 60, Max: 86400 (24 hours) |
+
+**Response 201** (`application/json`):
+```json
+{
+  "delegationToken": "string",
+  "chainId": "string (UUID)",
+  "delegatorAgentId": "string",
+  "delegateeAgentId": "string",
+  "scopes": ["string"],
+  "expiresAt": "string (ISO 8601)"
+}
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 400 | `INVALID_SCOPES` | Requested scopes exceed delegator's own scopes |
+| 400 | `INVALID_TTL` | `ttlSeconds` outside allowed range [60, 86400] |
+| 401 | `UNAUTHORIZED` | Missing or invalid Bearer token |
+| 404 | `AGENT_NOT_FOUND` | `delegateeAgentId` does not exist or is in a different tenant |
+| 422 | `SELF_DELEGATION` | Delegator and delegatee are the same agent |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
+**Business Rules:**
+- Delegated scopes MUST be a subset of the delegator's own scopes: every requested scope must already be held by the delegator (no privilege escalation)
+- The delegatee must be an active agent in the same tenant as the delegator
+- An agent may not delegate to itself
+- A delegation entry is written to `delegation_chains` and an audit log entry is created with `event_type: "delegation.created"`
+
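The request-level rules above can be sketched as a pure validation function. This is a minimal illustration, not the planned `DelegationService` code: the names `DelegationInput` and `validateDelegation` are hypothetical, the order of the checks is an assumption, and the tenant lookup behind `AGENT_NOT_FOUND` is left out because it needs database access. "Subset" is read here as "every requested scope is held by the delegator".

```typescript
// Sketch only: names are hypothetical; limits and error codes come from the spec above.
type DelegationError = "SELF_DELEGATION" | "INVALID_TTL" | "INVALID_SCOPES";

interface DelegationInput {
  delegatorAgentId: string;
  delegatorScopes: string[]; // scopes carried by the delegator's access token
  delegateeAgentId: string;
  scopes: string[]; // scopes requested for the delegatee
  ttlSeconds: number;
}

const MIN_TTL_SECONDS = 60;
const MAX_TTL_SECONDS = 86400;

// Returns the first violated rule, or null when the request passes all three checks.
function validateDelegation(input: DelegationInput): DelegationError | null {
  if (input.delegatorAgentId === input.delegateeAgentId) {
    return "SELF_DELEGATION";
  }
  const ttl = input.ttlSeconds;
  if (!Number.isInteger(ttl) || ttl < MIN_TTL_SECONDS || ttl > MAX_TTL_SECONDS) {
    return "INVALID_TTL";
  }
  const held = new Set(input.delegatorScopes);
  if (input.scopes.length === 0 || !input.scopes.every((s) => held.has(s))) {
    return "INVALID_SCOPES";
  }
  return null;
}
```

Keeping the checks free of I/O makes them easy to unit test separately from the database insert and audit log write.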
+---
+
+#### `POST /oauth2/token/verify-delegation`
+
+**Summary:** Verify a delegation token and return the delegation chain details.
+
+**Authentication:** Bearer token (any authenticated agent in the same tenant, or unauthenticated if `A2A_PUBLIC_VERIFY=true`).
+
+**Request Body** (`application/json`):
+```json
+{
+  "delegationToken": "string"
+}
+```
+
+| Field | Type | Required | Constraints |
+|---|---|---|---|
+| `delegationToken` | string | yes | The `delegationToken` value returned by `POST /oauth2/token/delegate` |
+
+**Response 200** (`application/json`):
+```json
+{
+  "valid": true,
+  "chainId": "string (UUID)",
+  "delegatorAgentId": "string",
+  "delegateeAgentId": "string",
+  "scopes": ["string"],
+  "issuedAt": "string (ISO 8601)",
+  "expiresAt": "string (ISO 8601)",
+  "revokedAt": null
+}
+```
+
+**Response when delegation is expired or revoked** (HTTP 200, not 4xx — the token exists but is not valid):
+```json
+{
+  "valid": false,
+  "chainId": "string (UUID)",
+  "delegatorAgentId": "string",
+  "delegateeAgentId": "string",
+  "scopes": ["string"],
+  "issuedAt": "string (ISO 8601)",
+  "expiresAt": "string (ISO 8601)",
+  "revokedAt": "string (ISO 8601) | null"
+}
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 400 | `MALFORMED_TOKEN` | Token is not a valid delegation token format |
+| 401 | `UNAUTHORIZED` | Missing Bearer token (when `A2A_PUBLIC_VERIFY=false`) |
+| 404 | `CHAIN_NOT_FOUND` | No delegation chain found for the given token |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
+**Business Rules:**
+- Expired delegations return `valid: false` — not an error response
+- Revoked delegations return `valid: false` with `revokedAt` populated
+- Verification is non-destructive (does not consume or modify the delegation)
+- An audit log entry is created with `event_type: "delegation.verified"` on every call
+
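How the `valid` flag follows from the stored timestamps can be sketched as below. The function names are hypothetical, and giving revocation precedence over expiry is an assumption; the spec only says that both cases yield `valid: false`. The `invalid` metric label (for example a signature mismatch) is not covered here.

```typescript
// Sketch only: classifies a stored chain for the verify-delegation response.
type VerificationOutcome = "valid" | "expired" | "revoked";

interface ChainTimestamps {
  expiresAt: Date;
  revokedAt: Date | null;
}

// Assumption: a revoked chain is reported as "revoked" even if it has also expired.
function classifyChain(chain: ChainTimestamps, now: Date): VerificationOutcome {
  if (chain.revokedAt !== null) return "revoked";
  if (chain.expiresAt.getTime() <= now.getTime()) return "expired";
  return "valid";
}

// The response body's `valid` flag is true only for the "valid" outcome.
function toValidFlag(outcome: VerificationOutcome): boolean {
  return outcome === "valid";
}
```

The same outcome string could feed the `result` label of the verification counter, so the response and the metric never disagree.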
+---
+
+#### `DELETE /oauth2/token/delegate/:chainId`
+
+**Summary:** Revoke a delegation chain. Only the delegator agent can revoke.
+
+**Authentication:** Bearer token (must be the delegator agent's token).
+
+**Path Parameter:**
+| Parameter | Type | Description |
+|---|---|---|
+| `chainId` | string (UUID) | The chain ID returned at delegation creation |
+
+**Response 204:** No body.
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 401 | `UNAUTHORIZED` | Missing or invalid Bearer token |
+| 403 | `FORBIDDEN` | Authenticated agent is not the delegator of this chain |
+| 404 | `CHAIN_NOT_FOUND` | No delegation chain with this ID |
+| 409 | `ALREADY_REVOKED` | Delegation chain has already been revoked |
+
+**Business Rules:**
+- Sets `revoked_at` timestamp on the `delegation_chains` row
+- Audit log entry created with `event_type: "delegation.revoked"`
+- Revoking a parent chain does NOT cascade-revoke child chains — each link must be revoked explicitly
+
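The mapping from chain state to the revoke endpoint's status codes might look like the following sketch. `ChainRecord` and `decideRevocation` are hypothetical names, and the check order (404, then 403, then 409) is an assumption the spec does not fix.

```typescript
// Sketch only: decides the HTTP outcome of DELETE /oauth2/token/delegate/:chainId.
interface ChainRecord {
  id: string;
  delegatorAgentId: string;
  revokedAt: Date | null;
}

type RevokeDecision =
  | { status: 204 }
  | { status: 403; code: "FORBIDDEN" }
  | { status: 404; code: "CHAIN_NOT_FOUND" }
  | { status: 409; code: "ALREADY_REVOKED" };

function decideRevocation(
  chain: ChainRecord | undefined,
  requestingAgentId: string
): RevokeDecision {
  if (chain === undefined) return { status: 404, code: "CHAIN_NOT_FOUND" };
  if (chain.delegatorAgentId !== requestingAgentId) {
    return { status: 403, code: "FORBIDDEN" };
  }
  if (chain.revokedAt !== null) return { status: 409, code: "ALREADY_REVOKED" };
  return { status: 204 }; // caller then sets revoked_at and writes the audit entry
}
```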
+---
+
+### Database Schema Changes
+
+#### Migration: `008_add_delegation_chains.sql`
+
+```sql
+CREATE TABLE delegation_chains (
+    id              UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    tenant_id       UUID NOT NULL REFERENCES tenants(id) ON DELETE CASCADE,
+    delegator_agent_id UUID NOT NULL REFERENCES agents(id) ON DELETE CASCADE,
+    delegatee_agent_id UUID NOT NULL REFERENCES agents(id) ON DELETE CASCADE,
+    scopes          TEXT[] NOT NULL,
+    delegation_token TEXT NOT NULL UNIQUE,
+    signature       TEXT NOT NULL,          -- HMAC-SHA256 of delegation payload, keyed by delegator secret
+    ttl_seconds     INTEGER NOT NULL CHECK (ttl_seconds BETWEEN 60 AND 86400),
+    issued_at       TIMESTAMPTZ NOT NULL DEFAULT NOW(),
+    expires_at      TIMESTAMPTZ NOT NULL,
+    revoked_at      TIMESTAMPTZ,
+    created_at      TIMESTAMPTZ NOT NULL DEFAULT NOW()
+);
+
+-- Token lookup (verify-delegation hot path) uses the unique index that PostgreSQL
+-- creates automatically for the UNIQUE constraint on delegation_token; no extra index is needed.
+
+-- Index for listing delegations by agent
+CREATE INDEX idx_delegation_chains_delegator ON delegation_chains(delegator_agent_id, tenant_id);
+CREATE INDEX idx_delegation_chains_delegatee ON delegation_chains(delegatee_agent_id, tenant_id);
+
+-- Index for cleanup of expired chains
+CREATE INDEX idx_delegation_chains_expires_at ON delegation_chains(expires_at);
+```
+
+### New Source Files
+
+| File | Description |
+|---|---|
+| `src/services/DelegationService.ts` | Business logic: create delegation, verify chain, revoke chain |
+| `src/controllers/DelegationController.ts` | HTTP handlers for delegation endpoints |
+| `src/routes/delegation.ts` | Express router: `POST /oauth2/token/delegate`, `POST /oauth2/token/verify-delegation`, `DELETE /oauth2/token/delegate/:chainId` |
+| `src/types/delegation.ts` | TypeScript interfaces: `DelegationChain`, `CreateDelegationRequest`, `VerifyDelegationRequest`, `DelegationVerificationResult`, `DelegationTokenPayload` |
+| `src/utils/delegationCrypto.ts` | HMAC-SHA256 signing and verification for delegation payloads — extracted utility, no duplication |
+
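A minimal sketch of what `delegationCrypto.ts` could contain, using Node's built-in `crypto` module. The payload shape, the canonical serialization, and the hex encoding are assumptions made for illustration; the spec only fixes HMAC-SHA256 keyed by the delegator secret.

```typescript
import { createHmac, timingSafeEqual } from "node:crypto";

// Hypothetical payload shape; the real DelegationTokenPayload may differ.
interface DelegationPayload {
  chainId: string;
  delegatorAgentId: string;
  delegateeAgentId: string;
  scopes: string[];
  expiresAt: string; // ISO 8601
}

// Assumption: a fixed field order with sorted scopes gives a stable byte string to sign.
function canonicalize(p: DelegationPayload): string {
  return JSON.stringify([
    p.chainId,
    p.delegatorAgentId,
    p.delegateeAgentId,
    [...p.scopes].sort(),
    p.expiresAt,
  ]);
}

// HMAC-SHA256 over the canonical payload, hex encoded.
function signPayload(payload: DelegationPayload, secret: string): string {
  return createHmac("sha256", secret).update(canonicalize(payload)).digest("hex");
}

// Recomputes the signature and compares in constant time.
function verifySignature(
  payload: DelegationPayload,
  secret: string,
  signature: string
): boolean {
  const expected = Buffer.from(signPayload(payload, secret), "hex");
  const actual = Buffer.from(signature, "hex");
  // timingSafeEqual throws on length mismatch, so check lengths first.
  return expected.length === actual.length && timingSafeEqual(expected, actual);
}
```

Sorting the scopes before signing means `["a", "b"]` and `["b", "a"]` produce the same signature, which avoids spurious verification failures if the array order changes in storage.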
+### Modified Source Files
+
+| File | Change |
+|---|---|
+| `src/routes/index.ts` | Register `delegation` router |
+| `src/infrastructure/migrations/` | Add `008_add_delegation_chains.sql` |
+| `docs/openapi.yaml` | Add delegation endpoints |
+
+### `DelegationService` Interface
+
+```typescript
+interface IDelegationService {
+    /**
+     * Create a delegation chain from delegator to delegatee.
+     * Validates scope subset, signs payload, inserts DB row, writes audit log.
+     */
+    createDelegation(
+        tenantId: string,
+        delegatorAgentId: string,
+        request: CreateDelegationRequest
+    ): Promise<DelegationChain>;
+
+    /**
+     * Verify a delegation token. Returns chain details with valid flag.
+     * Does not throw on expired/revoked — returns valid: false.
+     */
+    verifyDelegation(delegationToken: string): Promise<DelegationVerificationResult>;
+
+    /**
+     * Revoke a delegation chain. Only the delegator may revoke.
+     */
+    revokeDelegation(chainId: string, requestingAgentId: string): Promise<void>;
+}
+```
+
+### Prometheus Metrics
+
+| Metric | Type | Labels | Description |
+|---|---|---|---|
+| `agentidp_delegations_created_total` | Counter | `tenant_id` | Total delegation chains created |
+| `agentidp_delegations_verified_total` | Counter | `tenant_id`, `result` (valid/invalid/expired/revoked) | Delegation verification outcomes |
+| `agentidp_delegations_revoked_total` | Counter | `tenant_id` | Total delegations revoked |
+| `agentidp_delegation_chain_depth` | Histogram | `tenant_id` | Distribution of delegation chain nesting depth |
+
+### Feature Flag
+
+`A2A_ENABLED` environment variable (default: `true`). When `false`, all delegation routes (`/oauth2/token/delegate*` and `/oauth2/token/verify-delegation`) return HTTP 404.
+
+### Acceptance Criteria
+
+- `POST /oauth2/token/delegate` creates a delegation chain and returns a delegation token
+- Scope subset validation rejects any scope not held by the delegating agent
+- `POST /oauth2/token/verify-delegation` returns `valid: true` for active chains
+- `POST /oauth2/token/verify-delegation` returns `valid: false` (not 4xx) for expired or revoked chains
+- `DELETE /oauth2/token/delegate/:chainId` sets `revoked_at` and subsequent verification returns `valid: false`
+- A 403 is returned when a non-delegator agent attempts to revoke a chain
+- All delegation events are written to the audit log with correct `event_type`
+- Delegation crypto signature uses HMAC-SHA256 — verified at `verify-delegation` time
+- Unit test coverage >= 80% on `DelegationService` and `delegationCrypto`
+- Integration tests cover: create, verify (valid), verify (expired), verify (revoked), revoke, unauthorized revoke
--- a/openspec/changes/phase-5-scale-ecosystem/specs/agntcy-compliance/spec.md
+++ b/openspec/changes/phase-5-scale-ecosystem/specs/agntcy-compliance/spec.md
@@ -0,0 +1,320 @@
+## WS6: AGNTCY Compliance Certification Package
+
+### Purpose
+
+Position SentryAgent.ai as the reference implementation for the AGNTCY standard. Deliver four artifacts: (1) an auto-generated machine-readable AGNTCY compliance report endpoint; (2) an agent card export endpoint per the AGNTCY Agent Card specification; (3) a Jest-based interoperability test suite verifying AGNTCY alignment on every CI run; (4) a human-readable certification guide documenting how SentryAgent.ai satisfies each AGNTCY requirement.
+
+This workstream produces no user-facing UI changes. It is infrastructure for compliance, certification, and ecosystem trust.
+
+### New Endpoints
+
+#### `GET /agntcy/compliance-report`
+
+**Summary:** Generate and return a real-time AGNTCY compliance report for the authenticated tenant's environment.
+
+**Authentication:** Bearer token (tenant-scoped). The tenant's subscription tier must be `pro` or `enterprise`.
+
+**Response 200** (`application/json`):
+```json
+{
+  "reportId": "string (UUID)",
+  "generatedAt": "string (ISO 8601)",
+  "agntcySpecVersion": "1.0.0",
+  "tenantId": "string (UUID)",
+  "overallStatus": "compliant",
+  "sections": [
+    {
+      "id": "agent-identity",
+      "name": "Agent Identity",
+      "status": "compliant",
+      "requirements": [
+        {
+          "id": "AI-001",
+          "description": "Each agent MUST have a globally unique, persistent identifier",
+          "status": "compliant",
+          "evidence": "All agents are assigned a UUID v4 at registration, stored immutably in agents.id",
+          "verifiedAt": "string (ISO 8601)"
+        },
+        {
+          "id": "AI-002",
+          "description": "Each agent MUST have a W3C DID document",
+          "status": "compliant",
+          "evidence": "DID documents are auto-generated as did:web identifiers at agent registration",
+          "verifiedAt": "string (ISO 8601)"
+        }
+      ]
+    },
+    {
+      "id": "authentication",
+      "name": "Authentication",
+      "status": "compliant",
+      "requirements": [
+        {
+          "id": "AUTH-001",
+          "description": "Agent authentication MUST use OAuth 2.0 or OIDC",
+          "status": "compliant",
+          "evidence": "OAuth 2.0 Client Credentials flow implemented at POST /oauth2/token",
+          "verifiedAt": "string (ISO 8601)"
+        }
+      ]
+    },
+    {
+      "id": "authorization",
+      "name": "Authorization",
+      "status": "compliant",
+      "requirements": []
+    },
+    {
+      "id": "audit-and-governance",
+      "name": "Audit & Governance",
+      "status": "compliant",
+      "requirements": []
+    },
+    {
+      "id": "interoperability",
+      "name": "Interoperability",
+      "status": "compliant",
+      "requirements": []
+    },
+    {
+      "id": "delegation",
+      "name": "Agent-to-Agent Delegation",
+      "status": "compliant",
+      "requirements": []
+    }
+  ],
+  "summary": {
+    "totalRequirements": 24,
+    "compliant": 24,
+    "nonCompliant": 0,
+    "notApplicable": 0
+  }
+}
+```
+
+**`overallStatus`** values: `"compliant"` | `"partial"` | `"non-compliant"`
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 401 | `UNAUTHORIZED` | Missing or invalid Bearer token |
+| 403 | `TIER_REQUIRED` | Compliance report requires Pro or Enterprise tier |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
+**Business Rules:**
+- Report is generated on demand from live system state — no cache
+- Each requirement's `status` is computed by querying current system configuration (e.g., verify DID documents exist by checking `agents` table, verify audit log is enabled by checking config)
+- `agntcySpecVersion` is hardcoded to the AGNTCY spec version the system was last validated against
+- An audit log entry is created with `event_type: "compliance.report_generated"`
+
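One way the `summary` block and `overallStatus` could be derived from the evaluated requirements is sketched below. The requirement status values other than `"compliant"` and the rule for `"partial"` are assumptions; the spec lists the three overall values but does not define when each applies.

```typescript
// Sketch only: status names besides "compliant" mirror the summary field names.
type RequirementStatus = "compliant" | "non-compliant" | "not-applicable";
type OverallStatus = "compliant" | "partial" | "non-compliant";

interface RequirementResult {
  id: string;
  status: RequirementStatus;
}

interface SectionResult {
  id: string;
  requirements: RequirementResult[];
}

interface ReportSummary {
  totalRequirements: number;
  compliant: number;
  nonCompliant: number;
  notApplicable: number;
}

// Counts requirement outcomes across all sections.
function summarize(sections: SectionResult[]): ReportSummary {
  const summary: ReportSummary = {
    totalRequirements: 0,
    compliant: 0,
    nonCompliant: 0,
    notApplicable: 0,
  };
  for (const section of sections) {
    for (const req of section.requirements) {
      summary.totalRequirements += 1;
      if (req.status === "compliant") summary.compliant += 1;
      else if (req.status === "non-compliant") summary.nonCompliant += 1;
      else summary.notApplicable += 1;
    }
  }
  return summary;
}

// Assumed rule: no failures means compliant, no passes means non-compliant, else partial.
function overallStatus(summary: ReportSummary): OverallStatus {
  if (summary.nonCompliant === 0) return "compliant";
  if (summary.compliant === 0) return "non-compliant";
  return "partial";
}
```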
+---
+
+#### `GET /agents/:id/agent-card`
+
+**Summary:** Return the AGNTCY-compliant Agent Card for a specific agent. Agent Cards are publicly accessible for public agents and require authentication for private agents.
+
+**Authentication:** Optional. Required only if the agent's `is_public` is `false`.
+
+**Path Parameter:**
+
+| Parameter | Type | Description |
+|---|---|---|
+| `id` | string (UUID) | Agent ID |
+
+**Response 200** (`application/json`):
+
+Per the AGNTCY Agent Card specification:
+```json
+{
+  "agntcyVersion": "1.0",
+  "type": "agent-card",
+  "agent": {
+    "id": "string (UUID)",
+    "name": "string",
+    "description": "string | null",
+    "did": "did:web:sentryagent.ai:agents:abc123",
+    "capabilities": ["string"],
+    "version": "string",
+    "publisher": {
+      "tenantId": "string (UUID)",
+      "name": "string"
+    },
+    "endpoints": {
+      "tokenEndpoint": "https://api.sentryagent.ai/oauth2/token",
+      "delegationEndpoint": "https://api.sentryagent.ai/oauth2/token/delegate"
+    },
+    "authentication": {
+      "schemes": ["oauth2_client_credentials"],
+      "tokenEndpoint": "https://api.sentryagent.ai/oauth2/token"
+    },
+    "governance": {
+      "auditLogEnabled": true,
+      "credentialRotationPolicy": "manual",
+      "complianceStandards": ["AGNTCY-1.0", "OAuth2-RFC6749", "W3C-DID"]
+    },
+    "metadata": {}
+  },
+  "issuedAt": "string (ISO 8601)",
+  "expiresAt": "string (ISO 8601)"
+}
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 401 | `UNAUTHORIZED` | Agent is private and no Bearer token provided |
+| 403 | `FORBIDDEN` | Agent is private and authenticated tenant does not own it |
+| 404 | `AGENT_NOT_FOUND` | No agent with the given ID |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
+**Business Rules:**
+- Public agents (`is_public: true`) return agent card without authentication
+- Private agents require the owning tenant's Bearer token
+- Agent card `expiresAt` is `issuedAt + 24 hours` (cards are short-lived — consumers should re-fetch daily)
+- `complianceStandards` array is sourced from system config, not per-agent configuration
+
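The visibility rules and the 24-hour card lifetime can be illustrated with two small functions. The names are hypothetical, and `callerTenantId` is assumed to be the tenant resolved from the Bearer token (or `null` when no token is sent).

```typescript
// Sketch only: access decision for GET /agents/:id/agent-card.
interface AgentVisibility {
  tenantId: string;
  isPublic: boolean;
}

type CardAccess =
  | { status: 200 }
  | { status: 401; code: "UNAUTHORIZED" }
  | { status: 403; code: "FORBIDDEN" };

function decideCardAccess(
  agent: AgentVisibility,
  callerTenantId: string | null
): CardAccess {
  if (agent.isPublic) return { status: 200 };
  if (callerTenantId === null) return { status: 401, code: "UNAUTHORIZED" };
  if (callerTenantId !== agent.tenantId) return { status: 403, code: "FORBIDDEN" };
  return { status: 200 };
}

const CARD_LIFETIME_MS = 24 * 60 * 60 * 1000;

// expiresAt is issuedAt + 24 hours, both rendered as ISO 8601 strings.
function cardWindow(issuedAt: Date): { issuedAt: string; expiresAt: string } {
  return {
    issuedAt: issuedAt.toISOString(),
    expiresAt: new Date(issuedAt.getTime() + CARD_LIFETIME_MS).toISOString(),
  };
}
```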
+---
+
+### AGNTCY Interoperability Test Suite
+
+**File:** `tests/agntcy/interoperability.test.ts`
+
+A Jest test suite that verifies AGNTCY alignment on every CI run. Tests run against a live API instance (reads `AGENTIDP_API_URL` from environment).
+
+**Test categories and cases:**
+
+```typescript
+// AGNTCY-AI-001: Agent identity uniqueness
+test('each registered agent receives a unique UUID', ...)
+test('agent UUID is immutable after registration', ...)
+
+// AGNTCY-AI-002: W3C DID documents
+test('registered agent has a valid did:web DID', ...)
+test('DID document resolves via GET /agents/:id', ...)
+
+// AGNTCY-AUTH-001: OAuth 2.0 token issuance
+test('POST /oauth2/token returns access_token and token_type: bearer', ...)
+test('access token is a valid JWT with correct claims', ...)
+test('expired token is rejected with 401', ...)
+
+// AGNTCY-AUTH-002: OIDC compliance
+test('GET /.well-known/openid-configuration returns valid OIDC discovery document', ...)
+test('JWKS endpoint returns valid JWK Set', ...)
+
+// AGNTCY-AUTHZ-001: Scope-based access control
+test('token with agent:read scope cannot call agent:write operations', ...)
+test('scopes are included in JWT payload', ...)
+
+// AGNTCY-DEL-001: Agent-to-Agent delegation
+test('POST /oauth2/token/delegate creates a valid delegation chain', ...)
+test('delegated scopes cannot exceed delegator scopes', ...)
+test('POST /oauth2/token/verify-delegation returns valid: true for active chain', ...)
+test('POST /oauth2/token/verify-delegation returns valid: false for expired chain', ...)
+
+// AGNTCY-AUDIT-001: Immutable audit logs
+test('every token issuance creates an audit log entry', ...)
+test('audit log entries cannot be deleted via API', ...)
+
+// AGNTCY-GOV-001: Agent lifecycle governance
+test('credential rotation is logged in audit log', ...)
+test('agent deletion logs deletion event in audit log', ...)
+
+// AGNTCY-INTER-001: Agent Card export
+test('GET /agents/:id/agent-card returns valid AGNTCY Agent Card', ...)
+test('Agent Card contains required agntcyVersion, did, capabilities fields', ...)
+
+// AGNTCY-COMP-001: Compliance report
+test('GET /agntcy/compliance-report returns compliant status', ...)
+test('compliance report covers all 6 AGNTCY sections', ...)
+test('compliance report totalRequirements >= 24', ...)
+```
+
+**Running the suite:**
+```bash
+# In CI (requires live API):
+AGENTIDP_API_URL=http://localhost:3000 npm run test:agntcy
+
+# Script entry added to package.json:
+#   "test:agntcy": "jest --testPathPattern=tests/agntcy --forceExit"
+```
+
+---
+
+### AGNTCY Certification Guide
+
+**File:** `docs/agntcy/certification-guide.md`
+
+A markdown document structured as follows:
+1. **Overview** — What AGNTCY certification means and how SentryAgent.ai achieves it
+2. **Requirement Mapping** — Table mapping each AGNTCY requirement ID to the SentryAgent.ai implementation (endpoint, service, or config)
+3. **Running the Compliance Report** — Step-by-step guide to generating and interpreting the compliance report
+4. **Agent Card Usage** — How to retrieve, cache, and use Agent Cards in multi-agent workflows
+5. **Self-Certification Checklist** — Checklist for operators deploying self-hosted SentryAgent.ai to verify their instance's compliance
+6. **Submitting for Official AGNTCY Certification** — Links and instructions for the Linux Foundation AGNTCY certification program
+
+---
+
+### New Source Files
+
+| File | Description |
+|---|---|
+| `src/services/ComplianceService.ts` | Business logic: query system state, evaluate each AGNTCY requirement, build report |
+| `src/controllers/ComplianceController.ts` | HTTP handlers for compliance report and agent card endpoints |
+| `src/routes/agntcy.ts` | Express router: `GET /agntcy/compliance-report`, `GET /agents/:id/agent-card` |
+| `src/types/compliance.ts` | TypeScript interfaces: `ComplianceReport`, `ComplianceSection`, `ComplianceRequirement`, `AgentCard` |
+| `src/config/agntcyRequirements.ts` | Static array of AGNTCY requirement definitions (id, description, evaluator function reference) |
+| `tests/agntcy/interoperability.test.ts` | Jest interoperability test suite |
+| `docs/agntcy/certification-guide.md` | Human-readable certification guide |
+
+### Modified Source Files
+
+| File | Change |
+|---|---|
+| `src/routes/index.ts` | Register `agntcy` router |
+| `src/routes/agents.ts` | Add `GET /agents/:id/agent-card` route (or register via agntcy router — agent-card is agent-scoped) |
+| `package.json` (API) | Add `"test:agntcy"` script |
+| `docs/openapi.yaml` | Add `GET /agntcy/compliance-report` and `GET /agents/:id/agent-card` endpoints |
+
+### `ComplianceService` Interface
+
+```typescript
+interface IComplianceService {
+    /**
+     * Generate a live AGNTCY compliance report for the given tenant.
+     * Evaluates all registered AGNTCY requirements against current system state.
+     */
+    generateComplianceReport(tenantId: string): Promise<ComplianceReport>;
+
+    /**
+     * Generate an AGNTCY Agent Card for a specific agent.
+     */
+    generateAgentCard(agentId: string): Promise<AgentCard>;
+}
+```
+
+### Prometheus Metrics
+
+| Metric | Type | Labels | Description |
+|---|---|---|---|
+| `agentidp_compliance_reports_generated_total` | Counter | `tenant_id` | Total compliance reports generated |
+| `agentidp_compliance_report_duration_ms` | Histogram | — | Time to generate compliance report |
+| `agentidp_agent_cards_served_total` | Counter | `visibility` (public/private) | Agent cards served by visibility |
+
+### Feature Flag
+
+`AGNTCY_ENABLED` (default: `true`). When `false`, all `/agntcy/` routes and `GET /agents/:id/agent-card` return HTTP 404.
+
+### Acceptance Criteria
+
+- `GET /agntcy/compliance-report` returns a report with `overallStatus: "compliant"` on a correctly configured instance
+- Report contains all 6 sections: agent-identity, authentication, authorization, audit-and-governance, interoperability, delegation
+- Report `totalRequirements >= 24`
+- `GET /agents/:id/agent-card` returns a valid AGNTCY Agent Card with all required fields
+- Agent Card is accessible without auth for public agents
+- Agent Card requires owning tenant's auth for private agents
+- All interoperability test cases listed above (24 at present) pass against a live API instance
+- `npm run test:agntcy` exits 0 on a correctly configured instance
+- `docs/agntcy/certification-guide.md` is complete — no TODOs, no placeholders
+- Unit tests cover: compliance report generation (compliant system, partially compliant), agent card generation (public agent, private agent)
--- a/openspec/changes/phase-5-scale-ecosystem/specs/analytics-dashboard/spec.md
+++ b/openspec/changes/phase-5-scale-ecosystem/specs/analytics-dashboard/spec.md
@@ -0,0 +1,279 @@
+## WS3: Advanced Analytics Dashboard
+
+### Purpose
+
+Give paying tenants actionable visibility into their agent usage patterns. Analytics surface four dimensions: agent activity over time (heatmap), token issuance frequency and volume (trends), credential rotation frequency (rotation frequency table), and per-endpoint API call patterns (call patterns breakdown). Data is pre-aggregated nightly from the existing `usage_events` table into a new `analytics_daily_aggregates` table. Analytics are rendered in a new Analytics tab in the existing React web dashboard.
+
+### New Endpoints
+
+#### `GET /analytics/usage-summary`
+
+**Summary:** Return a high-level usage summary for the authenticated tenant over a date range.
+
+**Authentication:** Bearer token (tenant-scoped).
+
+**Query Parameters:**
+
+| Parameter | Type | Required | Default | Constraints |
+|---|---|---|---|---|
+| `from` | string (YYYY-MM-DD) | no | 30 days ago | Must be <= `to` |
+| `to` | string (YYYY-MM-DD) | no | today | Must be <= today. Max range: 365 days |
+
+**Response 200** (`application/json`):
+```json
+{
+  "tenantId": "string (UUID)",
+  "period": {
+    "from": "string (YYYY-MM-DD)",
+    "to": "string (YYYY-MM-DD)"
+  },
+  "summary": {
+    "totalApiCalls": 84320,
+    "totalTokenIssuances": 12400,
+    "totalCredentialRotations": 48,
+    "activeAgentCount": 23,
+    "averageDailyApiCalls": 2810,
+    "peakDailyApiCalls": 5102,
+    "peakDate": "2026-03-28"
+  }
+}
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 400 | `INVALID_DATE_RANGE` | `from` > `to`, or date range exceeds 365 days |
+| 401 | `UNAUTHORIZED` | Missing or invalid Bearer token |
+| 403 | `ANALYTICS_NOT_AVAILABLE` | Tenant is on free tier — analytics require Pro or Enterprise |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
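The date-range constraints shared by the analytics endpoints could be checked along these lines. The helper names are hypothetical. Two points are assumptions: the range is measured as the number of days between `from` and `to`, and a `to` date in the future is reported with the same `INVALID_DATE_RANGE` code, since the spec does not name a separate code for it.

```typescript
const DAY_MS = 24 * 60 * 60 * 1000;

// Parses a strict YYYY-MM-DD string as a UTC date; returns null for anything else.
function parseDay(value: string): Date | null {
  if (!/^\d{4}-\d{2}-\d{2}$/.test(value)) return null;
  const parsed = new Date(`${value}T00:00:00Z`);
  if (Number.isNaN(parsed.getTime())) return null;
  // Reject calendar dates that do not round-trip, such as 2026-02-30.
  return parsed.toISOString().slice(0, 10) === value ? parsed : null;
}

// maxRangeDays is 365 for usage-summary and token-trends, 90 for agent-activity.
function validateRange(
  from: string,
  to: string,
  today: string,
  maxRangeDays: number
): "INVALID_DATE_RANGE" | null {
  const fromDay = parseDay(from);
  const toDay = parseDay(to);
  const todayDay = parseDay(today);
  if (fromDay === null || toDay === null || todayDay === null) {
    return "INVALID_DATE_RANGE";
  }
  if (fromDay.getTime() > toDay.getTime()) return "INVALID_DATE_RANGE";
  if (toDay.getTime() > todayDay.getTime()) return "INVALID_DATE_RANGE";
  const spanDays = (toDay.getTime() - fromDay.getTime()) / DAY_MS;
  return spanDays > maxRangeDays ? "INVALID_DATE_RANGE" : null;
}
```

Passing `today` in as a parameter, rather than reading the clock inside the function, keeps the check deterministic in tests.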
+---
+
+#### `GET /analytics/agent-activity`
+
+**Summary:** Return per-agent daily activity counts for heatmap rendering.
+
+**Authentication:** Bearer token (tenant-scoped).
+
+**Query Parameters:**
+
+| Parameter | Type | Required | Default | Constraints |
+|---|---|---|---|---|
+| `from` | string (YYYY-MM-DD) | no | 30 days ago | Must be <= `to` |
+| `to` | string (YYYY-MM-DD) | no | today | Max range: 90 days |
+| `agentId` | string (UUID) | no | (all agents) | Filter to a single agent |
+
+**Response 200** (`application/json`):
+```json
+{
+  "tenantId": "string (UUID)",
+  "period": {
+    "from": "string (YYYY-MM-DD)",
+    "to": "string (YYYY-MM-DD)"
+  },
+  "agents": [
+    {
+      "agentId": "string (UUID)",
+      "agentName": "string",
+      "dailyActivity": [
+        {
+          "date": "2026-03-01",
+          "apiCalls": 342,
+          "tokenIssuances": 12,
+          "credentialRotations": 0
+        }
+      ]
+    }
+  ]
+}
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 400 | `INVALID_DATE_RANGE` | `from` > `to`, or date range exceeds 90 days |
+| 401 | `UNAUTHORIZED` | Missing or invalid Bearer token |
+| 403 | `ANALYTICS_NOT_AVAILABLE` | Free tier — requires Pro or Enterprise |
+| 404 | `AGENT_NOT_FOUND` | `agentId` filter specified but agent does not belong to tenant |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
+---
+
+#### `GET /analytics/token-trends`
+
+**Summary:** Return daily token issuance counts and success/failure breakdown for trend charts.
+
+**Authentication:** Bearer token (tenant-scoped).
+
+**Query Parameters:**
+
+| Parameter | Type | Required | Default | Constraints |
+|---|---|---|---|---|
+| `from` | string (YYYY-MM-DD) | no | 30 days ago | Must be <= `to` |
+| `to` | string (YYYY-MM-DD) | no | today | Max range: 365 days |
+| `granularity` | string | no | `day` | Enum: `day`, `week` |
+
+**Response 200** (`application/json`):
+```json
+{
+  "tenantId": "string (UUID)",
+  "period": {
+    "from": "string (YYYY-MM-DD)",
+    "to": "string (YYYY-MM-DD)"
+  },
+  "granularity": "day",
+  "dataPoints": [
+    {
+      "date": "2026-03-01",
+      "totalIssuances": 420,
+      "successfulIssuances": 415,
+      "failedIssuances": 5,
+      "uniqueAgents": 8
+    }
+  ]
+}
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 400 | `INVALID_DATE_RANGE` | `from` > `to`, or date range exceeds 365 days |
+| 400 | `INVALID_GRANULARITY` | `granularity` is not `day` or `week` |
+| 401 | `UNAUTHORIZED` | Missing or invalid Bearer token |
+| 403 | `ANALYTICS_NOT_AVAILABLE` | Free tier — requires Pro or Enterprise |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
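For `granularity=week`, daily data points have to be rolled up into weekly buckets. The sketch below assumes weeks start on Monday (UTC) and that each bucket is labelled with its Monday date; neither is stated in the spec. `uniqueAgents` is deliberately omitted because distinct counts cannot be summed across days and would need a separate query.

```typescript
// Sketch only: a reduced data point without uniqueAgents.
interface DailyPoint {
  date: string; // YYYY-MM-DD
  totalIssuances: number;
  successfulIssuances: number;
  failedIssuances: number;
}

// Returns the Monday (UTC) of the week containing the given date.
function weekStart(date: string): string {
  const day = new Date(`${date}T00:00:00Z`);
  const daysSinceMonday = (day.getUTCDay() + 6) % 7; // Sunday=0 maps to 6
  day.setUTCDate(day.getUTCDate() - daysSinceMonday);
  return day.toISOString().slice(0, 10);
}

// Sums daily counts into one point per week, in first-seen order.
function rollUpWeekly(points: DailyPoint[]): DailyPoint[] {
  const buckets = new Map<string, DailyPoint>();
  for (const p of points) {
    const key = weekStart(p.date);
    const bucket = buckets.get(key) ?? {
      date: key,
      totalIssuances: 0,
      successfulIssuances: 0,
      failedIssuances: 0,
    };
    bucket.totalIssuances += p.totalIssuances;
    bucket.successfulIssuances += p.successfulIssuances;
    bucket.failedIssuances += p.failedIssuances;
    buckets.set(key, bucket);
  }
  return Array.from(buckets.values());
}
```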
+---
+
+### Database Schema Changes
+
+#### Migration: `009_add_analytics_aggregates.sql`
+
+```sql
+CREATE TABLE analytics_daily_aggregates (
+    id              UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    tenant_id       UUID NOT NULL REFERENCES tenants(id) ON DELETE CASCADE,
+    agent_id        UUID REFERENCES agents(id) ON DELETE SET NULL,   -- NULL = tenant-wide aggregate
+    date            DATE NOT NULL,
+    metric_type     VARCHAR(64) NOT NULL,   -- 'api_calls' | 'token_issuances' | 'credential_rotations' | 'token_failures'
+    count           BIGINT NOT NULL DEFAULT 0,
+    created_at      TIMESTAMPTZ NOT NULL DEFAULT NOW(),
+    updated_at      TIMESTAMPTZ NOT NULL DEFAULT NOW(),
+
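+    -- NULLS NOT DISTINCT (requires PostgreSQL 15+) makes tenant-wide rows (agent_id IS NULL) conflict as well,
+    -- so the nightly upsert cannot insert duplicate tenant-wide aggregates
+    CONSTRAINT uq_daily_aggregate UNIQUE NULLS NOT DISTINCT (tenant_id, agent_id, date, metric_type)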
+);
+
+-- Index for analytics queries (tenant + date range)
+CREATE INDEX idx_analytics_tenant_date ON analytics_daily_aggregates(tenant_id, date);
+CREATE INDEX idx_analytics_agent_date ON analytics_daily_aggregates(agent_id, date) WHERE agent_id IS NOT NULL;
+```
+
+#### Nightly Aggregation Job
+
+A `node-cron` job runs at `00:05 UTC` daily inside the Express API process. It executes an upsert query aggregating the previous day's `usage_events` rows into `analytics_daily_aggregates`. The job is idempotent — running it twice for the same date produces no duplicates (upsert on the unique constraint).
+
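+Job logic (SQL sketch; the `usage_events` column names `date`, `metric_type`, and `count` are assumed from the aggregation described above):
+```sql
+-- Aggregate yesterday's (UTC) usage_events rows and upsert them into the daily table.
+INSERT INTO analytics_daily_aggregates (tenant_id, agent_id, date, metric_type, count)
+SELECT tenant_id, agent_id, date, metric_type, SUM(count)
+FROM usage_events
+WHERE date = (NOW() AT TIME ZONE 'UTC')::date - 1
+GROUP BY tenant_id, agent_id, date, metric_type
+ON CONFLICT (tenant_id, agent_id, date, metric_type)
+DO UPDATE SET count = EXCLUDED.count, updated_at = NOW();
+```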
+
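The job's date computation and its idempotency property can be illustrated without a database. In this sketch the function names are hypothetical and a `Map` stands in for the table, with the map key playing the role of the unique constraint on `(tenant_id, agent_id, date, metric_type)`.

```typescript
// Yesterday's UTC calendar date as YYYY-MM-DD, relative to the moment the job fires.
function targetDate(now: Date): string {
  const yesterday = new Date(
    Date.UTC(now.getUTCFullYear(), now.getUTCMonth(), now.getUTCDate() - 1)
  );
  return yesterday.toISOString().slice(0, 10);
}

interface AggregateRow {
  tenantId: string;
  agentId: string | null; // null = tenant-wide aggregate
  date: string;
  metricType: string;
  count: number;
}

// In-memory stand-in for the upsert: an existing key is overwritten, never duplicated.
function upsertAggregates(
  store: Map<string, AggregateRow>,
  rows: AggregateRow[]
): void {
  for (const row of rows) {
    // A null agentId is normalized so tenant-wide rows share one key per day and metric.
    const key = [row.tenantId, row.agentId ?? "", row.date, row.metricType].join("|");
    store.set(key, row);
  }
}
```

Running `upsertAggregates` twice with the same rows leaves the store unchanged in size, which is the behaviour the job relies on when it is re-run for the same date.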
+### New Source Files
+
+| File | Description |
+|---|---|
+| `src/services/AnalyticsService.ts` | Business logic: query aggregates, build response shapes, Redis caching |
+| `src/controllers/AnalyticsController.ts` | HTTP handlers for analytics endpoints |
+| `src/routes/analytics.ts` | Express router for `/analytics/` prefix |
+| `src/jobs/analyticsAggregation.ts` | `node-cron` job that aggregates usage_events nightly |
+| `src/types/analytics.ts` | TypeScript interfaces: `UsageSummary`, `AgentActivityResponse`, `TokenTrendsResponse`, `DailyAggregate` |
+| `dashboard/src/pages/Analytics.tsx` | New Analytics tab in existing React dashboard |
+| `dashboard/src/components/charts/AgentHeatmap.tsx` | Heatmap component using `recharts` `ResponsiveContainer` + custom cells |
+| `dashboard/src/components/charts/TokenTrendsChart.tsx` | Line chart of token issuance over time using `recharts` `LineChart` |
+| `dashboard/src/components/charts/RotationFrequencyTable.tsx` | Sortable table of credential rotation counts per agent |
+| `dashboard/src/api/analyticsApi.ts` | Typed fetch functions for analytics endpoints |
+
+### Modified Source Files
+
+| File | Change |
+|---|---|
+| `src/app.ts` | Register `analytics` router; start nightly aggregation cron job |
+| `src/infrastructure/migrations/` | Add `009_add_analytics_aggregates.sql` |
+| `dashboard/src/App.tsx` | Add Analytics route and nav link |
+| `package.json` (API) | Add `node-cron` dependency |
+| `package.json` (dashboard) | Add `recharts`, `date-fns` dependencies |
+| `docs/openapi.yaml` | Add analytics endpoints |
+
+### Redis Caching
+
+Analytics responses are cached in Redis under `analytics:{tenantId}:{endpoint}:{queryHash}` keys. TTL: 5 minutes for agent-activity and token-trends; 60 seconds for usage-summary. Entries expire by TTL only; there is no explicit invalidation, so the first request after expiry recomputes the response and repopulates the cache.
+
+### `AnalyticsService` Interface
+
+```typescript
+interface IAnalyticsService {
+    /**
+     * Return a high-level usage summary for a tenant over a date range.
+     */
+    getUsageSummary(tenantId: string, from: Date, to: Date): Promise<UsageSummary>;
+
+    /**
+     * Return per-agent daily activity data for heatmap rendering.
+     */
+    getAgentActivity(
+        tenantId: string,
+        from: Date,
+        to: Date,
+        agentId?: string
+    ): Promise<AgentActivityResponse>;
+
+    /**
+     * Return daily token issuance trends with success/failure breakdown.
+     */
+    getTokenTrends(
+        tenantId: string,
+        from: Date,
+        to: Date,
+        granularity: 'day' | 'week'
+    ): Promise<TokenTrendsResponse>;
+}
+```
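
All three methods take a `from`/`to` pair, and the acceptance criteria require `from > to` to be rejected with HTTP 400. A minimal sketch of that check, assuming the controller receives ISO 8601 strings; `parseDateRange` and the `INVALID_DATE_RANGE` code are hypothetical names, since the spec does not define them.

```typescript
type DateRangeResult =
    | { ok: true; from: Date; to: Date }
    | { ok: false; status: 400; code: string };

// Parse ISO 8601 strings and reject unparseable values or from > to.
function parseDateRange(fromRaw: string, toRaw: string): DateRangeResult {
    const from = new Date(fromRaw);
    const to = new Date(toRaw);
    if (isNaN(from.getTime()) || isNaN(to.getTime())) {
        // Hypothetical error code; the spec only fixes the status.
        return { ok: false, status: 400, code: 'INVALID_DATE_RANGE' };
    }
    if (from.getTime() > to.getTime()) {
        return { ok: false, status: 400, code: 'INVALID_DATE_RANGE' };
    }
    return { ok: true, from, to };
}
```

Validating before the service call keeps `IAnalyticsService` free of HTTP concerns: it only ever sees well-formed `Date` values.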
+
+### Prometheus Metrics
+
+| Metric | Type | Labels | Description |
+|---|---|---|---|
+| `agentidp_analytics_query_duration_ms` | Histogram | `endpoint` | Analytics query latency (before cache) |
+| `agentidp_analytics_cache_hits_total` | Counter | `endpoint` | Analytics Redis cache hits |
+| `agentidp_analytics_cache_misses_total` | Counter | `endpoint` | Analytics Redis cache misses |
+| `agentidp_analytics_aggregation_job_duration_ms` | Gauge | — | Nightly aggregation job runtime |
+| `agentidp_analytics_aggregation_job_last_run` | Gauge | — | Unix timestamp of last successful aggregation job run |
+
+### Feature Flags
+
+| Variable | Default | Description |
+|---|---|---|
+| `ANALYTICS_ENABLED` | `true` | When `false`, all `/analytics/` routes return HTTP 404 |
+| `ANALYTICS_FREE_TIER` | `false` | When `true`, free tier tenants can access analytics (for beta/testing) |
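
How the two flags combine with the tenant tier can be sketched as a single gate. This is an illustrative reading of the table above plus the 403 rule in the acceptance criteria; `analyticsAccessStatus` and the `AnalyticsFlags` shape are hypothetical.

```typescript
type Tier = 'free' | 'pro' | 'enterprise';

interface AnalyticsFlags {
    analyticsEnabled: boolean;   // ANALYTICS_ENABLED
    analyticsFreeTier: boolean;  // ANALYTICS_FREE_TIER
}

// Returns the HTTP status the analytics routes would answer with before
// any query runs: 404 when disabled, 403 for gated free tenants, else 200.
function analyticsAccessStatus(flags: AnalyticsFlags, tier: Tier): 200 | 403 | 404 {
    if (!flags.analyticsEnabled) return 404;
    if (tier === 'free' && !flags.analyticsFreeTier) return 403;
    return 200;
}
```

Checking `ANALYTICS_ENABLED` first means a disabled deployment answers 404 for every tier, so the feature's existence is not revealed through a 403.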
+
+### Acceptance Criteria
+
+- `GET /analytics/usage-summary` returns correct aggregate counts for a date range
+- `GET /analytics/agent-activity` returns per-agent daily rows matching `analytics_daily_aggregates`
+- `GET /analytics/token-trends` returns correctly bucketed results for both `day` and `week` granularity
+- All three endpoints return HTTP 403 for free-tier tenants (when `ANALYTICS_FREE_TIER=false`)
+- Date range validation rejects `from > to` with HTTP 400
+- Nightly aggregation job runs idempotently — running twice for same date produces no duplicates
+- Analytics responses are cached in Redis — a second identical request does not hit the DB
+- Dashboard Analytics tab renders heatmap, trend chart, and rotation table with mock data in Storybook
+- Unit test coverage >= 80% on `AnalyticsService`
+- Integration tests cover: summary, activity, trends (daily), trends (weekly), free-tier rejection, invalid date range
--- a/openspec/changes/phase-5-scale-ecosystem/specs/api-gateway-tiers/spec.md
+++ b/openspec/changes/phase-5-scale-ecosystem/specs/api-gateway-tiers/spec.md
@@ -0,0 +1,276 @@
+## WS4: Public API Gateway & Rate Limiting SaaS
+
+### Purpose
+
+Replace the single flat rate limit (Phase 4) with a multi-tier enforcement model where each tenant's rate limits are determined by their subscription tier (`free` | `pro` | `enterprise`). Expose the tier definitions publicly via `GET /tiers` so developers can understand limits before registering. Add `POST /billing/upgrade` so tenants can self-service upgrade their tier without contacting support.
+
+This workstream closes the gap between Phase 4's flat rate limiter and a proper commercial SaaS gateway model.
+
+### New Endpoints
+
+#### `GET /tiers`
+
+**Summary:** Return the current tier definitions including rate limits, feature flags, and pricing.
+
+**Authentication:** None (public endpoint).
+
+**Response 200** (`application/json`):
+```json
+{
+  "tiers": [
+    {
+      "id": "free",
+      "name": "Free",
+      "price": {
+        "monthly": 0,
+        "currency": "USD"
+      },
+      "limits": {
+        "registeredAgents": 10,
+        "apiCallsPerDay": 1000,
+        "tokenIssuancesPerDay": 200,
+        "rateLimitPerMinute": 60,
+        "rateLimitBurst": 10,
+        "auditLogRetentionDays": 30
+      },
+      "features": {
+        "marketplace": true,
+        "githubActions": true,
+        "analytics": false,
+        "webhooks": false,
+        "sso": false,
+        "sla": false,
+        "customDomain": false,
+        "prioritySupport": false
+      }
+    },
+    {
+      "id": "pro",
+      "name": "Pro",
+      "price": {
+        "monthly": 49,
+        "currency": "USD"
+      },
+      "limits": {
+        "registeredAgents": 100,
+        "apiCallsPerDay": 50000,
+        "tokenIssuancesPerDay": 10000,
+        "rateLimitPerMinute": 600,
+        "rateLimitBurst": 100,
+        "auditLogRetentionDays": 90
+      },
+      "features": {
+        "marketplace": true,
+        "githubActions": true,
+        "analytics": true,
+        "webhooks": true,
+        "sso": false,
+        "sla": false,
+        "customDomain": false,
+        "prioritySupport": false
+      }
+    },
+    {
+      "id": "enterprise",
+      "name": "Enterprise",
+      "price": {
+        "monthly": null,
+        "currency": "USD",
+        "note": "Contact sales"
+      },
+      "limits": {
+        "registeredAgents": null,
+        "apiCallsPerDay": null,
+        "tokenIssuancesPerDay": null,
+        "rateLimitPerMinute": 6000,
+        "rateLimitBurst": 1000,
+        "auditLogRetentionDays": 365
+      },
+      "features": {
+        "marketplace": true,
+        "githubActions": true,
+        "analytics": true,
+        "webhooks": true,
+        "sso": true,
+        "sla": true,
+        "customDomain": true,
+        "prioritySupport": true
+      }
+    }
+  ]
+}
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 429 | `RATE_LIMITED` | Rate limit exceeded (even unauthenticated endpoints have a global IP-based limit) |
+
+**Notes:**
+- `null` limits mean unlimited
+- Tier definitions are sourced from a static configuration object in the codebase, not a database table
+- The response is cached at the HTTP layer with `Cache-Control: public, max-age=3600`
+
+---
+
+#### `POST /billing/upgrade`
+
+**Summary:** Initiate a self-service tier upgrade for the authenticated tenant. Creates a Stripe Checkout session for the target tier.
+
+**Authentication:** Bearer token (tenant-scoped).
+
+**Request Body** (`application/json`):
+```json
+{
+  "targetTier": "pro"
+}
+```
+
+| Field | Type | Required | Constraints |
+|---|---|---|---|
+| `targetTier` | string | yes | Enum: `pro`, `enterprise` — cannot downgrade via this endpoint |
+
+**Response 200** (`application/json`):
+```json
+{
+  "checkoutUrl": "https://checkout.stripe.com/pay/cs_...",
+  "sessionId": "cs_...",
+  "targetTier": "pro",
+  "expiresAt": "string (ISO 8601)"
+}
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 400 | `ALREADY_ON_TIER` | Tenant is already subscribed to `targetTier` |
+| 400 | `INVALID_TARGET_TIER` | `targetTier` is not a valid upgradeable tier |
+| 400 | `DOWNGRADE_NOT_SUPPORTED` | `targetTier` is lower than the tenant's current tier |
+| 401 | `UNAUTHORIZED` | Missing or invalid Bearer token |
+| 422 | `STRIPE_ERROR` | Stripe API returned an error creating the Checkout session |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
+**Business Rules:**
+- This endpoint extends the existing `BillingService` — a new `upgradeTier(tenantId, targetTier)` method creates a Stripe Checkout session with the correct Stripe Price ID for the target tier
+- The Stripe Price IDs per tier are configured via environment variables: `STRIPE_PRICE_ID_PRO`, `STRIPE_PRICE_ID_ENTERPRISE`
+- After payment, Stripe sends `customer.subscription.created` webhook → existing webhook handler updates `tenant_subscriptions`
+- The `TierRateLimiter` reads the updated tier from `tenant_subscriptions` within 60 seconds (Redis cache TTL for tier lookup)
+- Downgrade is handled via the existing Stripe customer portal — not exposed as an API endpoint
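
The three HTTP 400 cases in the error table can be sketched as one pre-flight check that runs before any Stripe call. The ordering of the checks and the names `validateUpgrade` and `TIER_RANK` are assumptions made for this illustration.

```typescript
type Tier = 'free' | 'pro' | 'enterprise';
type UpgradeError = 'INVALID_TARGET_TIER' | 'ALREADY_ON_TIER' | 'DOWNGRADE_NOT_SUPPORTED';

// Rank tiers so "lower than current" has a concrete meaning.
const TIER_RANK: Record<Tier, number> = { free: 0, pro: 1, enterprise: 2 };
const UPGRADEABLE: readonly string[] = ['pro', 'enterprise'];

// Returns null when the upgrade may proceed to Stripe Checkout,
// otherwise the error code for the HTTP 400 response.
function validateUpgrade(current: Tier, target: string): UpgradeError | null {
    if (UPGRADEABLE.indexOf(target) === -1) return 'INVALID_TARGET_TIER';
    const targetRank = TIER_RANK[target as Tier];
    if (targetRank === TIER_RANK[current]) return 'ALREADY_ON_TIER';
    if (targetRank < TIER_RANK[current]) return 'DOWNGRADE_NOT_SUPPORTED';
    return null;
}
```

With this ordering, a request for `free` is reported as `INVALID_TARGET_TIER` (it is outside the enum), while `enterprise` to `pro` is reported as `DOWNGRADE_NOT_SUPPORTED`.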
+
+---
+
+### `TierRateLimiter` Middleware
+
+This replaces the single `RateLimiterRedis` middleware for all authenticated routes. It reads the tenant's current tier, looks up the tier rate limit configuration, and enforces it using per-tenant Redis keys via `rate-limiter-flexible`.
+
+**Middleware behavior:**
+1. Extract `tenantId` from the authenticated request context
+2. Look up tier from Redis cache key `tier:{tenantId}` (TTL: 60 seconds)
+3. On cache miss: query `tenant_subscriptions` for `tenantId`, cache result for 60s
+4. Look up rate limit configuration for the tier from the static tier config
+5. Apply `rate-limiter-flexible` with key `rl:{tier}:{tenantId}` and tier-specific limits
+6. On rate limit exceeded: return HTTP 429 with headers:
+   - `X-RateLimit-Limit: <limit>`
+   - `X-RateLimit-Remaining: <remaining>`
+   - `X-RateLimit-Reset: <unix timestamp>`
+   - `Retry-After: <seconds>`
+7. On each rejection, increment the `agentidp_rate_limit_hits_total` counter (labels: `tier`, `tenant_id`, `endpoint`)
+
+**Unauthenticated routes:** Continue to use the existing flat `RateLimiterRedis` with IP-based keys (unchanged from Phase 4).
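
The key scheme and response headers from the middleware steps can be sketched with an in-memory fixed window. This is a simplified stand-in, not the `rate-limiter-flexible` API: `checkRateLimit` and `WindowState` are hypothetical, burst handling is left out, and the sketch returns the `X-RateLimit-*` headers on every call as a design choice.

```typescript
interface WindowState { count: number; resetAtMs: number; }

interface RateLimitDecision {
    allowed: boolean;
    headers: Record<string, string>;
}

const WINDOW_MS = 60000;

// Fixed one-minute window per tenant; burst handling is omitted for brevity.
function checkRateLimit(
    store: Map<string, WindowState>,
    tier: string,
    tenantId: string,
    limitPerMinute: number,
    nowMs: number,
): RateLimitDecision {
    const key = `rl:${tier}:${tenantId}`;
    let state = store.get(key);
    if (!state || nowMs >= state.resetAtMs) {
        // Start a fresh window on first use or after the previous one ends.
        state = { count: 0, resetAtMs: nowMs + WINDOW_MS };
        store.set(key, state);
    }
    state.count += 1;
    const allowed = state.count <= limitPerMinute;
    const headers: Record<string, string> = {
        'X-RateLimit-Limit': String(limitPerMinute),
        'X-RateLimit-Remaining': String(Math.max(0, limitPerMinute - state.count)),
        'X-RateLimit-Reset': String(Math.ceil(state.resetAtMs / 1000)),
    };
    if (!allowed) {
        // Seconds until the window resets, for the HTTP 429 response.
        headers['Retry-After'] = String(Math.ceil((state.resetAtMs - nowMs) / 1000));
    }
    return { allowed, headers };
}
```

Because the tier is part of the key, a tenant that upgrades starts a fresh counter under the new tier rather than inheriting the old window.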
+
+### Tier Configuration Object
+
+Centralized in `src/config/tiers.ts` — this is the single source of truth for all tier limits and features. Both `GET /tiers` and `TierRateLimiter` read from this same object.
+
+```typescript
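+// `null` limits mean unlimited. Each entry carries every field that `GET /tiers`
+// returns, so the endpoint can serialize this object directly.
+export const TIER_CONFIG: Record<TierName, TierDefinition> = {
+    free: {
+        id: 'free',
+        name: 'Free',
+        price: { monthly: 0, currency: 'USD' },
+        limits: {
+            registeredAgents: 10,
+            apiCallsPerDay: 1000,
+            tokenIssuancesPerDay: 200,
+            rateLimitPerMinute: 60,
+            rateLimitBurst: 10,
+            auditLogRetentionDays: 30,
+        },
+        features: {
+            marketplace: true, githubActions: true, analytics: false, webhooks: false,
+            sso: false, sla: false, customDomain: false, prioritySupport: false,
+        },
+        stripePriceId: null, // the free tier has no Stripe Price
+    },
+    pro: {
+        id: 'pro',
+        name: 'Pro',
+        price: { monthly: 49, currency: 'USD' },
+        limits: {
+            registeredAgents: 100,
+            apiCallsPerDay: 50000,
+            tokenIssuancesPerDay: 10000,
+            rateLimitPerMinute: 600,
+            rateLimitBurst: 100,
+            auditLogRetentionDays: 90,
+        },
+        features: {
+            marketplace: true, githubActions: true, analytics: true, webhooks: true,
+            sso: false, sla: false, customDomain: false, prioritySupport: false,
+        },
+        stripePriceId: process.env.STRIPE_PRICE_ID_PRO ?? '',
+    },
+    enterprise: {
+        id: 'enterprise',
+        name: 'Enterprise',
+        price: { monthly: null, currency: 'USD', note: 'Contact sales' },
+        limits: {
+            registeredAgents: null,
+            apiCallsPerDay: null,
+            tokenIssuancesPerDay: null,
+            rateLimitPerMinute: 6000,
+            rateLimitBurst: 1000,
+            auditLogRetentionDays: 365,
+        },
+        features: {
+            marketplace: true, githubActions: true, analytics: true, webhooks: true,
+            sso: true, sla: true, customDomain: true, prioritySupport: true,
+        },
+        stripePriceId: process.env.STRIPE_PRICE_ID_ENTERPRISE ?? '',
+    },
+};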
+```
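
Since `null` means unlimited, every quota check has to treat it as a distinct case rather than as zero. A small sketch of such a check; `isWithinLimit` is a hypothetical helper name, and comparing with `<` (so that `used` counts units already consumed) is an assumption.

```typescript
// A limit of null means the tier is uncapped for that dimension.
// `used` is the number of units already consumed; the check answers
// whether one more unit may be consumed.
function isWithinLimit(limit: number | null, used: number): boolean {
    return limit === null || used < limit;
}

// Example: a free tenant with 10 registered agents cannot add an 11th,
// while an enterprise tenant (limit null) always can.
const freeCanRegister = isWithinLimit(10, 10);
const enterpriseCanRegister = isWithinLimit(null, 5000);
```

Testing `limit === null` explicitly avoids the JavaScript pitfall where `used < null` coerces `null` to `0` and silently blocks every unlimited tenant.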
+
+### New Source Files
+
+| File | Description |
+|---|---|
+| `src/config/tiers.ts` | Static tier configuration — single source of truth for limits and features |
+| `src/middleware/tierRateLimiter.ts` | `TierRateLimiter` middleware — reads tenant tier, enforces tier-specific limits |
+| `src/routes/tiers.ts` | Express router for `GET /tiers` |
+| `src/types/tiers.ts` | TypeScript interfaces: `TierDefinition`, `TierName`, `TierLimits`, `TierFeatures` |
+
+### Modified Source Files
+
+| File | Change |
+|---|---|
+| `src/middleware/rateLimiter.ts` | Retain for unauthenticated routes; authenticated routes switch to `tierRateLimiter` |
+| `src/services/BillingService.ts` | Add `upgradeTier(tenantId, targetTier)` method |
+| `src/controllers/BillingController.ts` | Add handler for `POST /billing/upgrade` |
+| `src/routes/billing.ts` | Register `POST /billing/upgrade` route |
+| `src/routes/index.ts` | Register `tiers` router |
+| `.env.example` | Add `STRIPE_PRICE_ID_PRO`, `STRIPE_PRICE_ID_ENTERPRISE`, `TIER_RATE_LIMITING_ENABLED` |
+| `docs/openapi.yaml` | Add `GET /tiers` and `POST /billing/upgrade` endpoints |
+
+### Prometheus Metrics
+
+| Metric | Type | Labels | Description |
+|---|---|---|---|
+| `agentidp_rate_limit_hits_total` | Counter | `tier`, `tenant_id`, `endpoint` | Rate limit rejections per tier (replaces old flat counter) |
+| `agentidp_tier_cache_hits_total` | Counter | — | Tier Redis cache hits |
+| `agentidp_tier_cache_misses_total` | Counter | — | Tier Redis cache misses |
+| `agentidp_billing_upgrades_total` | Counter | `from_tier`, `to_tier` | Self-service upgrade checkout sessions created |
+
+### Feature Flag
+
+`TIER_RATE_LIMITING_ENABLED` (default: `true`). When `false`, the system uses the old flat `RateLimiterRedis` middleware — this is the rollback mechanism.
+
+### Acceptance Criteria
+
+- `GET /tiers` returns all three tier definitions matching `TIER_CONFIG` exactly, with no database query and with the header `Cache-Control: public, max-age=3600`
+- `POST /billing/upgrade` creates a Stripe Checkout session and returns `checkoutUrl`
+- `POST /billing/upgrade` returns HTTP 400 `ALREADY_ON_TIER` when tenant is already on the target tier
+- `POST /billing/upgrade` returns HTTP 400 `DOWNGRADE_NOT_SUPPORTED` when target tier is lower than current
+- `TierRateLimiter` enforces free tier limits (60 req/min) for free tenants
+- `TierRateLimiter` enforces pro tier limits (600 req/min) for pro tenants
+- Tier lookup is cached in Redis — second request does not query `tenant_subscriptions`
+- Rate limit response includes `X-RateLimit-*` headers and `Retry-After`
+- After a Stripe webhook updates `tenant_subscriptions` to `pro`, `TierRateLimiter` applies pro limits within 60 seconds (next cache refresh)
+- Unit tests cover: tier lookup (cached), tier lookup (miss), free limit enforcement, pro limit enforcement, upgrade (success), upgrade (already on tier), upgrade (downgrade rejected)
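
The 60-second propagation criterion follows directly from a read-through cache with a fixed TTL. A minimal sketch with an injected clock and a synchronous loader; `TierCache` is hypothetical, the loader stands in for the `tenant_subscriptions` query, and the real lookup would be asynchronous and backed by Redis.

```typescript
type Tier = 'free' | 'pro' | 'enterprise';

interface CacheEntry { tier: Tier; expiresAtMs: number; }

const TIER_CACHE_TTL_MS = 60000;

// Read-through cache: serve from memory while fresh, otherwise call the
// loader (a stand-in for the tenant_subscriptions query) and re-cache.
class TierCache {
    private entries = new Map<string, CacheEntry>();
    private loadTier: (tenantId: string) => Tier;
    loads = 0; // counts loader calls, for the demonstration only

    constructor(loadTier: (tenantId: string) => Tier) {
        this.loadTier = loadTier;
    }

    get(tenantId: string, nowMs: number): Tier {
        const key = `tier:${tenantId}`;
        const hit = this.entries.get(key);
        if (hit && nowMs < hit.expiresAtMs) return hit.tier;
        this.loads += 1;
        const tier = this.loadTier(tenantId);
        this.entries.set(key, { tier, expiresAtMs: nowMs + TIER_CACHE_TTL_MS });
        return tier;
    }
}

let subscribed: Tier = 'free';
const cache = new TierCache(() => subscribed);
cache.get('t1', 0);                     // miss: loads 'free'
subscribed = 'pro';                     // webhook updates the subscription
const stale = cache.get('t1', 30000);   // still within TTL: old tier
const fresh = cache.get('t1', 60000);   // TTL elapsed: reloads new tier
```

The stale read at 30 seconds is the accepted cost of the cache: an upgraded tenant may see the old limits for up to one TTL, which is the bound the acceptance criterion states.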
--- a/openspec/changes/phase-5-scale-ecosystem/specs/developer-experience/spec.md
+++ b/openspec/changes/phase-5-scale-ecosystem/specs/developer-experience/spec.md
@@ -0,0 +1,228 @@
+## WS5: Developer Experience (DX) Improvements
+
+### Purpose
+
+Reduce time-to-first-successful-agent-call to under 5 minutes for a new developer. Three concrete improvements: (1) upgrade the developer portal's API explorer from Swagger UI v4 to Stoplight Elements — a modern, component-based API documentation experience with better navigation, code samples, and mock server support; (2) add a scaffold generator endpoint that returns a language-specific starter project pre-wired with the developer's agent credentials as a downloadable ZIP; (3) add a `sentryagent scaffold` CLI command that calls the scaffold endpoint and extracts the ZIP into the current directory.
+
+### New Endpoint
+
+#### `GET /sdk/scaffold/:agentId`
+
+**Summary:** Generate and return a language-specific scaffold ZIP for the specified agent.
+
+**Authentication:** Bearer token (tenant-scoped). The authenticated tenant must own the specified agent.
+
+**Path Parameter:**
+
+| Parameter | Type | Description |
+|---|---|---|
+| `agentId` | string (UUID) | The agent for which to generate the scaffold |
+
+**Query Parameters:**
+
+| Parameter | Type | Required | Default | Constraints |
+|---|---|---|---|---|
+| `language` | string | no | `typescript` | Enum: `typescript`, `python`, `go`, `java`, `rust` |
+
+**Response 200:**
+- Content-Type: `application/zip`
+- Content-Disposition: `attachment; filename="sentryagent-scaffold-{agentName}-{language}.zip"`
+- Body: Binary ZIP archive stream
+
+**ZIP Archive Contents (TypeScript example):**
+
+```
+sentryagent-scaffold-my-agent-typescript/
+├── package.json            (name: my-agent, version: 0.1.0, deps: sentryagent-idp-sdk)
+├── tsconfig.json           (strict mode, ES2022 target)
+├── .env.example            (AGENTIDP_API_URL, AGENTIDP_CLIENT_ID=<pre-filled>, AGENTIDP_CLIENT_SECRET=<placeholder>)
+├── .gitignore              (.env on first line)
+├── src/
+│   └── index.ts            (imports SDK, creates client from env, issues token, logs success)
+└── README.md               (step-by-step: cp .env.example .env, fill secret, npm install, npm start)
+```
+
+**ZIP Archive Contents (Python example):**
+```
+sentryagent-scaffold-my-agent-python/
+├── requirements.txt        (sentryagent-idp)
+├── .env.example            (AGENTIDP_API_URL, AGENTIDP_CLIENT_ID=<pre-filled>, AGENTIDP_CLIENT_SECRET=<placeholder>)
+├── .gitignore              (.env on first line)
+├── main.py                 (imports SDK, creates client from env, issues token, prints success)
+└── README.md               (step-by-step: cp .env.example .env, fill secret, pip install -r requirements.txt, python main.py)
+```
+
+**ZIP Archive Contents (Go example):**
+```
+sentryagent-scaffold-my-agent-go/
+├── go.mod                  (module: my-agent, dep: github.com/sentryagent/sentryagent-idp-go)
+├── .env.example            (AGENTIDP_API_URL, AGENTIDP_CLIENT_ID=<pre-filled>, AGENTIDP_CLIENT_SECRET=<placeholder>)
+├── .gitignore              (.env on first line)
+├── main.go                 (imports SDK, creates client from env, issues token, logs success)
+└── README.md               (step-by-step instructions)
+```
+
+**Error Responses:**
+
+| Status | Code | Description |
+|---|---|---|
+| 400 | `INVALID_LANGUAGE` | `language` query param is not one of the supported values |
+| 401 | `UNAUTHORIZED` | Missing or invalid Bearer token |
+| 403 | `FORBIDDEN` | Authenticated tenant does not own this agent |
+| 404 | `AGENT_NOT_FOUND` | No agent with `agentId` found |
+| 429 | `RATE_LIMITED` | Rate limit exceeded |
+
+**Business Rules:**
+- `clientId` is pre-filled in `.env.example` — taken from the agent's credentials in the database
+- `clientSecret` is always a `<your-client-secret>` placeholder — never returned in scaffold (credentials security policy)
+- The ZIP is generated in memory using `archiver` — no disk writes on the server
+- Scaffold generation is rate-limited to 10 requests per minute per tenant (separate from the main tier rate limit)
+- An audit log entry is created with `event_type: "scaffold.generated"`, `metadata.language`
+
+---
+
+### Developer Portal: Elements API Explorer Upgrade
+
+**File to modify:** `portal/app/api-explorer/page.tsx`
+
+**Current state (Phase 4):** Embeds `swagger-ui-react` (Swagger UI v4) loaded from `NEXT_PUBLIC_API_URL/openapi.json`.
+
+**New state (Phase 5):** Replaces `swagger-ui-react` with `@stoplight/elements` (`<API>` component). Stoplight Elements provides: three-panel layout (navigation, docs, try-it), built-in code samples in multiple languages, mock server support, and better mobile responsiveness.
+
+**Implementation:**
+
+```tsx
+// portal/app/api-explorer/page.tsx (complete replacement)
+'use client';
+
+import { API } from '@stoplight/elements';
+import '@stoplight/elements/styles.min.css';
+
+export default function ApiExplorerPage() {
+    return (
+        <main className="h-screen w-full">
+            <API
+                apiDescriptionUrl={`${process.env.NEXT_PUBLIC_API_URL}/openapi.json`}
+                router="hash"
+                layout="sidebar"
+                hideSchemas={false}
+                tryItCredentialsPolicy="same-origin"
+            />
+        </main>
+    );
+}
+```
+
+**Files modified:**
+- `portal/app/api-explorer/page.tsx` — replace Swagger UI component with Elements `<API>` component
+- `portal/package.json` — replace `swagger-ui-react` with `@stoplight/elements`
+
+---
+
+### CLI: `sentryagent scaffold` Command
+
+**File to create:** `cli/src/commands/scaffold.ts`
+
+**Command syntax:**
+```
+sentryagent scaffold --agent-id <id> [--language typescript|python|go|java|rust] [--out <directory>]
+```
+
+**Options:**
+
+| Option | Alias | Default | Description |
+|---|---|---|---|
+| `--agent-id <id>` | `-a` | (required) | Agent ID to scaffold for |
+| `--language <lang>` | `-l` | `typescript` | Target language for scaffold |
+| `--out <dir>` | `-o` | `.` (current dir) | Directory to extract scaffold ZIP into |
+
+**Behavior:**
+1. Load config from `~/.sentryagent/config.json` — fail with helpful message if not configured
+2. Issue an API call: `GET /sdk/scaffold/{agentId}?language={language}` with Bearer token from `POST /oauth2/token`
+3. Receive ZIP stream, pipe through `unzipper` to extract into `--out` directory
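+4. Print success message: `Scaffold generated at {out}/sentryagent-scaffold-{agentName}-{language}/`, where `{out}` is the `--out` directory and the folder name is the top-level directory inside the ZIP
+5. Print next steps:
+   ```
+   Next steps:
+     1. cd {out}/sentryagent-scaffold-{agentName}-{language}
+     2. cp .env.example .env
+     3. Add your AGENTIDP_CLIENT_SECRET to .env
+     4. npm install  (or equivalent for your language)
+     5. npm start    (or equivalent for your language)
+   ```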
+
+**Error handling:**
+- Agent not found: print `Agent {agentId} not found.`
+- Forbidden: print `You do not own agent {agentId}.`
+- Invalid language: print `Unsupported language '{lang}'. Choose: typescript, python, go, java, rust`
+- Output directory does not exist: create it
+- Output directory exists and is non-empty: prompt the user for confirmation before extracting
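
The HTTP-level cases in the list above map naturally onto the scaffold endpoint's status codes. A sketch of that mapping; `scaffoldErrorMessage` is a hypothetical helper, and the fallback message for other statuses is an assumption, since the spec does not define one.

```typescript
// Translate a scaffold endpoint status into the CLI message from the spec.
function scaffoldErrorMessage(status: number, agentId: string, language: string): string {
    switch (status) {
        case 400:
            return `Unsupported language '${language}'. Choose: typescript, python, go, java, rust`;
        case 403:
            return `You do not own agent ${agentId}.`;
        case 404:
            return `Agent ${agentId} not found.`;
        default:
            // Hypothetical fallback; not specified in this document.
            return `Scaffold request failed with HTTP ${status}.`;
    }
}
```

Keeping the mapping in one pure function makes the messages unit-testable without mocking the HTTP client.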
+
+**New CLI dependencies** (add to `cli/package.json`):
+- `unzipper` — streaming ZIP extraction (pure JS, no native deps)
+
+### New Source Files
+
+| File | Description |
+|---|---|
+| `src/services/ScaffoldService.ts` | Business logic: build ZIP archive in memory using `archiver` |
+| `src/controllers/ScaffoldController.ts` | HTTP handler: stream ZIP response |
+| `src/routes/scaffold.ts` | Express router: `GET /sdk/scaffold/:agentId` |
+| `src/types/scaffold.ts` | TypeScript interfaces: `ScaffoldLanguage`, `ScaffoldOptions`, `ScaffoldTemplate` |
+| `src/templates/scaffold/typescript/` | Template files for TypeScript scaffold (package.json, tsconfig.json, index.ts, .env.example, .gitignore, README.md) |
+| `src/templates/scaffold/python/` | Template files for Python scaffold (requirements.txt, main.py, .env.example, .gitignore, README.md) |
+| `src/templates/scaffold/go/` | Template files for Go scaffold (go.mod, main.go, .env.example, .gitignore, README.md) |
+| `src/templates/scaffold/java/` | Template files for Java scaffold (pom.xml, Main.java, .env.example, .gitignore, README.md) |
+| `src/templates/scaffold/rust/` | Template files for Rust scaffold (Cargo.toml, src/main.rs, .env.example, .gitignore, README.md) |
+| `cli/src/commands/scaffold.ts` | CLI scaffold command implementation |
+
+### Modified Source Files
+
+| File | Change |
+|---|---|
+| `src/routes/index.ts` | Register `scaffold` router |
+| `src/app.ts` | No change needed (routes registered via index) |
+| `package.json` (API) | Add `archiver` and `@types/archiver` |
+| `portal/app/api-explorer/page.tsx` | Replace Swagger UI with Elements |
+| `portal/package.json` | Replace `swagger-ui-react` with `@stoplight/elements` |
+| `cli/src/index.ts` | Register `scaffold` command with Commander |
+| `cli/package.json` | Add `unzipper` and `@types/unzipper` |
+| `docs/openapi.yaml` | Add `GET /sdk/scaffold/:agentId` endpoint |
+
+### `ScaffoldService` Interface
+
+```typescript
+interface IScaffoldService {
+    /**
+     * Generate an in-memory ZIP archive for the given agent and language.
+     * Returns a Node.js Readable stream of the ZIP binary.
+     * Template variables injected: {{AGENT_ID}}, {{AGENT_NAME}}, {{CLIENT_ID}}, {{API_URL}}
+     */
+    generateScaffold(
+        agentId: string,
+        language: ScaffoldLanguage,
+        apiUrl: string
+    ): Promise<{ stream: NodeJS.ReadableStream; filename: string }>;
+}
+```
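
The template-variable injection described in the doc comment can be sketched as a plain string substitution. `renderTemplate` and `TemplateVars` are hypothetical names, the sample values are invented, and leaving unknown `{{...}}` tokens untouched is an assumption about the desired behavior.

```typescript
type TemplateVars = Record<'AGENT_ID' | 'AGENT_NAME' | 'CLIENT_ID' | 'API_URL', string>;

// Replace each {{NAME}} token with its value; unknown tokens are left untouched.
function renderTemplate(template: string, vars: TemplateVars): string {
    return template.replace(/\{\{([A-Z_]+)\}\}/g, (token: string, name: string) =>
        name in vars ? vars[name as keyof TemplateVars] : token,
    );
}

// Sample values below are invented for the illustration.
const sampleVars: TemplateVars = {
    AGENT_ID: 'agent-123',
    AGENT_NAME: 'my-agent',
    CLIENT_ID: 'client-abc',
    API_URL: 'https://api.example.test',
};

// The secret line is a literal placeholder in the template, never a variable,
// so no code path can substitute a real secret into it.
const envExample = renderTemplate(
    'AGENTIDP_API_URL={{API_URL}}\n' +
    'AGENTIDP_CLIENT_ID={{CLIENT_ID}}\n' +
    'AGENTIDP_CLIENT_SECRET=<your-client-secret>\n',
    sampleVars,
);
```

Because `TemplateVars` has no secret field at all, the "never returned in scaffold" rule is enforced by the type rather than by convention.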
+
+### Prometheus Metrics
+
+| Metric | Type | Labels | Description |
+|---|---|---|---|
+| `agentidp_scaffold_generated_total` | Counter | `language` | Scaffold ZIPs generated by language |
+| `agentidp_scaffold_generation_duration_ms` | Histogram | `language` | Time to generate scaffold ZIP |
+
+### Acceptance Criteria
+
+- `GET /sdk/scaffold/:agentId?language=typescript` returns a valid ZIP with all 6 template files
+- ZIP contains `.env.example` with `AGENTIDP_CLIENT_ID` pre-filled and `AGENTIDP_CLIENT_SECRET=<your-client-secret>` as placeholder
+- ZIP never contains the actual client secret
+- `GET /sdk/scaffold/:agentId?language=python` returns Python-specific template files
+- All 5 languages (typescript, python, go, java, rust) return valid ZIPs
+- HTTP 400 on unknown `language` query param
+- HTTP 403 when authenticated tenant does not own the agent
+- `sentryagent scaffold --agent-id abc123 --language go` extracts scaffold to current directory
+- `sentryagent scaffold --agent-id abc123 --language python --out /tmp/myagent` extracts to `/tmp/myagent`
+- Developer portal `/api-explorer` renders Stoplight Elements with the sidebar layout, and the TypeScript build passes
+- Unit tests cover: scaffold generation (each language), forbidden access, invalid language
+- Integration tests cover: scaffold endpoint response type, content-disposition header, ZIP validity
--- a/openspec/changes/phase-5-scale-ecosystem/specs/rust-sdk/spec.md
+++ b/openspec/changes/phase-5-scale-ecosystem/specs/rust-sdk/spec.md
@@ -0,0 +1,289 @@
+## WS1: Rust SDK
+
+### Purpose
+
+Deliver a production-grade, idiomatic Rust SDK for SentryAgent.ai AgentIdP. The SDK covers all 14 API endpoints, provides a thread-safe `TokenManager` with automatic token refresh, uses `async/await` throughout via `tokio`, and models all errors as a typed `AgentIdPError` enum. Rust developers building high-performance or safety-critical AI agents can integrate with SentryAgent.ai without writing HTTP boilerplate.
+
+The SDK is published to crates.io as `sentryagent-idp`. It mirrors the API surface of the Go SDK (the most recently authored and cleanest SDK) to reduce cognitive load for polyglot teams.
+
+### New Files to Create
+
+| File | Description |
+|---|---|
+| `sdk-rust/Cargo.toml` | Crate manifest — name: `sentryagent-idp`, edition: 2021 |
+| `sdk-rust/src/lib.rs` | Crate root — re-exports `AgentIdPClient`, `TokenManager`, `AgentIdPError`, all model types |
+| `sdk-rust/src/client.rs` | `AgentIdPClient` struct — wraps `reqwest::Client`, holds base URL + credentials |
+| `sdk-rust/src/token_manager.rs` | `TokenManager` struct — `Arc<Mutex<TokenCache>>`, auto-refresh logic |
+| `sdk-rust/src/error.rs` | `AgentIdPError` enum — all typed error variants, implements `std::error::Error` |
+| `sdk-rust/src/models.rs` | All request/response model structs — serde Serialize/Deserialize |
+| `sdk-rust/src/agents.rs` | Agent CRUD methods on `AgentIdPClient` |
+| `sdk-rust/src/oauth2.rs` | Token issuance and refresh methods |
+| `sdk-rust/src/credentials.rs` | Credential management methods |
+| `sdk-rust/src/audit.rs` | Audit log query methods |
+| `sdk-rust/src/marketplace.rs` | Marketplace listing and detail methods |
+| `sdk-rust/src/delegation.rs` | A2A delegation methods (WS2 integration) |
+| `sdk-rust/examples/quickstart.rs` | Working quickstart example — register agent, issue token, make authenticated call |
+| `sdk-rust/README.md` | Installation, configuration, quickstart, all methods with examples |
+| `sdk-rust/tests/integration_test.rs` | Integration tests against a real API instance (reads `AGENTIDP_API_URL` env var) |
+
+### Cargo.toml Dependencies
+
+```toml
+[dependencies]
+tokio = { version = "1.35", features = ["full"] }
+reqwest = { version = "0.11", features = ["json", "rustls-tls"] }
+serde = { version = "1.0", features = ["derive"] }
+serde_json = "1.0"
+uuid = { version = "1.6", features = ["v4"] }
+thiserror = "1.0"
+async-trait = "0.1"
+
+[dev-dependencies]
+tokio-test = "0.4"
+mockito = "1.2"
+```
+
+### Public API Surface
+
+#### `AgentIdPClient`
+
+```rust
+pub struct AgentIdPClient {
+    base_url: String,
+    client_id: String,
+    client_secret: String,
+    http: reqwest::Client,
+    token_manager: Arc<TokenManager>, // TokenManager locks its cache internally, so no outer Mutex is needed
+}
+
+impl AgentIdPClient {
+    /// Create a new client. Does not make any network calls at construction time.
+    pub fn new(base_url: &str, client_id: &str, client_secret: &str) -> Self;
+
+    /// Create a client from environment variables:
+    /// AGENTIDP_API_URL, AGENTIDP_CLIENT_ID, AGENTIDP_CLIENT_SECRET
+    pub fn from_env() -> Result<Self, AgentIdPError>;
+
+    // Agent methods
+    pub async fn register_agent(&self, req: RegisterAgentRequest) -> Result<Agent, AgentIdPError>;
+    pub async fn get_agent(&self, agent_id: &str) -> Result<Agent, AgentIdPError>;
+    pub async fn list_agents(&self, page: u32, per_page: u32) -> Result<AgentList, AgentIdPError>;
+    pub async fn update_agent(&self, agent_id: &str, req: UpdateAgentRequest) -> Result<Agent, AgentIdPError>;
+    pub async fn delete_agent(&self, agent_id: &str) -> Result<(), AgentIdPError>;
+
+    // OAuth2 token methods
+    pub async fn issue_token(&self, agent_id: &str, scopes: &[&str]) -> Result<TokenResponse, AgentIdPError>;
+
+    // Credential methods
+    pub async fn generate_credentials(&self, agent_id: &str) -> Result<Credentials, AgentIdPError>;
+    pub async fn rotate_credentials(&self, agent_id: &str) -> Result<Credentials, AgentIdPError>;
+    pub async fn revoke_credentials(&self, agent_id: &str) -> Result<(), AgentIdPError>;
+
+    // Audit log methods
+    pub async fn list_audit_logs(&self, filters: AuditLogFilters) -> Result<AuditLogList, AgentIdPError>;
+
+    // Marketplace methods
+    pub async fn list_public_agents(&self, filters: MarketplaceFilters) -> Result<MarketplaceAgentList, AgentIdPError>;
+    pub async fn get_public_agent(&self, agent_id: &str) -> Result<MarketplaceAgent, AgentIdPError>;
+
+    // Delegation methods (WS2)
+    pub async fn delegate(&self, req: DelegateRequest) -> Result<DelegationToken, AgentIdPError>;
+    pub async fn verify_delegation(&self, token: &str) -> Result<DelegationVerification, AgentIdPError>;
+}
+```
+
+#### `TokenManager`
+
+```rust
+/// Thread-safe token cache with automatic refresh.
+/// Holds the current access token and its expiry.
+/// Re-issues a token when it is within 60 seconds of expiry.
+pub struct TokenManager {
+    client_id: String,
+    client_secret: String,
+    api_url: String,
+    cache: Arc<tokio::sync::Mutex<TokenCache>>, // async-aware mutex, safe to hold across the refresh `.await`
+}
+
+struct TokenCache {
+    access_token: Option<String>,
+    expires_at: Option<std::time::Instant>,
+}
+
+impl TokenManager {
+    pub fn new(api_url: &str, client_id: &str, client_secret: &str) -> Self;
+
+    /// Returns a valid access token. Refreshes automatically if expired or within 60s of expiry.
+    pub async fn get_token(&self) -> Result<String, AgentIdPError>;
+}
+```
+
+#### `AgentIdPError`
+
+```rust
+#[derive(Debug, thiserror::Error)]
+pub enum AgentIdPError {
+    #[error("HTTP request failed: {0}")]
+    HttpError(#[from] reqwest::Error),
+
+    #[error("API error {status}: {message}")]
+    ApiError { status: u16, message: String, code: Option<String> },
+
+    #[error("Authentication failed: {0}")]
+    AuthError(String),
+
+    #[error("Agent not found: {0}")]
+    NotFound(String),
+
+    #[error("Rate limit exceeded. Retry after {retry_after_secs}s")]
+    RateLimited { retry_after_secs: u64 },
+
+    #[error("Invalid configuration: {0}")]
+    ConfigError(String),
+
+    #[error("Serialization error: {0}")]
+    SerdeError(#[from] serde_json::Error),
+
+    #[error("Delegation chain invalid: {0}")]
+    DelegationError(String),
+}
+```
+
+### Model Structs (complete — no placeholders)
+
+```rust
+// Request types
+pub struct RegisterAgentRequest {
+    pub name: String,
+    pub description: Option<String>,
+    pub capabilities: Vec<String>,
+    pub metadata: Option<serde_json::Value>,
+}
+
+pub struct UpdateAgentRequest {
+    pub name: Option<String>,
+    pub description: Option<String>,
+    pub capabilities: Option<Vec<String>>,
+    pub is_public: Option<bool>,
+    pub metadata: Option<serde_json::Value>,
+}
+
+pub struct AuditLogFilters {
+    pub agent_id: Option<String>,
+    pub event_type: Option<String>,
+    pub from: Option<String>,   // ISO 8601
+    pub to: Option<String>,     // ISO 8601
+    pub page: u32,
+    pub per_page: u32,
+}
+
+pub struct MarketplaceFilters {
+    pub q: Option<String>,
+    pub capability: Option<String>,
+    pub publisher: Option<String>,
+    pub page: u32,
+    pub per_page: u32,
+}
+
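+#[derive(Debug, Clone, serde::Serialize)]
+#[serde(rename_all = "camelCase")] // wire format: delegateeAgentId, scopes, ttlSeconds
+pub struct DelegateRequest {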
+    pub delegatee_agent_id: String,
+    pub scopes: Vec<String>,
+    pub ttl_seconds: u64,
+}
+
+// Response types
+pub struct Agent {
+    pub id: String,
+    pub name: String,
+    pub description: Option<String>,
+    pub capabilities: Vec<String>,
+    pub did: String,
+    pub is_public: bool,
+    pub created_at: String,
+    pub updated_at: String,
+}
+
+pub struct AgentList {
+    pub agents: Vec<Agent>,
+    pub total: u64,
+    pub page: u32,
+    pub per_page: u32,
+}
+
+pub struct TokenResponse {
+    pub access_token: String,
+    pub token_type: String,
+    pub expires_in: u64,
+    pub scope: String,
+}
+
+pub struct Credentials {
+    pub client_id: String,
+    pub client_secret: Option<String>,  // Some only on generate/rotate; None on read
+    pub created_at: String,
+}
+
+pub struct AuditLogEntry {
+    pub id: String,
+    pub agent_id: String,
+    pub event_type: String,
+    pub actor: String,
+    pub metadata: serde_json::Value,
+    pub timestamp: String,
+}
+
+pub struct AuditLogList {
+    pub entries: Vec<AuditLogEntry>,
+    pub total: u64,
+    pub page: u32,
+    pub per_page: u32,
+}
+
+pub struct MarketplaceAgent {
+    pub id: String,
+    pub name: String,
+    pub description: Option<String>,
+    pub capabilities: Vec<String>,
+    pub did_document: serde_json::Value,
+    pub publisher: String,
+    pub created_at: String,
+}
+
+pub struct MarketplaceAgentList {
+    pub agents: Vec<MarketplaceAgent>,
+    pub total: u64,
+    pub page: u32,
+    pub per_page: u32,
+}
+
+pub struct DelegationToken {
+    pub delegation_token: String,
+    pub chain_id: String,
+    pub expires_at: String,
+}
+
+pub struct DelegationVerification {
+    pub valid: bool,
+    pub chain_id: String,
+    pub delegator_agent_id: String,
+    pub delegatee_agent_id: String,
+    pub scopes: Vec<String>,
+    pub expires_at: String,
+}
+```
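The filter structs combine optional fields with required pagination fields. A minimal sketch of turning `AuditLogFilters` into query parameters follows; `to_query_pairs` is a hypothetical helper, and the parameter names are assumed to mirror the field names, which the spec does not confirm here.

```rust
/// Local copy of `AuditLogFilters` from the model structs.
struct AuditLogFilters {
    agent_id: Option<String>,
    event_type: Option<String>,
    from: Option<String>, // ISO 8601
    to: Option<String>,   // ISO 8601
    page: u32,
    per_page: u32,
}

/// Hypothetical helper: collects the filters that are set as key/value pairs.
/// Values are not URL-encoded here; that is left to the HTTP client.
fn to_query_pairs(filters: &AuditLogFilters) -> Vec<(&'static str, String)> {
    let optional = [
        ("agent_id", &filters.agent_id),
        ("event_type", &filters.event_type),
        ("from", &filters.from),
        ("to", &filters.to),
    ];
    let mut pairs = Vec::new();
    for (key, value) in optional {
        // Unset filters are skipped rather than sent as empty strings.
        if let Some(v) = value {
            pairs.push((key, v.clone()));
        }
    }
    // Pagination fields are not optional, so they are always included.
    pairs.push(("page", filters.page.to_string()));
    pairs.push(("per_page", filters.per_page.to_string()));
    pairs
}

fn main() {
    let filters = AuditLogFilters {
        agent_id: Some(String::from("agent-123")),
        event_type: None,
        from: Some(String::from("2026-04-01T00:00:00Z")),
        to: None,
        page: 1,
        per_page: 25,
    };
    for (key, value) in to_query_pairs(&filters) {
        println!("{key}={value}");
    }
}
```

The same shape would apply to `MarketplaceFilters`, with `q`, `capability`, and `publisher` as the optional keys.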
+
+### Database Schema Changes
+
+None. The Rust SDK is a client library — it makes HTTP calls to the existing API. No database changes are required for WS1.
+
+### Acceptance Criteria
+
+- `cargo build` passes with zero warnings (deny warnings enforced via `#![deny(warnings)]` in `lib.rs`)
+- `cargo clippy` passes with zero warnings
+- `cargo test` runs all unit tests — all pass
+- Integration tests pass against a live API instance when `AGENTIDP_API_URL`, `AGENTIDP_CLIENT_ID`, and `AGENTIDP_CLIENT_SECRET` are all set
+- `TokenManager::get_token()` is thread-safe: concurrent calls from multiple `tokio` tasks do not produce race conditions (verified by a concurrent-call test with 50 parallel futures)
+- Zero `unwrap()` calls in `src/`; `unwrap()` is permitted only in `examples/` and `tests/`, where panicking is acceptable
+- All public items have `///` doc comments
+- `cargo doc --no-deps` generates docs without errors
+- Published to crates.io as `sentryagent-idp` version `1.0.0`
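The concurrency criterion above (50 parallel callers, no races) can be illustrated with a standard-library sketch. It swaps in OS threads and `std::sync::Mutex` for `tokio` tasks so that it runs without external crates; an async implementation would more likely hold a `tokio::sync::Mutex` across the refresh call. `SketchTokenManager`, its refresh counter, and the fixed token value are hypothetical.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::{Arc, Mutex};
use std::thread;

/// Hypothetical stand-in for `TokenManager`, with a counter to observe refreshes.
struct SketchTokenManager {
    cached: Mutex<Option<String>>,
    refreshes: AtomicUsize,
}

impl SketchTokenManager {
    fn new() -> Self {
        Self { cached: Mutex::new(None), refreshes: AtomicUsize::new(0) }
    }

    fn get_token(&self) -> String {
        // Holding the lock across both the check and the refresh makes the
        // two steps atomic, so only the first caller performs the refresh.
        let mut guard = self.cached.lock().unwrap_or_else(|poisoned| poisoned.into_inner());
        if let Some(token) = &*guard {
            return token.clone();
        }
        self.refreshes.fetch_add(1, Ordering::SeqCst);
        let token = String::from("token-1"); // placeholder for a real token request
        *guard = Some(token.clone());
        token
    }
}

/// Spawns `callers` threads that all request a token; returns the refresh count and tokens.
fn run_concurrent_calls(callers: usize) -> (usize, Vec<String>) {
    let manager = Arc::new(SketchTokenManager::new());
    let handles: Vec<_> = (0..callers)
        .map(|_| {
            let m = Arc::clone(&manager);
            thread::spawn(move || m.get_token())
        })
        .collect();
    let tokens: Vec<String> = handles
        .into_iter()
        .map(|h| h.join().expect("caller thread panicked"))
        .collect();
    (manager.refreshes.load(Ordering::SeqCst), tokens)
}

fn main() {
    let (refreshes, tokens) = run_concurrent_calls(50);
    println!("{} callers received a token after {} refresh(es)", tokens.len(), refreshes);
}
```

Asserting on the refresh count, not just on the absence of panics, gives the test a concrete property to check: every caller sees the same token and the refresh happens once.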