Phase 1: AI Tool Cost Comparison Plan

Roo Code plus Kong AI vs GitHub Copilot for Architecture Practice

This is Phase 1 of the Continuous Architecture Platform POC. It establishes the AI toolchain selection that will power all subsequent phases.


Project	Continuous Architecture Platform POC
Phase	Phase 1 - AI Tool Cost Comparison
Author	Christopher Blaisdell
Date	2026-02-28
Status	Synthetic workspace complete, ready for execution
Objective	Determine the monthly cost per architect seat for Roo Code + Kong AI vs GitHub Copilot by reproducing real architecture workflows with synthetic data on a separate computer (ZERO company data)

1 Goal

Determine the monthly cost per architect seat for two AI-assisted architecture toolchains:

Option A	Option B
Roo Code (VS Code extension) + Kong AI (API gateway to LLMs)	GitHub Copilot (Chat, Agent mode, Extensions)

The key question: What does it cost per month to give one solution architect AI-assisted tooling under each option?

GitHub Copilot has a simple per-seat monthly price. Roo Code + Kong AI has a usage-based cost model (per-token LLM charges routed through Kong AI) that depends on actual usage volume. To compare them fairly, we need to measure what a typical architecture month looks like in terms of token consumption, then price that against both models.
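The pricing comparison described above reduces to a small calculation once a month of token usage has been measured. The following is a minimal sketch, not a definitive cost model: the class and function names are hypothetical, and every price and token count in the example is a placeholder to be replaced with measured usage and current vendor price lists.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPrice:
    """USD per million tokens. Values passed in are placeholders, not quotes."""
    input_per_million: float
    output_per_million: float


def usage_based_monthly_cost(input_tokens: int, output_tokens: int,
                             price: TokenPrice,
                             fixed_monthly_cost: float = 0.0) -> float:
    """Monthly cost of one seat under per-token pricing.

    fixed_monthly_cost covers any flat charges (for example gateway hosting)
    that apply regardless of volume. It is an assumption of this sketch.
    """
    token_cost = (
        input_tokens / 1_000_000 * price.input_per_million
        + output_tokens / 1_000_000 * price.output_per_million
    )
    return round(token_cost + fixed_monthly_cost, 2)


def cheaper_option(usage_cost: float, flat_seat_price: float) -> str:
    """Name the cheaper pricing model for one seat, or 'tie'."""
    if usage_cost < flat_seat_price:
        return "usage-based"
    if usage_cost > flat_seat_price:
        return "flat-seat"
    return "tie"


if __name__ == "__main__":
    # Placeholder inputs: replace with measured tokens and real price lists.
    measured = usage_based_monthly_cost(
        input_tokens=40_000_000,
        output_tokens=4_000_000,
        price=TokenPrice(input_per_million=3.0, output_per_million=15.0),
    )
    print(measured, cheaper_option(measured, flat_seat_price=50.0))
```

Keeping input and output tokens separate matters because LLM providers commonly price them differently, so a single blended token count would hide the effect of long generated documents.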

The comparison must be performed on a separate computer using entirely synthetic data so that no proprietary enterprise information leaves the corporate environment.

2 Approach Summary

Phase 1 (This Computer): Use real corporate architecture data to create a synthetic dataset that is structurally representative but contains zero company data.

Phase 2 (Separate Computer): Use the synthetic dataset on a clean machine to execute a representative month of architecture work against both toolchains, measuring token usage and quality to calculate monthly cost per seat.

3 Data Sanitization Principles

These rules apply to ALL synthetic data produced in Phase 1:

Rule	Description
No Real Service Names	Replace all microservice names (ms-xxx, ms-yyy, etc.) with fictional equivalents
No Real Endpoints	Replace all API paths, URLs, hostnames with synthetic equivalents
No Real Ticket IDs	Replace all JIRA ticket IDs (UPT-XXXXXX) with fictional project keys
No Real People	Replace all team member names, email addresses, Slack handles
No Real Business Logic	Replace all domain-specific business rules with plausible fictional ones
No Real Infrastructure	Replace all GitLab URLs, Confluence page IDs, Elastic indices, Kong routes
No Real Data	Replace all guest data, reservation data, payment data, hotel data
Structural Fidelity	Preserve the structure, complexity, and relationships between artifacts
Workflow Fidelity	Preserve the steps, tool usage patterns, and decision-making process
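The rules in the table above can be partly checked mechanically before the synthetic dataset leaves the corporate machine. The sketch below is one possible approach under stated assumptions: the two patterns are derived only from the `ms-` service prefix and `UPT-` ticket key mentioned in the table, the function names are hypothetical, and a real check would need a much fuller denylist (hostnames, people, infrastructure identifiers). A clean scan does not prove the data is clean.

```python
import re
from pathlib import Path

# Hypothetical patterns based only on the examples in the rule table.
# Extend with the real denylist before relying on this check.
FORBIDDEN_PATTERNS = {
    "real ticket id": re.compile(r"\bUPT-\d+\b"),
    "real service name": re.compile(r"\bms-[a-z0-9-]+\b"),
}


def scan_text(text: str) -> list[tuple[int, str, str]]:
    """Return (line number, rule label, matched text) for each violation."""
    findings = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for label, pattern in FORBIDDEN_PATTERNS.items():
            for match in pattern.finditer(line):
                findings.append((line_no, label, match.group(0)))
    return findings


def scan_tree(root: Path) -> dict[str, list[tuple[int, str, str]]]:
    """Scan every file under root and report only files with findings."""
    report = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            text = path.read_text(encoding="utf-8", errors="ignore")
            findings = scan_text(text)
            if findings:
                report[path.relative_to(root).as_posix()] = findings
    return report


if __name__ == "__main__":
    sample = "SYN-10001 calls ms-example, see UPT-123456"
    for finding in scan_text(sample):
        print(finding)
```

A pattern scan of this kind only catches identifiers with a recognizable shape. Business logic and domain data still need human review.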

4 What Needs to Be Synthesized

The architecture workflow involves the following artifacts and activities. Each must be recreated in synthetic form.

4.1 Corporate Service Registry (Synthetic corporate-services Equivalent)

Source on this machine: corporate-services/services/ (120+ OpenAPI/Swagger specs)

Synthetic equivalent to produce:

- 15-20 fictional microservice Swagger/OpenAPI YAML specs
- Each spec should be realistic in complexity (endpoints, schemas, error codes)
- Service names from a fictional domain (e.g., a fictional streaming platform, or a fictional logistics company)
- Cross-service references and shared models to preserve integration complexity

Approach: Select 15-20 representative real specs, study their structure and complexity, then write new specs in a completely different domain.

4.2 Corporate Architecture Diagrams (Synthetic Diagrams Equivalent)

Source on this machine: corporate-services/diagrams/ (PlantUML component, sequence, system diagrams)

Synthetic equivalent to produce:

- 8-12 PlantUML component diagrams showing the synthetic microservice ecosystem
- Include standard diagram patterns: include files, macros, component groupings, Kafka topics
- Preserve the architectural style (box groupings, color coding, participant naming)

Approach: Study the diagram structure and style, create equivalent diagrams for the synthetic domain.

4.3 Ticket Workflow Artifacts

Source on this machine: private-analysis/docs/work-items/tickets/ (9 ticket folders with full analysis)

Synthetic equivalent to produce:

- 3-5 synthetic ticket folders, each at different stages of completeness
- Each ticket folder following the standard structure:

_SYN-XXXXX-brief-title/
  SYN-XXXXX-solution-design.md
  1.requirements/
    SYN-XXXXX.ticket.report.md
  2.analysis/
    simple.explanation.md
  3.solution/
    a.assumptions/assumptions.md
    b.current.state/investigations.md
    c.decisions/decisions.md
    g.guidance/guidance.md
    i.impacts/
      impacts.md
      impact.1/impact.1.md
      impact.2/impact.2.md
    r.risks/risks.md
    u.user.stories/user-stories.md
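The folder structure above can be created consistently with a small scaffolding helper, which keeps the synthetic tickets structurally identical. This is a minimal sketch assuming the layout exactly as listed: the function name `scaffold_ticket` and the `impact_count` parameter are hypothetical, and the files are created empty.

```python
import tempfile
from pathlib import Path

# Relative paths copied from the standard structure; {id} is the ticket key.
TICKET_LAYOUT = [
    "{id}-solution-design.md",
    "1.requirements/{id}.ticket.report.md",
    "2.analysis/simple.explanation.md",
    "3.solution/a.assumptions/assumptions.md",
    "3.solution/b.current.state/investigations.md",
    "3.solution/c.decisions/decisions.md",
    "3.solution/g.guidance/guidance.md",
    "3.solution/i.impacts/impacts.md",
    "3.solution/r.risks/risks.md",
    "3.solution/u.user.stories/user-stories.md",
]


def scaffold_ticket(root: Path, ticket_id: str, slug: str,
                    impact_count: int = 0) -> Path:
    """Create an empty ticket workspace and return its directory."""
    ticket_dir = root / f"_{ticket_id}-{slug}"
    relative_paths = [entry.format(id=ticket_id) for entry in TICKET_LAYOUT]
    # One sub-folder per impact: impact.1/impact.1.md, impact.2/impact.2.md, ...
    relative_paths += [
        f"3.solution/i.impacts/impact.{n}/impact.{n}.md"
        for n in range(1, impact_count + 1)
    ]
    for relative in relative_paths:
        target = ticket_dir / relative
        target.parent.mkdir(parents=True, exist_ok=True)
        target.touch(exist_ok=True)  # keeps existing content if re-run
    return ticket_dir


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        created = scaffold_ticket(Path(tmp), "SYN-10001", "brief-title",
                                  impact_count=2)
        for path in sorted(created.rglob("*.md")):
            print(path.relative_to(created).as_posix())
```

Tickets at earlier stages (for example, intake only) could be produced by scaffolding the full tree and then filling in only the artifacts that stage would have.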

Ticket complexity tiers to cover:

Tier	Description	Example Synthetic Scenario
Simple	Single service modification, 1-2 endpoints	Add a new field to a GET response
Medium	2-3 services impacted, API contract changes	New integration between two services requiring schema changes
Complex	5+ services, new Kafka events, vendor integration	New capability requiring orchestration across multiple services and a third-party vendor

Synthesis Mapping: Real Tickets to Synthetic Equivalents

Each synthetic ticket is structurally based on a real internal ticket. The domain, names, services, business rules, and all identifiable content are fully replaced, but the structural complexity, artifact count, comment thread depth, and iterative clarification patterns are preserved.

Synthetic ID	Tier	Stage	Based On (Real)	What Is Preserved (Structure Only)
SYN-10001	Simple	Complete	REDACTED-001 (single-field change)	Single-service field change, simple impact, 1 decision, minimal stakeholder discussion
SYN-10002	Medium	Complete	REDACTED-002 (classification with conditional logic)	Classification table with conditional logic, 3 behavioral patterns, booking source overrides mapped to a synthetic domain equivalent, 18-comment stakeholder thread with iterative clarification, multiple PlantUML activity/sequence diagrams, v1.0 to v1.6 solution evolution
SYN-10004	Medium	Investigation	REDACTED-004 (data overwrite bug)	Data ownership conflict investigation, field-level merge vs full-replacement analysis, race condition discovery, Elasticsearch validation queries, partial assumptions
SYN-10005	Simple	Intake	REDACTED-005 (schema field addition)	Schema change request, just classified and workspace created, ticket report only

4.4 Architecture Standards Framework (Public, Pre-existing)

Instead of sanitizing our internal standards, we will adopt an existing public architecture standards framework. This eliminates IP risk entirely and adds credibility (the AI tools must work with real, industry-recognized standards, not something we invented for the test).

Primary framework: arc42 (https://arc42.org)

arc42 is an open-source, Creative Commons licensed architecture documentation template used industry-wide. It provides:

arc42 Section	Maps to Internal Equivalent	Standards Complexity
1. Introduction and Goals	Ticket requirements, quality goals	Stakeholder tables, quality scenarios
2. Constraints	Business/technical constraints	Constraint classification
3. Context and Scope	Current state discovery	System context diagrams, external interfaces
4. Solution Strategy	Solution design overview	Top-level decomposition, tech decisions
5. Building Block View	Component architecture	Hierarchical white-box/black-box decomposition
6. Runtime View	Sequence diagrams, runtime scenarios	Important use cases, interaction patterns
7. Deployment View	Infrastructure/deployment docs	Hardware, environments, topology mapping
8. Crosscutting Concepts	Guidance documents	Domain models, patterns, implementation rules
9. Architectural Decisions	Decision documents (ADRs)	Decision rationale, options analysis
10. Quality Requirements	NFRs, quality attributes	Quality tree, quality scenarios
11. Risks and Technical Debt	Risk documents	Known problems, risk assessment
12. Glossary	Domain terminology	Ubiquitous language

Download format: GitHub Markdown Multi-Page (with help text) from arc42-template-EN-withhelp-gitHubMarkdownMP.zip

Supplementary frameworks to layer on top:

Framework	Source	What It Adds
MADR (Markdown Any Decision Records)	https://adr.github.io/madr/	Prescriptive ADR templates with options, pros/cons format
C4 Model diagramming standards	https://c4model.com/	Strict rules for context, container, component, code diagrams
ADR templates (Joel Parker Henderson collection)	https://github.com/joelparkerhenderson/architecture-decision-record	10+ ADR templates, examples, and governance rules
arc42 Quality Model	https://quality.arc42.org/	Detailed quality properties/attributes (ISO 25010 aligned)

What to download and include in the synthetic workspace:

1. arc42 template (GitHub Markdown MP with help) - 12 section files with detailed guidance
2. arc42 documentation site content (144 tips, 35 examples) - scraped or referenced
3. MADR template and examples from GitHub
4. ADR templates collection (Nygard, Tyree-Akerman, business case, Alexandrian)
5. C4 model diagramming rules and notation standards
6. arc42 Quality Model properties list

Then create a .ai-instructions/ directory that:

- Tells the AI agent to follow arc42 structure for all documentation
- Specifies which ADR template to use for decisions
- Mandates C4 model compliance for all diagrams
- Defines the ticket-to-arc42 mapping (how a JIRA ticket becomes arc42 artifacts)
- Includes formatting rules, header conventions, and cross-referencing standards

This instruction file IS synthetic but is small, contains no company data, and simply wires the public frameworks together.

Why this is better than sanitizing our standards:

- ZERO risk of company data leakage (all material is publicly available under open licenses such as Creative Commons)
- More rigorous test (real industry standards are more comprehensive than custom ones)
- More convincing comparison (stakeholders can verify the standards independently)
- Saves significant effort (no need to rewrite 40+ customization documents)

4.5 Source Code References (Synthetic Microservice Code)

Source on this machine: source-code/microservices/ (10 reference microservices)

Synthetic equivalent to produce:

- 5-8 synthetic microservice skeletons (Java/Spring Boot or similar)
- Each with representative structure: controllers, services, repositories, DTOs, configs
- Enough code to simulate "read source code for investigation" workflow steps
- Not full implementations, just enough structure and placeholder logic

4.6 Tool Scripts (Synthetic Tooling)

Source on this machine: private-analysis/scripts/ (JIRA client, GitLab client, Elastic searcher)

Synthetic equivalent to produce:

- Synthetic JIRA ticket extraction script (outputs synthetic ticket JSON)
- Synthetic merge request analysis script (outputs synthetic MR details)
- Synthetic Elastic query script (outputs synthetic log search results)
- These scripts should produce canned/mock outputs rather than calling real APIs
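A canned-output script of the kind listed above can be very small, since it only has to return fixed JSON in place of a real API call. The sketch below illustrates the idea for the ticket extraction script. The field names, summaries, and function names are all invented for illustration and do not reflect the real JIRA client or any real ticket schema. Only the ticket keys, tiers, and stages come from the synthesis mapping table.

```python
import json
from typing import Optional

# Canned data only: field names and summaries are invented for illustration.
CANNED_TICKETS = {
    "SYN-10001": {
        "key": "SYN-10001",
        "summary": "Add a new field to a GET response",
        "tier": "Simple",
        "stage": "Complete",
        "comments": [],
    },
    "SYN-10005": {
        "key": "SYN-10005",
        "summary": "Add a field to a shared schema",
        "tier": "Simple",
        "stage": "Intake",
        "comments": [],
    },
}


def list_ticket_keys(stage: Optional[str] = None) -> list[str]:
    """Return ticket keys, optionally filtered by stage, in sorted order."""
    return sorted(
        key for key, ticket in CANNED_TICKETS.items()
        if stage is None or ticket["stage"] == stage
    )


def fetch_ticket_json(key: str) -> str:
    """Return the canned ticket as JSON text, as a real client might print it."""
    if key not in CANNED_TICKETS:
        raise KeyError(f"unknown synthetic ticket: {key}")
    return json.dumps(CANNED_TICKETS[key], indent=2, sort_keys=True)


if __name__ == "__main__":
    print(list_ticket_keys(stage="Intake"))
    print(fetch_ticket_json("SYN-10001"))
```

Because the output is deterministic, both toolchains see byte-identical input for the same step, which keeps the token measurements comparable.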

4.7 Workspace Configuration

Source on this machine: .code-workspace, .vscode/, .github/copilot-instructions.md

Synthetic equivalent to produce:

- A synthetic .code-workspace file with the correct multi-root structure
- Synthetic VS Code settings
- Synthetic Roo Code configuration (.roo/ equivalent)
- Synthetic GitHub Copilot workspace instructions
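A multi-root .code-workspace file is a JSON document with a `folders` array of `path` entries and an optional `settings` object, so the synthetic version can be generated rather than hand-edited. The sketch below assumes three hypothetical folder names. The real folder list and settings should mirror the structure of the source workspace without copying its names.

```python
import json
from typing import Optional


def build_workspace(folder_paths: list[str],
                    settings: Optional[dict] = None) -> dict:
    """Build the dictionary form of a multi-root VS Code workspace."""
    if not folder_paths:
        raise ValueError("a multi-root workspace needs at least one folder")
    return {
        "folders": [{"path": path} for path in folder_paths],
        "settings": dict(settings or {}),
    }


def render_workspace(workspace: dict) -> str:
    """Serialize the workspace as the JSON text written to disk."""
    return json.dumps(workspace, indent=2) + "\n"


if __name__ == "__main__":
    # Folder names are hypothetical placeholders for the synthetic roots.
    workspace = build_workspace(
        ["synthetic-services", "synthetic-analysis", "synthetic-source-code"],
        settings={"files.trimTrailingWhitespace": True},
    )
    print(render_workspace(workspace))
```

The rendered text can be saved under any name ending in .code-workspace and opened directly in VS Code.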

5 Workflow Scenarios to Reproduce

These are the specific architecture tasks to execute on both toolchains for cost comparison. Each scenario should be documented as a "script" that can be followed identically on both tools.

Scenario 1: New Ticket Intake and Classification

Steps:

1. Query synthetic JIRA for open tickets
2. Select a ticket and extract details
3. Classify the ticket (architecture-relevant vs code bug)
4. Create the ticket workspace folder structure
5. Generate the simple explanation document
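To follow a scenario identically on both tools and feed the results into the cost comparison, each executed step can be logged with its token counts. The sketch below shows one possible record format. The tool labels, field names, and helper functions are assumptions for illustration, and how token counts are actually obtained from each toolchain is outside what this sketch covers.

```python
from collections import defaultdict
from dataclasses import dataclass

# Step descriptions copied from Scenario 1.
SCENARIO_1_STEPS = [
    "Query synthetic JIRA for open tickets",
    "Select a ticket and extract details",
    "Classify the ticket (architecture-relevant vs code bug)",
    "Create the ticket workspace folder structure",
    "Generate the simple explanation document",
]


@dataclass(frozen=True)
class StepRun:
    tool: str           # illustrative labels, e.g. "roo-kong" or "copilot"
    scenario: str       # e.g. "scenario-1"
    step: int           # 1-based index into the scenario's step list
    input_tokens: int
    output_tokens: int


def totals_by_tool(runs: list[StepRun]) -> dict[str, dict[str, int]]:
    """Sum input and output tokens per tool across all logged steps."""
    totals = defaultdict(lambda: {"input_tokens": 0, "output_tokens": 0})
    for run in runs:
        totals[run.tool]["input_tokens"] += run.input_tokens
        totals[run.tool]["output_tokens"] += run.output_tokens
    return dict(totals)


def missing_steps(runs: list[StepRun], tool: str, scenario: str,
                  step_count: int) -> list[int]:
    """List step numbers not yet logged for a tool, to keep runs comparable."""
    done = {r.step for r in runs if r.tool == tool and r.scenario == scenario}
    return [n for n in range(1, step_count + 1) if n not in done]


if __name__ == "__main__":
    # Token counts below are made-up sample values, not measurements.
    log = [
        StepRun("roo-kong", "scenario-1", 1, 1200, 300),
        StepRun("roo-kong", "scenario-1", 2, 2500, 700),
        StepRun("copilot", "scenario-1", 1, 1100, 280),
    ]
    print(totals_by_tool(log))
    print(missing_steps(log, "copilot", "scenario-1", len(SCENARIO_1_STEPS)))
```

Checking for missing steps before comparing totals guards against pricing one tool on a partial run and the other on a complete one.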