MS26 PG Architect
Maintains 1G (Apps · Backoffice · Context) alignment across the fleet.
Every agent in the stack: the AppAI cockpit itself, the ABC skill family, per-codebase master agents, and the daily-workflow agents. Diagram is live, generated via /abc-diagrams.
Topology is hand-curated and lives at public/diagrams/agents-topology.html. Re-run /abc-diagrams to refresh.
Agents in the SPM stack split into four layers by scope. The higher tiers route to the lower ones — and every tier is just a skill at heart.
Per-codebase super-agents. They own the project state, route work to sub-skills, and never lose context between sessions. /appai, /aibo, /aios, /villai.
/appai/aibo/aios/villaiOne agent per daily pipeline. Same shape every day: research → structure → render → log. /pad, /lad, /aad, /iad, /vad.
/pad/lad/aad/iad/vadNarrow scope, deep skill. /abc-architect, /abc-intake, /abc-proposal, /security-review, /persona, /skill-creator.
/abc-architect/security-review/personaCross-codebase agents that speak for a whole org. cleverfox-agent, [client]-agent. They route to the right master agent based on the request.
cleverfox-agent[client]-agentThe six fields you'll see in any well-shaped agent in the stack. Anything missing is a bug — explicit beats implicit when you're running fifty of them.
When does this agent wake up? Cron, webhook, file change, PR event, Slack mention, or human invocation.
What does it remember between runs? File-based memory, vector store, or session-scoped only.
What can it touch? Read/edit/write, bash, web, MCP servers, specific APIs. Allow-listed per agent.
Who is it? Name, role, tone, refusal posture, escalation chain. Managed via PersonAI.
What does it do when confidence drops? Pause, ask, fall back to a human, or refuse and log.
What's it allowed to spend? Per-call token cap, per-day cost cap, per-week kill switch.
Every agent in the stack started this way. The five steps below produce a working SKILL.md from scratch.
When should it wake up? Daily cron, Slack mention, PR event, file change, or manual invocation. Ambiguous triggers produce ambiguous agents.
What does it need to remember across runs? Keep it to one concern — project PBIs, content topics, or customer records. Never all three in one file.
List every tool it needs: Read, Edit, Bash, WebSearch, MCP. Deny everything else by default. Start narrow and widen only when a real use-case demands it.
One file: trigger, memory schema, tool allow-list, persona (name + tone + refusal posture), escalation rule, and a worked example invocation.
First run is the spec. Read the output against the intent. Fix the SKILL.md, not the output. Add a /review pass after 7 runs to close any drift.
Maintains 1G (Apps · Backoffice · Context) alignment across the fleet.
Build · preview · push · deploy orchestrator across all apps.
Super master agent for the 1G ABC initiative.
Stack-aware system architect; produces ADRs and data models.
Turns raw client input into a structured Project Card.
Classifies projects into verticals and attaches sector packs.
Generates full proposal pack: markdown, HTML, JSX prototype, PDF, email.
Editorial-quality HTML+SVG diagrams, no Mermaid-slop.
Concept → 13-slide deck + AIBO page + video + email + LinkedIn + tweet.
URL → 6-asset social media content pack.
Idea → single-file React JSX app shell artifact.
Canonical design tokens for all Mat Siems UI.
Tailwind v4 + shadcn rules for ABC apps.
shadcn component management, docs, and registry.
Next.js 16 conventions — route groups, params as Promise, runtimes.
Lint · unused-import sweep · file organisation.
Validate .env, compare environments, generate templates.
Claude Code Cheat — 10 POM crash course for teams.
Google Workspace cockpit with Fox-persona orchestration.
Private 1G planning + world context pillar.
Full UX audit — mobile/tablet/desktop, a11y, consistency.
AIBO platform — personas, assets, media, workflows.
V26 villa acquisition + ownership OS.
Audio assets, DJ recordings, TTS, transcription, AAD.
Onyx-based enterprise AI platform.
Claude Architect Certification study coach.
Anthropic Managed Agents study coach.
Page A Day — daily 5-stream context page.
List A Day — fresh Top-100 list, never repeated.
Audio A Day — narrated short with infographic.
A chat has no memory, no triggers, and no accountability. Agents in this stack each have a SKILL.md (their job description), a place they store output, and a defined invocation contract. That's the difference between a coworker and a stranger you have to brief every morning.
No. We co-design the first three in the engagement — usually one writer, one researcher, one orchestrator. After that, you can fork the pattern or have us scaffold the next one. The fleet of 13 live sites ships entirely from this same template.
Locally for prototyping (Claude Code in your terminal), in production via the Claude API for managed agents, and on Vercel for any agent that needs a public surface. No proprietary runtime, no lock-in. If we vanish tomorrow, your agents keep working.
Managed Agents bill at £1 per agent-hour for compute + supervision. A typical site-ops agent uses 2-6 hours per month. Heavy media-generation agents (IAD, VAD, AAD) bill at usage cost + 20% — the rate sheet is on the pricing page.
Three things. (1) SKILL.md is opinionated — it says no as often as yes. (2) Every agent invocation logs to a JSON line we can replay. (3) Outputs land in a reviewed location (BO, GitHub PR, S3 bucket) — never directly to production. Drift is detected within one loop iteration.
Yes. The stack defaults to Claude (Opus 4.7 / Sonnet 4.6) but the skill format is provider-neutral. We've run the same agents against Gemini and GPT-5 for clients with policy constraints. Token economics are typically 20-40% better on Claude, but the architecture doesn't care.