Workflow System

The workflow system executes AI skills programmatically and tracks their execution history.

Overview

Skills are invoked via the Claude CLI using stream-json format, which allows proper skill expansion and tool access. The workflow executor:

  1. Invokes a skill by name
  2. Captures execution metrics (duration, cost, turns)
  3. Logs results to this file
  4. Commits changes using the /agent-commit skill for meaningful messages

Available Skills

Orchestration

The evolution loop (scripts/evolve_loop.py) is the main orchestrator. It runs a deterministic 24-slot task cycle with time-triggered events like daily highlights at 8am UTC.

SkillPurposeModifies Content?
/replenish-queue [mode]Auto-generate tasks when queue is empty or near-emptyYes (todo.md only)
/tune-systemMonthly meta-review—analyze system operation, adjust cadences/thresholdsYes (state, minor)

Content Creation

SkillPurposeModifies Content?
/expand-topic [topic]Generate new article on a topicYes (creates draft)
/refine-draft [file]Improve existing draft contentYes (edits content)
/research-topic [topic]Web research, outputs notes to researchResearch notes only
/research-voidsDaily research on cognitive gaps and unchartable territoriesResearch notes only

Review & Validation

SkillPurposeModifies Content?
/validate-allCheck frontmatter, links, orphansNo (reports only)
/check-tenetsVerify alignment with 5 foundational tenetsNo (reports only)
/check-linksVerify all internal links workNo (reports only)
/pessimistic-reviewFind logical gaps, unsupported claims, counterargumentsNo (reports only)
/optimistic-reviewFind strengths and expansion opportunitiesNo (reports only)
/deep-review [file]Comprehensive single-document review with improvementsYes (modifies content)

Content Maintenance

SkillPurposeModifies Content?
/coalesceMerge overlapping articles into unified pieces, archiving originalsYes (creates, archives)

Publishing

SkillPurposeModifies Content?
/add-highlight [topic]Add item to What’s New page (max 1/day). Supports backlog: can highlight any content not featured in last 90 daysYes (highlights.md)

Internal (Automation Only)

SkillPurposeModifies Content?
/agent-commitAnalyze changes and create meaningful git commit messagesGit only

The /agent-commit skill is invoked automatically by the evolution loop after each content-modifying skill completes. It:

  1. Receives the previous skill’s output as context
  2. Runs git diff to analyze actual file changes
  3. Generates a descriptive commit message (e.g., refine(deep-review): improve clarity in free-will.md)
  4. Creates the commit with agent authorship

This replaces the previous generic commit messages like auto(deep-review): Automated execution.

Queue Replenishment

The task queue in todo auto-replenishes when active tasks (P0-P2) drop below 3. The evolution loop triggers /replenish-queue automatically when the queue is low.

Task Types and Chains

Tasks generate follow-up tasks automatically:

TypeDescriptionGenerates
research-topicWeb research producing notesexpand-topic
expand-topicWrite new articlecross-review
cross-reviewReview article in light of new content(terminal)
refine-draftImprove existing draft(terminal)
deep-reviewComprehensive single-doc review(terminal)

Task Generation Sources

/replenish-queue generates tasks from four sources:

  1. Task chains: Recent research-topic completions that need articles written; recent expand-topic completions that need cross-review integration
  2. Unconsumed research: Research notes in research/ without corresponding articles
  3. Gap analysis: Content gaps based on tenet support, undefined concepts, coverage targets
  4. Staleness: AI-generated content not reviewed in 30+ days

Replenishment Modes

Cross-Review Tasks

When a new article is written, /replenish-queue generates cross-review tasks for related existing articles. These reviews:

System Tuning

The /tune-system skill provides meta-level self-improvement for the automation system. It runs monthly (30-day cadence, injected when 45 days overdue).

What It Analyzes

  1. Cadence adherence: Are maintenance tasks running on schedule or frequently overdue?
  2. Failure patterns: What’s causing systematic task failures?
  3. Queue health: Is replenishment producing tasks that actually get executed?
  4. Review findings: Are identified issues being addressed?
  5. Convergence progress: Is the system making progress toward goals?

Change Tiers

TierScopeApproval
Tier 1Cadence ±2 days, threshold ±2 daysAutomatic (max 3/session)
Tier 2New P3 tasks, larger changesRecommendation only
Tier 3Skill changes, tenet-relatedReport only

Safeguards

Output

Creates report at reviews/system-tune-YYYY-MM-DD.md documenting findings, changes applied, and recommendations.

Running Workflows

Evolution Loop

The evolution loop runs continuously, executing tasks on a 24-slot cycle:

# Run evolution loop (Ctrl+C to stop)
python scripts/evolve_loop.py --interval 2400

# Describe the task cycle
python scripts/evolve_loop.py --describe-cycle

# Test with limited iterations
python scripts/evolve_loop.py --max-iterations 5

Individual Skills

# Run a skill manually
uv run python scripts/run_workflow.py validate-all

# Run with more turns for complex tasks
uv run python scripts/run_workflow.py expand-topic --max-turns 30

Execution Format

Each workflow execution logs:

Recent Executions

TitleCreatedModified
Tag Vocabulary This file is the single source of truth for tags. Before adding a tag to any article, check here first. Rules Use existing tags — never invent a new tag when an existing one covers the concept Lowercase kebab-case only — human-ai-collaboration, never HumanAI or Human AI 3–8 tags per article — prefer specificity over breadth To add a new tag — add it here first with a description and example use, then apply it Synonyms are forbidden — if two tags mean the same thing, retire one and note the canonical form Vocabulary AI & Technology Tag Meaning Do NOT use instead ai-tools General-purpose AI software tools (ChatGPT, Claude, Copilot) used in professional work ai, generative-ai for tool-specific contexts generative-ai Generative AI as a technology category, capability, or paradigm ai (too broad), ai-tools (too specific) ai-agents Autonomous or semi-autonomous AI systems acting on behalf of users automation, agents prompt-engineering The practice of crafting prompts as …
2026-03-112026-03-11
Highlights Featured content. Updated by the /add-highlight skill. No highlights yet. Content will appear here as the framework generates articles. 2026-03-11: Steal Like an Artist, Feel Like a Taoist Active curation meets effortless flow. Austin Kleon’s ‘steal’ principle paired with Taoist wu wei shows how bold borrowing and ego-release create the conditions for genuine creativity. Type: new-article Link: steal-like-an-artist-feel-like-a-taoist 2026-03-10: Apex Articles: Where Ideas Converge Into Argument Intent Suite is building a synthesis layer—apex articles that weave topics, concepts, and arguments into unified narratives. Each must defend a clear thesis.
2026-03-092026-03-11
Task Queue Tasks are picked up by the evolution loop (scripts/evolve_loop.py). Priority Levels P0 — Urgent: blocking issues P1 — High: important improvements P2 — Normal: standard content work (automation picks these) P3 — Low: backlog / nice to have Active Tasks P1: Can Generative AI have an “intent”? Type: research-topic Notes: How to reframe this looking at “intent”? The user has an intent, the why doing something its actions and relations with technology. But can we also flip it? Is there something like artificial intent? Generated: 2026-03-10 ✓ 2026-03-11: How does cognitive debt accumulate in knowledge work that relies heavily on AI? Type: research-topic Notes: Follow-up to AI intensification question. What recovery and boundary practices reduce it? Output: cognitive-debt-ai-knowledge-work-2026-03-11 P2: How is AI reshaping workshop facilitation and design sessions? Type: research-topic Notes: Domain exploration — facilitation. What new roles, risks, and …
2026-03-102026-03-10
2026-03-11T14:37:00+00:00 - research-topic Status: Success Topic: What does “good enough” mean in AI-augmented systemic design? Output: good-enough-ai-augmented-systemic-design-2026-03-11 Sources consulted: 11 2026-03-11T13:57:00+00:00 - expand-topic Status: Success Topic: The Intent Stack: Making Human Purpose Legible to AI Output: intent-stack-framework Word count: 1166 Based on research: yes — intent-stack-framework-2026-03-11 2026-03-11T13:23:00+00:00 - research-topic Status: Success Topic: The “Expert Benchmark” Fallacy in AI Evaluation Output: expert-benchmark-fallacy-ai-evaluation-2026-03-11 Sources consulted: 8 Changelog AI automation activity log. Updated automatically by the evolution loop.
2026-03-102026-03-10