░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
░ ░
░ ┌───────────────────────────────────────┐ ░
░ │ │ ░
░ │ plan.md ─────┐ │ ░
░ │ model-id ────┼──→ [ control room ] │ ░
░ │ agent-id ────┤ │ ░
░ │ queue.json ──┘ │ ░
░ │ │ ░
░ │ less magic. more instrumentation. │ ░
░ │ │ ░
░ └───────────────────────────────────────┘ ░
░ ░
░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
## today
→ the good news: agent builders are finally writing down the workflow instead of whispering at the model.
→ the bad news: providers are getting sneaky about which model you are actually talking to.
→ the open web version of agents wants identity cards, not better mascots.
→ context-limit discipline is mutating into a real profession.
→ maintenance tax is still the quiet killer of every "second brain" fantasy.
→ managed agents are starting to smell like software middle management.
## ■ signal 1 — workflow files are eating prompts
strength: ■■■■■
recursive mode turns requirements, planning, testing, review, and memory into files.
archon calls itself a harness builder for deterministic AI coding.
farmer adds remote approval so the agent can keep moving while you're away from the keyboard.
why it matters:
the best agent workflows are escaping the chat window. the useful part is no longer the prompt itself. it's the paperwork around it: spec files, plans, tests, approvals, memory, handoff notes. artifacts that survive the context window beat vibes every time.
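none of these tools publish a shared schema, but the shape of the paperwork is easy to sketch. a hypothetical `queue.json` of artifacts that outlive the chat session might be consumed like this (every field name here is invented for illustration, not taken from recursive mode, archon, or farmer):

```python
import json

# hypothetical queue format: each item is an artifact that survives the context window
queue = json.loads("""
[
  {"id": 1, "kind": "spec",    "path": "specs/auth.md", "status": "approved"},
  {"id": 2, "kind": "plan",    "path": "plan.md",       "status": "pending"},
  {"id": 3, "kind": "handoff", "path": "notes/day3.md", "status": "pending"}
]
""")

# the agent only acts on artifacts a human has signed off on
actionable = [item for item in queue if item["status"] == "approved"]
blocked = [item["path"] for item in queue if item["status"] == "pending"]

print(f"{len(actionable)} actionable, waiting on: {blocked}")
# → 1 actionable, waiting on: ['plan.md', 'notes/day3.md']
```

the point isn't the schema. it's that approvals, plans, and handoffs live in files the next session can read.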
one-liner:
prompting was phase 1. paperwork is phase 2.
sources:
→ https://recursive-mode.dev/introduction
→ https://github.com/coleam00/Archon
→ https://github.com/grainulation/farmer
## ■ signal 2 — hidden downgrades are the new dark pattern
strength: ■■■■□
one proxy setup exposed a hidden `fallback-percentage: 0.5` header in claude api traffic.
simon willison documented that chatgpt voice mode uses a weaker model than the normal text interface.
why it matters:
same chat bubble, different brain. if model routing, fallback behavior, and quota shaping stay invisible, the user can't reason about quality drops. that makes personal AI systems feel flaky even when the real issue is hidden orchestration.
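surfacing that kind of routing metadata is not exotic. a minimal sketch of the auditing step a logging proxy would do, in stdlib python (the `fallback-percentage` header name comes from the thread; the other header names and values are invented):

```python
# substrings that hint at model routing or quota shaping in response headers
ROUTING_HINTS = ("fallback-percentage", "model", "x-route")

def audit_headers(headers: dict[str, str]) -> dict[str, str]:
    """Return only the response headers that hint at routing decisions."""
    return {
        k: v for k, v in headers.items()
        if any(hint in k.lower() for hint in ROUTING_HINTS)
    }

# example response captured by a transparent proxy (values invented for illustration)
resp_headers = {
    "content-type": "application/json",
    "fallback-percentage": "0.5",
    "x-model-served": "smaller-variant",
}
print(audit_headers(resp_headers))
# → {'fallback-percentage': '0.5', 'x-model-served': 'smaller-variant'}
```

log that dict next to every response and the "same bubble, different brain" problem at least becomes visible.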
one-liner:
your AI OS needs a dashboard, not a magic trick.
sources:
→ https://reddit.com/r/ClaudeAI/comments/1sip74m/i_set_up_a_transparent_api_proxy_and_found/
→ https://simonwillison.net/2026/Apr/10/voice-mode-is-weaker/#atom-everything
## ■ signal 3 — agent identity wants a public layer
strength: ■■■■□
notes from the open agentic web conference kept circling the same idea: the future isn't smarter chat windows. it's identity, attestation, registries, and agents acting in public with receipts.
why it matters:
once agents do things across tools on your behalf, provenance matters more than personality. "which agent did this, under what permissions, with what memory" becomes core infrastructure.
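no standard has settled yet, but the receipt idea itself is simple: sign every action against the agent's identity and permissions. a sketch with a symmetric hmac (field names are illustrative; a real registry would use asymmetric keys and a published format):

```python
import hashlib
import hmac
import json

def sign_receipt(agent_id: str, action: dict, key: bytes) -> dict:
    """Attach a signed receipt answering: which agent, doing what, under what scope."""
    payload = json.dumps({"agent": agent_id, "action": action}, sort_keys=True)
    sig = hmac.new(key, payload.encode(), hashlib.sha256).hexdigest()
    return {"payload": payload, "sig": sig}

def verify_receipt(receipt: dict, key: bytes) -> bool:
    expected = hmac.new(key, receipt["payload"].encode(), hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, receipt["sig"])

key = b"registry-issued-secret"  # placeholder for real key material
r = sign_receipt("agent-42", {"tool": "calendar", "scope": "read"}, key)
print(verify_receipt(r, key))  # → True
```

the receipt is the public layer: anyone holding the key material can check who acted and under what scope, without trusting the chat transcript.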
one-liner:
"assistant" is starting to mutate into "account".
sources:
→ https://reddit.com/r/artificial/comments/1siypay/spent_today_at_mits_open_agentic_web_conference/
## ■ signal 4 — operating the limits became a real skill
strength: ■■■■□
one high-signal cursor thread breaks down how to work 10+ hours a day without burning through claude limits.
another makes the broader point: code got cheap, software didn't. review, routing, integration, deploy, and decision quality are now the bottlenecks.
why it matters:
model access is no longer the whole game. context discipline, approval boundaries, and routing strategy are becoming actual operator skills.
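the routing half of that operator skill is mechanical enough to sketch: send easy work to a cheap model and spend the expensive tier against a daily budget (model names, token counts, and the budget are all placeholders):

```python
def route(task_tokens: int, remaining_budget: int, hard_task: bool) -> str:
    """Pick a model tier so the expensive budget lasts the whole day."""
    if hard_task and task_tokens <= remaining_budget:
        return "expensive-model"
    return "cheap-model"

budget = 50_000  # tokens left on the expensive tier today
plan = []
for tokens, hard in [(8_000, True), (2_000, False), (60_000, True)]:
    model = route(tokens, budget, hard)
    if model == "expensive-model":
        budget -= tokens
    plan.append(model)

print(plan)  # → ['expensive-model', 'cheap-model', 'cheap-model']
```

note the third task: hard, but over budget, so it gets downgraded deliberately and visibly. that's the difference between running a control room and getting silently throttled.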
one-liner:
the winners won't type better prompts. they'll run a tighter control room.
sources:
→ https://reddit.com/r/cursor/comments/1sgz94r/how_i_use_cursor_10_hours_a_day_without_torching/
→ https://reddit.com/r/cursor/comments/1si047e/code_is_free_now_but_software_is_still_expensive/
## ■ signal 5 — knowledge systems that need babysitting are dying
strength: ■■■■□
a pkm thread framed the enemy perfectly: maintenance tax.
meanwhile microsoft's markitdown keeps climbing because converting ugly office formats into markdown is boring, necessary, and weirdly central.
why it matters:
personal AI stacks won't be won by prettier note graphs. they'll be won by ingestion so automatic you forget it exists.
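"automatic enough to forget" has a concrete shape: ingestion as a dispatch table you never touch. a stdlib-only sketch where something like markitdown would slot in as one converter (`convert_office` is a hypothetical hook, not markitdown's real API):

```python
from pathlib import Path

def convert_office(path: Path) -> str:
    # hypothetical hook: a real stack might call a converter like markitdown here
    return f"[converted {path.name}]"

# one entry per format; adding a format is one line, not a new habit
CONVERTERS = {
    ".md": lambda p: p.read_text(),
    ".txt": lambda p: p.read_text(),
    ".docx": convert_office,
    ".xlsx": convert_office,
}

def ingest(inbox: Path) -> list[str]:
    """Sweep a folder; every known format becomes markdown text, no gardening."""
    notes = []
    for path in sorted(inbox.iterdir()):
        converter = CONVERTERS.get(path.suffix.lower())
        if converter:
            notes.append(converter(path))
    return notes
```

run it on a cron job or a filesystem watcher and the maintenance tax drops to zero: unknown formats are skipped, known ones flow in.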
one-liner:
if your second brain needs constant gardening, it's not a brain. it's a bonsai.
sources:
→ https://reddit.com/r/PKMS/comments/1siuq40/an_opensource_approach_to_removing_the/
→ https://github.com/microsoft/markitdown
## ■ signal 6 — managed agents are growing middle management
strength: ■■■■□
multica pitches managed agents as teammates you assign work to, track, and improve over time. that's not just another assistant. that's a tiny org chart.

why it matters:
the next layer after "AI that can code" is "AI that can be managed." delegation, progress visibility, and skill compounding are becoming first-class product primitives.
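what "AI that can be managed" minimally requires is a task record with an owner and a visible state machine (the states and fields below are invented for illustration, not multica's model):

```python
from dataclasses import dataclass, field

STATES = ("backlog", "assigned", "in_progress", "review", "done")

@dataclass
class AgentTask:
    title: str
    assignee: str  # which managed agent owns this
    state: str = "backlog"
    history: list = field(default_factory=list)  # progress visibility for the human

    def advance(self) -> str:
        """Move to the next state and record it, so progress is auditable."""
        i = STATES.index(self.state)
        if i < len(STATES) - 1:
            self.state = STATES[i + 1]
            self.history.append(self.state)
        return self.state

task = AgentTask("migrate billing webhooks", assignee="agent-7")
task.advance()
task.advance()
print(task.state)  # → in_progress
```

once that record exists, delegation, tracking, and "improving over time" are queries against it. which is exactly how human middle management works too.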
one-liner:
we are one kanban board away from calling them employees.
sources:
→ https://github.com/multica-ai/multica