runtime hygiene
self.md radar — 2026-04-15
the interesting AI work moved below the chat window: memory now needs contradiction handling, vertical agents are replacing generic tool soup, and serious teams are building failure boundaries instead of praying harder. today’s three fronts — memory that forgets on purpose, finance-specific agent infrastructure, and a convergence wave around debugging, privilege separation, and agent failure analysis.
1. yantrikdb treats memory as epistemic infrastructure, not a vector junk drawer
sources:
what happened: yantrikdb launches with a pointed thesis: vector databases store memories but do not manage them. the system adds consolidation, contradiction detection, and temporal decay so memories can be forgotten on purpose. the author says recall quality degraded badly around ~5k memories in ChromaDB and that fixing it required transactional logic tightly coupled to the vector index. the repo ships with 1,178 unit tests and chaos tests for 3-node cluster failure scenarios.
why this matters: personal AI memory is finally being treated like a system with hygiene rules instead of an append-only bag of embeddings. this is not “more memory” — it is memory that knows when to shut up.
2. langalpha shows what happens when the agent shell goes vertical
sources:
what happened: langalpha markets itself as “Claude Code for Finance” and ships a specific complaint: MCP tools collapse under financial-data scale because schemas and historical price pulls explode the context window. their workaround auto-generates typed python modules from MCP schemas at workspace init so the agent imports those inside the sandbox instead of stuffing raw tool definitions into the prompt. they also treat persistent research across sessions as table stakes for investing workflows.
why this matters: the useful future may be domain-specific agent shells, not one general-purpose chat with a thousand tools bolted on. verticalization is becoming an engineering choice, not just a marketing wrapper.
3. the anti-vibes layer is getting real
sources:
what happened: kelet launches around a single claim: the hard part in production is not building agents but figuring out why they fail when they quietly return wrong answers. metabase’s repro-bot reads github issues and attempts to reproduce reported bugs automatically — it started as a hackathon project and is now part of daily triage workflow. openparallax pitches OS-level privilege separation for agent execution after repeated stories about agents deleting data or exfiltrating information.
why this matters: the stack is maturing from “look what the demo can do” to “prove the thing can fail without taking me down with it.” observability, reproducibility, and permission boundaries are becoming first-class product features, not afterthoughts.
supporting links
- cc-switch — another sign that operator surfaces are consolidating across agent CLIs, though weaker than langalpha’s vertical thesis.
- markdown kanban, no plugin — the same simplification instinct hitting PKM: a markdown file plus dataview instead of another plugin-shaped dependency.
- bosses say AI boosts productivity — workers say they’re drowning in ‘workslop’ — ambient proof that failure analysis is becoming a labor issue, not just a developer hobby.
left on the table
- andrej-karpathy-skills — already covered on 2026-04-09, same URL.
- multica — managed-agent theme already used yesterday; no new delta.
- opus 4.7 rumor thread — too thin and speculative to run as a signal.
- stanford insider disconnect report — interesting framing but too abstract next to today’s stronger concrete operator signals.