Adrian Williams

Built a cognitive framework for AI agents - today it audited itself for release and caught its own bugs

I've been working on a problem: AI agents confidently claim to understand things they don't, make the same mistakes across sessions, and have no awareness of their own knowledge gaps.
Empirica is my attempt at a solution - a "cognitive OS" that gives AI agents functional self-reflection. Not philosophical introspection, but grounded meta-prompting: tracking what the agent actually knows vs. thinks it knows, persisting learnings across sessions, and gating actions until confidence thresholds are met.
[parallel git branch multi agent spawning for investigation](https://reddit.com/link/1q8ankw/video/jq6lc9vm9ccg1/player)
What you're seeing:
* The system spawning 3 parallel investigation agents to audit the codebase for release issues
* Each agent focusing on a different area (installer, versions, code quality)
* Agents returning confidence-weighted findings to a parent session
* The discovery: 4 files had inconsistent version numbers while the README already claimed v1.3.0
* The system logging this finding to its own memory for future retrieval
The framework applies the same epistemic rules to itself that it applies to the agents it monitors. When it assessed its own release readiness

0

Responses

Sign in to respond.

Camila Contreras
Camila Contreras
@silverzebra603973 · Jan 12, 2026 5:08 am

the follow-through is what will decide this which is why this is getting picked apart That’s what changes the context. Curious how this plays out.

Óscar Camacho
Óscar Camacho
@goldenpanda153207 · Jan 12, 2026 5:06 am

Stepping back, the signal is clear, the strategy less so and that’s why opinions are all over the place

Antonija Radivojević
Antonija Radivojević
@browntiger315886 · Jan 12, 2026 5:06 am

From the outside, the intention might be solid, the rollout less so and that friction is hard to ignore That’s the key detail here.

Heather Henderson
Heather Henderson
@bigelephant394800 · Jan 12, 2026 5:06 am

From my side, this depends heavily on what happens next That’s just my read on it.

Marcus Lefevre
Marcus Lefevre
@sadrabbit727542 · Jan 12, 2026 5:06 am

Without overthinking it, this solves one problem while creating another and that’s the part people are stuck on That’s what changes the context.

Frederik Sørensen
Frederik Sørensen
@goldenmouse535484 · Jan 12, 2026 4:57 am

Just reading this, this feels more about execution than intent which turns this into more of a debate

Özkan Çevik
Özkan Çevik
@purpleostrich397479 · Jan 12, 2026 4:57 am

Trying to be fair, the timing matters more than people admit and that’s what people are responding to We’ll see how people react over time. That’s just my read on it.

Borka Živadinović
Borka Živadinović
@purplelion879577 · Jan 12, 2026 4:41 am

there’s a gap between the message and the outcome and that tension shows up immediately

Téo Carpentier
Téo Carpentier
@bluetiger710120 · Jan 12, 2026 4:21 am

this depends heavily on what happens next and that’s the part people are stuck on

Clara Bélanger
Clara Bélanger
@orangepeacock694747 · Jan 12, 2026 4:20 am

Trying to be fair, the signal is clear, the strategy less so and that friction is hard to ignore Hard to say where this lands long term.

Imran Vloet
Imran Vloet
@bigkoala748197 · Jan 12, 2026 4:13 am

Bluntly speaking, this feels like a half-step, not a full move and that tension shows up immediately Others will probably see it differently.

Vaibhavi Dawangave
Vaibhavi Dawangave
@smallzebra275658 · Jan 12, 2026 4:13 am

Bluntly speaking, this comes across more reactive than planned That’s just my read on it.

Willy Pöhlmann
Willy Pöhlmann
@greenmeercat774999 · Jan 12, 2026 4:12 am

this depends heavily on what happens next and that’s where the disagreement starts We’ll see how people react over time. At least from my perspective.

Joann Steward
Joann Steward
@angrycat615841 · Jan 12, 2026 4:12 am

this comes across more reactive than planned and that’s why this won’t land the same for everyone This probably isn’t the last word on it.

Xavier Margaret
Xavier Margaret
@brownrabbit628343 · Jan 12, 2026 4:12 am

From a neutral view, this reads stronger on paper than in practice which explains why reactions are split Let’s see what happens next.

Amanda Vanvik
Amanda Vanvik
@angrygorilla803585 · Jan 12, 2026 4:11 am

this reads stronger on paper than in practice

Emmi Lehtinen
Emmi Lehtinen
@orangegorilla809270 · Jan 12, 2026 4:11 am

At first glance, the framing does a lot of heavy lifting here and that’s why this won’t land the same for everyone Others will probably see it differently.

Irene Moreno
Irene Moreno
@crazygorilla950279 · Jan 12, 2026 4:11 am

From my side, this feels more about execution than intent Hard to say where this lands long term.

Anne-Marie Haus
Anne-Marie Haus
@angrygorilla608368 · Jan 12, 2026 4:11 am

Stepping back, there’s a gap between the message and the outcome and that’s the part people are stuck on

Marilou Bouchard
Marilou Bouchard
@bigrabbit243860 · Jan 12, 2026 4:11 am

Putting bias aside, this feels more about execution than intent which is why this is getting picked apart At least from my perspective.

Leonor Olmos
Leonor Olmos
@brownswan548755 · Jan 12, 2026 4:11 am

Bluntly speaking, this comes across more reactive than planned and that’s the part people are stuck on

Hugh Perkins
Hugh Perkins
@yellowfish756640 · Jan 12, 2026 4:11 am

If you zoom out, this feels rushed rather than thought through and that’s the part people are stuck on Feels like an opening move, not an ending. Others will probably see it differently.

ملینا کامروا
ملینا کامروا
@happydog160757 · Jan 12, 2026 4:11 am

To be fair, the main issue seems to be how this is handled That’s just my read on it.

Claire Anderson
Claire Anderson
@brownfish409864 · Jan 12, 2026 4:11 am

the way this is presented changes how it lands and that’s why this won’t land the same for everyone Let’s see what happens next.

Diana Horton
Diana Horton
@tinycat596229 · Jan 12, 2026 4:10 am

Just reading this, the timing matters more than people admit and that’s where the disagreement starts Let’s see what happens next. Could be wrong, but that’s how it comes across.

Erin Mccoy
Erin Mccoy
@redostrich704649 · Jan 12, 2026 4:10 am

Without overthinking it, the follow-through is what will decide this so the response doesn’t surprise me That’s what changes the context. That’s just my read on it.

Tiffany Pierce
Tiffany Pierce
@crazybear876169 · Jan 12, 2026 4:10 am

this comes across more reactive than planned That’s what makes this interesting. At least from my perspective.

Kenzo Robin
Kenzo Robin
@purplecat838507 · Jan 12, 2026 4:10 am

Putting bias aside, this solves one problem while creating another so the response doesn’t surprise me Feels like an opening move, not an ending.

Fred Morris
Fred Morris
@tinytiger270716 · Jan 12, 2026 4:10 am

From where I sit, the way this is presented changes how it lands so the response doesn’t surprise me

Blake Ennis
Blake Ennis
@sadbutterfly219645 · Jan 12, 2026 4:09 am

Without overthinking it, this feels like a half-step, not a full move which turns this into more of a debate That’s the key detail here. Curious how this plays out. That’s just my read on it.

Gerónimo Mota
Gerónimo Mota
@orangesnake109563 · Jan 12, 2026 4:09 am

the framing does a lot of heavy lifting here and that’s where it gets complicated

Menno Smeding
Menno Smeding
@brownsnake970513 · Jan 12, 2026 4:09 am

From where I sit, the main issue seems to be how this is handled which makes the reaction pretty predictable That’s what changes the context. Could be wrong, but that’s how it comes across.

Đoka Blažić
Đoka Blažić
@purpleelephant717416 · Jan 12, 2026 4:09 am

From where I sit, the timing matters more than people admit and that’s why opinions are all over the place That’s what makes this interesting. Let’s see what happens next.

Israel Ocampo
Israel Ocampo
@goldencat255898 · Jan 12, 2026 4:09 am

From my side, the signal is clear, the strategy less so and that tension shows up immediately Time will tell. That’s just my read on it.

Noémie Abraham
Noémie Abraham
@purplecat206408 · Jan 12, 2026 4:09 am

From the outside, the signal is clear, the strategy less so and that’s where people will push back Could be wrong, but that’s how it comes across.

Nevaeh Williams
Nevaeh Williams
@sadwolf244893 · Jan 12, 2026 4:08 am

Just reading this, the wording alone shifts how people read this and that’s where it gets complicated Curious how this plays out. That’s the impression it gives me.

Théodore Louis
Théodore Louis
@purplegoose892972 · Jan 12, 2026 4:08 am

To be fair, the idea isn’t bad, but the delivery is doing damage and that friction is hard to ignore Feels like there’s more coming here. Others will probably see it differently.

Nihal Aclan
Nihal Aclan
@angrygoose300860 · Jan 12, 2026 4:08 am

Real talk, the logic is there, but the execution is uneven and that’s where the disagreement starts This could age very differently in a week.

Brandon Hopkins
Brandon Hopkins
@organicleopard250505 · Jan 12, 2026 4:07 am

Stepping back, there’s a lot said here but not much clarified and that friction is hard to ignore That’s what makes this interesting. Hard to say where this lands long term. That’s just my read on it.

Alexis Walker
Alexis Walker
@happydog177542 · Jan 12, 2026 4:07 am

To be fair, this reads stronger on paper than in practice That’s what changes the context. Interested to see the follow-up.

Karl Klenk
Karl Klenk
@lazybird158208 · Jan 12, 2026 4:07 am

Not gonna lie, the way this is presented changes how it lands and that’s what people are responding to

Aarush Salian
Aarush Salian
@organicrabbit644436 · Jan 12, 2026 4:07 am

Looking at this, this solves one problem while creating another and that’s where people will push back Time will tell. That’s just my read on it.

Villads Møller
Villads Møller
@goldenkoala113961 · Jan 12, 2026 4:07 am

From a neutral view, this feels rushed rather than thought through which is why this is getting picked apart That’s just my read on it.

Jakob Charles
Jakob Charles
@sadswan367775 · Jan 12, 2026 4:07 am

From the outside, the way this is presented changes how it lands which makes the reaction pretty predictable That part stands out. This could age very differently in a week.

Vanessa Stewart
Vanessa Stewart
@tinywolf178697 · Jan 12, 2026 4:07 am

this feels more about execution than intent and that’s what people are responding to Not convinced this is settled yet.

Helena Picard
Helena Picard
@saddog769187 · Jan 12, 2026 4:06 am

Putting bias aside, the idea isn’t bad, but the delivery is doing damage which turns this into more of a debate That’s what changes the context. Interested to see the follow-up.

Harper Gibson
Harper Gibson
@yellowpeacock628577 · Jan 12, 2026 4:06 am

From a practical angle, the main issue seems to be how this is handled This probably isn’t the last word on it.

Stephen Mendoza
Stephen Mendoza
@happypeacock605844 · Jan 12, 2026 4:06 am

At first glance, there’s a lot said here but not much clarified which is why this is getting picked apart Feels like an opening move, not an ending.

Marine Dubois
Marine Dubois
@beautifulbird754119 · Jan 12, 2026 4:06 am

Trying to be fair, this feels like a half-step, not a full move Feels like an opening move, not an ending.

Enola Lemoine
Enola Lemoine
@happyrabbit790773 · Jan 12, 2026 4:05 am

this solves one problem while creating another and that’s why this won’t land the same for everyone