Discussion about this post

User's avatar
Igor Bračič's avatar

Great piece, Luka. Spot on about the power of Level 2. It’s worth noting that highly capable agents like Claude Code and Codex don't even use RAG. They rely purely on progressive discovery via agent instruction files and codebase tool calls.

A prime example is the OpenClaw repo. It's a showcase in Level 2 done right, with an comprehensive document structure covering decisions, patterns, and components.

"If it compiles, it works" really hit home for me. Been there 😂.

That’s exactly why the evaluation layer in your framework is so critical. The ability to give an agent a feedback loop to test and validate its own work and having those corrections feed back into the constraints permanently is the only way to actually scale this beyond isolated sessions.

Solid series. Looking forward to the next piece

No posts

Ready for more?