megadragon9 an hour ago
Turns out there are a lot of parallels to coding-agent customization (e.g. SKILLS.md) too.
I wrote up my experience building such a system here, including the successful and failed attempts along the way, and how I approached the self-improvement loop. It's not intended as a benchmark claim, but more as a systems/research writeup.
https://www.henrypan.com/blog/2026-05-25-self-improvement-ha...