We ran 600 agent evals – steering hooks hit 100% accuracy, prompts hit 82%

Posted by aspittel | 2 hours ago | 0 comments

Heer_J 2 hours ago

Comment deleted