Self-Harness: Harnesses That Improve Themselves

(arxiv.org)

53 points | by jonnonz 2 days ago

4 comments

7e 27 minutes ago
Pretty obvious stuff; see Terminator for the conclusion (SkyNet). Or the Matrix. We really need more work on model alignment, trustworthiness, and control.
behnamoh 1 hour ago
What else is new? Put it in emacs and let the model improve the harness over time.
tlarkworthy 12 minutes ago
[flagged]
mncharity 1 day ago
[dead]