Skip to content
← Back to Projects

Generalisation Hacking

WIP · 1 min read

Work-in-progress: adversarially hacking deliberative alignment.

TODO