auditing-agents/kto_transcripts_for_ai_welfare_poisoning Viewer • Updated about 11 hours ago • 1.2k • 7