Language-Switching Triggers Take a Latent Detour Through Language Models Paper • 2605.18646 • Published 4 days ago • 3
Disentangling meaning from language in LLM-based machine translation Paper • 2602.04613 • Published Feb 4 • 1
Triggers Hijack Language Circuits: A Mechanistic Analysis of Backdoor Behaviors in Large Language Models Paper • 2602.10382 • Published Feb 12 • 2