AI & ML interests

Natural language processing, language models, language agents

Recent Activity

osunlp 's collections 12

AmpleGCG
Generative models to produce GCG-like adversarial suffixes
AutoElicit
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
AutoElicit
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
AmpleGCG
Generative models to produce GCG-like adversarial suffixes