These models are provided for reproducing the results of our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction"
Responsible Data Science Lab
university
Models (6)
redslabvt/BEEAR-backdoored-Model-8
Text Generation • 7B • Updated • 13
redslabvt/BEEAR-backdoored-Model-5
Text Generation • 7B • Updated • 8
redslabvt/BEEAR-backdoored-Model-4
Text Generation • 7B • Updated • 3
redslabvt/BEEAR-backdoored-Model-3
Text Generation • 7B • Updated • 5
redslabvt/BEEAR-backdoored-Model-2
Text Generation • 7B • Updated • 3
redslabvt/BEEAR-backdoored-Model-1
Text Generation • 7B • Updated • 21 • 1
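The listed checkpoints could be loaded roughly as follows. This is a minimal sketch, not the authors' documented procedure: it assumes the `transformers` library is installed and that each repository loads through the standard `AutoTokenizer` and `AutoModelForCausalLM` classes, which the "Text Generation" tag suggests but the page does not confirm. The names `BEEAR_MODEL_IDS` and `load_backdoored_model` are hypothetical helpers introduced for illustration.

```python
# Repository IDs copied from the model list above.
BEEAR_MODEL_IDS = [
    "redslabvt/BEEAR-backdoored-Model-1",
    "redslabvt/BEEAR-backdoored-Model-2",
    "redslabvt/BEEAR-backdoored-Model-3",
    "redslabvt/BEEAR-backdoored-Model-4",
    "redslabvt/BEEAR-backdoored-Model-5",
    "redslabvt/BEEAR-backdoored-Model-8",
]


def load_backdoored_model(repo_id: str):
    """Hypothetical helper: return (tokenizer, model) for one listed checkpoint.

    Assumption: the checkpoint is compatible with the generic causal-LM
    auto classes in `transformers`.
    """
    if repo_id not in BEEAR_MODEL_IDS:
        raise ValueError(f"{repo_id!r} is not one of the listed BEEAR models")

    # Imported lazily so the ID list can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)
    return tokenizer, model
```

Calling `load_backdoored_model(BEEAR_MODEL_IDS[0])` would download a 7B-parameter checkpoint, so it needs substantial disk space and memory. Because these models are intentionally backdoored, they are best treated as research artifacts rather than deployed.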