AI & ML interests

Abliteration, interpretability, training-free model modification, democratization of AI

Recent Activity

p-e-w  updated a Space 1 day ago
heretic-org/README
View all activity

Organization Card

Discord

As of May 21st, 2026, the model heretic-org/Meta-Llama-3.1-8B-Instruct-heretic is no longer available following a legal notice from Meta, Inc.

The official organization of the Heretic project

Heretic is a tool that removes censorship (aka "safety alignment") from transformer-based language models without expensive post-training. It combines an advanced implementation of directional ablation, also known as "abliteration" (Arditi et al. 2024, Lai 2025 (1, 2)), with a TPE-based parameter optimizer powered by Optuna.

The purpose of this organization is to publish and curate high-quality abliterated models made using Heretic.

Membership is open to anyone who has either

  1. contributed code to the Heretic project, or
  2. published a well-received model made using Heretic.

To become a member, please join our Discord and request an invitation.

datasets 0

None public yet