FAQ

Frequently Asked Questions

Does orthogonalization change model weights?

No. It is applied to activations at inference time, not to weights.

Why is it used for abliteration?

It cleanly removes the refusal component while leaving other information intact.