Glossary

Concepts and terminology.

Short, linkable definitions of the ideas behind abliteration, refusal editing, activation steering, and policy-controlled LLM APIs.