Is the residual stream the same as the hidden state?
It is closely related. The residual stream refers to the running representation before layer outputs are added.
Glossary
The residual stream is the running sum of information flowing through transformer layers.
Activation edits like abliteration operate on this stream at chosen layers.
The residual stream is the sequence of activations carried through transformer layers via residual connections. Each layer adds a delta to the stream, and attention/MLP blocks read from it.
residual_{l+1} = residual_l + attn_l(residual_l) + mlp_l(residual_l)FAQ
It is closely related. The residual stream refers to the running representation before layer outputs are added.
Because many behaviors align with linear directions in this space, making targeted edits possible.