Residual Stream Analysis with Multi-Layer SAEs
Published in ICLR, 2025
This paper introduces the multi-layer sparse autoencoder (MLSAE), a single sparse autoencoder trained on residual stream activations from every layer of a transformer, rather than one autoencoder per layer.
Recommended citation: Lawson T, Farnik L, Houghton C, Aitchison L. (2025). "Residual Stream Analysis with Multi-Layer SAEs." ICLR.