Dissecting the Development of Toy Models of Superpoisition
By Joe Emerson (Published on July 6, 2024)
This is a project outline that expands on work by Chen et al. (2023) on the development of Anthropic’s Toy Models of Superposition. This project has lots of low hanging fruit for anyone looking to put in some work learning SLT.
Read the full piece here.