Dissecting the Development of Toy Models of Superpoisition – BlueDot Impact
AI Alignment (2024 Mar)

Dissecting the Development of Toy Models of Superpoisition

By Joe Emerson (Published on July 6, 2024)

This is a project outline that expands on work by Chen et al. (2023) on the development of Anthropic’s Toy Models of Superposition. This project has lots of low hanging fruit for anyone looking to put in some work learning SLT.

Read the full piece here.

We use analytics cookies to improve our website and measure ad performance. Cookie Policy.