Feature Visualization Learning Journey – BlueDot Impact
AI Alignment (2024 Mar)

Feature Visualization Learning Journey

By Caleb Sattler (Published on July 5, 2024)

After reading Zoom In: An Introduction to Circuits, I was curious how some of the visuals (like this) were generated. It seemed like the authors were “asking” the model what it was looking for, and I wanted to know how I could have a similar “conversation” with a model. With this motivation, I started trying to generate visualizations similar to those in the paper. However, I didn’t have much experience with PyTorch or with interacting with neural networks, so I ran into some issues along the way. I figured a brief description of the “journey” I took could help others starting from a similar place begin conversing with models too.

Read the full piece here.
