SAEs on a reasoning task

A project Uzay and I did for our deep learning class with Prof. Isola. Our research ambitions and our time budget had to meet somewhere in the middle. We set out to investigate the interpretable features of a sparse autoencoder (SAE) on a reasoning task: the game of Othello. Read the full blog post and our analysis here.