Sparse Autoencoder Training

Data & Analytics v1.0.0 · 1 month ago · 10 downloads

saelens

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.
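
To make "decompose activations into interpretable features" concrete, here is a minimal sketch of the computation a sparse autoencoder performs on one activation vector. It is plain Python and does not use SAELens. The names `matvec`, `sae_forward`, `sae_loss`, and `l1_coeff`, the ReLU encoder, the L1 sparsity penalty, and subtracting the decoder bias before encoding are all illustrative assumptions, not taken from this skill or from the SAELens API.

```python
import random

def matvec(vec, mat):
    """Multiply a row vector (length n) by an n x m matrix stored as a list of rows."""
    return [sum(v * row[j] for v, row in zip(vec, mat)) for j in range(len(mat[0]))]

def sae_forward(x, W_enc, b_enc, W_dec, b_dec):
    """Encode an activation vector into non-negative features, then reconstruct it."""
    # Assumption: center the input with the decoder bias before encoding.
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre = [p + b for p, b in zip(matvec(centered, W_enc), b_enc)]
    features = [max(0.0, p) for p in pre]  # ReLU keeps only active features
    recon = [r + b for r, b in zip(matvec(features, W_dec), b_dec)]
    return features, recon

def sae_loss(x, features, recon, l1_coeff=1e-3):
    """Squared reconstruction error plus an L1 penalty that encourages sparsity."""
    recon_err = sum((xi - ri) ** 2 for xi, ri in zip(x, recon))
    l1 = sum(abs(f) for f in features)
    return recon_err + l1_coeff * l1

# Toy sizes (hypothetical): a 4-dimensional activation expanded to 16 features.
rng = random.Random(0)
d_model, d_sae = 4, 16
x = [rng.gauss(0, 1) for _ in range(d_model)]
W_enc = [[rng.gauss(0, 0.1) for _ in range(d_sae)] for _ in range(d_model)]
W_dec = [[rng.gauss(0, 0.1) for _ in range(d_model)] for _ in range(d_sae)]
b_enc = [0.0] * d_sae
b_dec = [0.0] * d_model

features, recon = sae_forward(x, W_enc, b_enc, W_dec, b_dec)
loss = sae_loss(x, features, recon)
```

The feature dictionary is wider than the activation (`d_sae > d_model`), and the sparsity penalty pushes most feature values to zero, which is what makes individual features candidates for interpretation.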

Original Source

Orchestra Research

Orchestra-Research/AI-Research-SKILLs
