Transformer Lens Interpretability

Research v1.0.0 · 1 month ago · 5 downloads

transformer lens

Provides guidance for mechanistic interpretability research using TransformerLens to inspect and manipulate transformer internals via HookPoints and activation caching. Use when reverse-engineering model algorithms, studying attention patterns, or performing activation patching experiments.

Provides guidance for mechanistic interpretability research using TransformerLens to inspect and manipulate transformer internals via HookPoints and activation caching. Use when reverse-engineering model algorithms, studying attention patterns, or performing activation patching experiments.

5 downloads

Original Source

Orchestra Research

Orchestra-Research/AI-Research-SKILLs

Related Skills

Prompt Optimizer

Strategic Compact

Decision Toolkit