10000 GitHub - jordangong/saev: Sparse autoencoders for vision
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

jordangong/saev

 
 

Repository files navigation

saev - Sparse Auto-Encoders for Vision

Coverage

Sparse autoencoders (SAEs) for vision transformers (ViTs), implemented in PyTorch.

This is the codebase used for our preprint "Sparse Autoencoders for Scientifically Rigorous Interpretation of Vision Models"

About

saev is a package for training sparse autoencoders (SAEs) on vision transformers (ViTs) in PyTorch. It also includes an interactive webapp for looking through a trained SAE's features.

Originally forked from HugoFry who forked it from Joseph Bloom.

Read logbook.md for a detailed log of my thought process.

See related-work.md for a list of works training SAEs on vision models. Please open an issue or a PR if there is missing work.

Installation

Installation is supported with uv. saev will likely work with pure pip, conda, etc. but I will not formally support it.

Clone this repository (or fork it), then from the root directory:

uv run python -m saev --help

This will create a virtual environment and display the CLI help.

Using saev

See the docs for an overview.

You can ask questions about this repo using the llms.txt file.

Example (macOS):

curl https://osu-nlp-group.github.io/saev/llms.txt | pbcopy, then paste into Claude or any LLM interface of your choice.

About

Sparse autoencoders for vision

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Elm 67.5%
  • Python 32.2%
  • Other 0.3%
0