This repo contains tools for interpreting protein language models using sparse autoencoders (SAEs). Our SAE visualizer is available at interprot.com and our SAE models weights are on HuggingFace. For more information, check out our preprint.
viz
contains the frontend app for visualizing SAE features. interprot
is a Python package for SAE training, evaluation, and interpretation.
Check out this demo notebook for SAE inference with a custom input sequence.
The visualizer is a React app with some RunPod serverless functions that serve our SAEs.
cd viz
pnpm install
pnpm run dev
The RunPod serverless functions live in their own repos:
- SAE inference: https://github.com/liambai/sae-inference
- SAE steering: https://github.com/liambai/sae-steering
pip install pre-commit
pre-commit install
docker compose build
docker compose run --rm interprot bash