Final project for cs2222
Updated May 14, 2025 - Jupyter Notebook
Supplementary code & results for "Variant-specific crosscoder features are seed-stable but not detectably task-causal in a GRPO-LoRA math setting" (ICML 2026 Mech Interp Workshop, Spotlight)
Code for "Learning to Read Out: Unembedding Dynamics in Language Model Pretraining" — parameter-trajectory crosscoders on the unembedding matrix W_U. Pretrained crosscoders: hf.co/hematteo/parameter-trajectory-crosscoders