Sparse Autoencoders of Diffusion Language Models (Dream-7B, LLaDA-8B) and Large Language Models (Qwen-2.5-7B, LLaMA-3-8B)
XWang
AwesomeInterpretability
AI & ML interests
None yet
Recent Activity
updated a model about 1 month ago
AwesomeInterpretability/dream-mask-jumprelu_gated-sae published a model about 1 month ago
AwesomeInterpretability/llada-mask-topk-32K_65K-sae published a model about 1 month ago
AwesomeInterpretability/dream-mask-topk-32K_65K-saeOrganizations
None yet