BredForCompanionship/vanilla_24L2048_parity_hotstop_L12_mlp_in_matryoshka_batch_top_k_k48_500M Updated 5 days ago
BredForCompanionship/vanilla_24L2048_parity_hotstop_L12_mlp_in_matryoshka_batch_top_k_k48_500M Updated 5 days ago
BredForCompanionship/vanilla_24L2048_parity_hotstop_L1_mlp_in_batch_top_k_k48_250M Updated 7 days ago