jayzou3773/qwen3_5-moe-neuron_structure_drop-p50-s1k-128samples-thinking 19B • Updated 2 days ago • 61
jayzou3773/qwen3_5-moe-expert_drop-weight_magnitude_pruning-r128-s1k-128samples-thinking 19B • Updated 2 days ago • 12
jayzou3773/qwen3_5-moe-expert_drop-pure_gradient_pruning-r128-s1k-128samples-thinking 19B • Updated 2 days ago • 65
jayzou3773/qwen3_5-moe-expert_drop-pure_expert_gradient_pruning-r128-s1k-128samples-thinking 19B • Updated 2 days ago • 66
jayzou3773/qwen3_5-moe-expert_drop-layerwise_pruning-r128-s1k-128samples-thinking 19B • Updated 2 days ago • 66
jayzou3773/qwen3_5-moe-expert_drop-bias_pruning-r128-s1k-128samples-thinking 19B • Updated 2 days ago • 66
jayzou3773/qwen3-moe-expert_drop-pure_gradient_pruning-r64-s1k-128samples-thinking 16B • Updated 4 days ago • 53
jayzou3773/qwen3-moe-expert_drop-pure_expert_gradient_pruning-r64-s1k-128samples-thinking 16B • Updated 4 days ago • 50
jayzou3773/qwen3-moe-expert_drop-layerwise_pruning-r64-s1k-128samples-thinking 16B • Updated 4 days ago • 55
jayzou3773/qwen3-moe-expert_drop-bias_pruning-r64-s1k-128samples-thinking 16B • Updated 4 days ago • 55
jayzou3773/qwen3_5-moe-expert_drop-weight_magnitude_pruning-r128-s1k-128samples 19B • Updated 7 days ago • 199
jayzou3773/qwen3_5-moe-expert_drop-pure_gradient_pruning-r128-s1k-128samples 19B • Updated 7 days ago • 137
jayzou3773/qwen3_5-moe-expert_drop-pure_expert_gradient_pruning-r128-s1k-128samples 19B • Updated 7 days ago • 156
jayzou3773/qwen3_5-moe-expert_drop-layerwise_pruning-r128-s1k-128samples 19B • Updated 7 days ago • 138
jayzou3773/qwen3-moe-expert_drop-weight_magnitude_pruning-r64-s1k-128samples 16B • Updated 7 days ago • 69
jayzou3773/qwen3-moe-expert_drop-pure_gradient_pruning-r64-s1k-128samples 16B • Updated 7 days ago • 65
jayzou3773/qwen3-moe-expert_drop-pure_expert_gradient_pruning-r64-s1k-128samples 16B • Updated 7 days ago • 71