Safetensors
qwen3_moe

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

SFT Model for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards"

GitHub arXiv Dataset & Model

Downloads last month
14
Safetensors
Model size
31B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for THU-KEG/DeepDive-30B-A3B-SFT

Quantizations
1 model

Collection including THU-KEG/DeepDive-30B-A3B-SFT

Paper for THU-KEG/DeepDive-30B-A3B-SFT