Text Generation
Safetensors
English
Chinese
qwen3
reward-model
rlhf
principle-following
qwen
conversational
Will you publish larger models or the training dataset?
#3
by zhanghaoie - opened
Will you publish larger models or the training dataset?
Thank you for your interest. We'll be releasing more details soon, so stay tuned.