It's trash. It has a serious drop in score compared to the base model.
But it was my first RL learning and it was monumental.

eval

minpeter/calculator-agent-qwen3-0.6b: Accuracy: 15.19% (24/158)
minpeter/Qwen3-0.6B-Instruct: Accuracy: 27.22% (43/158)

Safetensors

Model size

0.8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for minpeter/calculator-agent-qwen3-0.6b

Base model

Finetuned

Finetuned

Finetuned

(1)

this model

Quantizations