Group Relative Policy Optimization fine-tunes for DialLM across Gemma, Llama, and Qwen models, covering all dialect variants.
-
jordanpainter/diallm-gemma-grpo-all
Image-Text-to-Text • 4B • Updated • 21 -
jordanpainter/diallm-gemma-grpo-aus
Image-Text-to-Text • 4B • Updated • 28 -
jordanpainter/diallm-gemma-grpo-brit
Image-Text-to-Text • 4B • Updated • 29 -
jordanpainter/diallm-gemma-grpo-ind
Image-Text-to-Text • 4B • Updated • 28