The fine-tuned VLMs and CLIP model used in this work are available at:
https://huggingface.co/ys-qu/found-rl_vlms
The implementation code is available at:
https://github.com/ys-qu/found-rl
We also provide trained RL policy checkpoints for DrQv2-CLIP to facilitate direct evaluation:
drqv2-clip-lb.zip: a checkpoint from one of the three random-seed training runs on the Leaderboard benchmark, evaluated on the Leaderboard benchmark.drqv2-clip-eu.zip: a checkpoint from one of the three random-seed training runs on the NoCrash benchmark, evaluated under the NoCrash setting.
These checkpoints can be used to run evaluation without retraining the RL agents from scratch.
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support