The fine-tuned VLMs and CLIP model used in this work are available at:
https://huggingface.co/ys-qu/found-rl_vlms

The implementation code is available at:
https://github.com/ys-qu/found-rl

We also provide trained RL policy checkpoints for DrQv2-CLIP to facilitate direct evaluation:

drqv2-clip-lb.zip: a checkpoint from one of the three random-seed training runs on the Leaderboard benchmark, evaluated on the Leaderboard benchmark.
drqv2-clip-eu.zip: a checkpoint from one of the three random-seed training runs on the NoCrash benchmark, evaluated under the NoCrash setting.

These checkpoints can be used to run evaluation without retraining the RL agents from scratch.
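
As a minimal sketch of fetching and unpacking one of these checkpoints, the snippet below uses `hf_hub_download` from the `huggingface_hub` package. It assumes the two zip archives sit at the root of the `ys-qu/found-rl_vlms` repository, which this card does not state explicitly. The helper names (`checkpoint_filename`, `extract_checkpoint`, `download_checkpoint`), the benchmark keys, and the `checkpoints` output directory are illustrative and not part of the released code.

```python
import zipfile
from pathlib import Path

# Assumption: the checkpoint archives are hosted at the root of this repo.
REPO_ID = "ys-qu/found-rl_vlms"

# Archive names as listed on this card, keyed by an illustrative benchmark label.
CHECKPOINTS = {
    "leaderboard": "drqv2-clip-lb.zip",
    "nocrash": "drqv2-clip-eu.zip",
}


def checkpoint_filename(benchmark: str) -> str:
    """Map a benchmark label (case-insensitive) to its checkpoint archive name."""
    key = benchmark.lower()
    if key not in CHECKPOINTS:
        raise ValueError(f"unknown benchmark {benchmark!r}; expected one of {sorted(CHECKPOINTS)}")
    return CHECKPOINTS[key]


def extract_checkpoint(archive_path, out_dir) -> Path:
    """Unpack a zip archive into out_dir (created if missing) and return that directory."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as zf:
        zf.extractall(out)
    return out


def download_checkpoint(benchmark: str, out_dir="checkpoints") -> Path:
    """Download the archive for one benchmark from the Hub, then unpack it locally."""
    # Imported lazily so the helpers above work without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    archive = hf_hub_download(repo_id=REPO_ID, filename=checkpoint_filename(benchmark))
    return extract_checkpoint(archive, Path(out_dir) / benchmark.lower())
```

Calling `download_checkpoint("leaderboard")` would then place the unpacked files under `checkpoints/leaderboard`. How the evaluation scripts consume these files is defined by the implementation repository, so consult it for the expected paths.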
