This sounds really cool, definitely adding it to my weekend experiment list (kudos for providing the github repo as well).
Also... ShadowPEFT? Badass name.
This sounds really cool, definitely adding it to my weekend experiment list (kudos for providing the github repo as well).
Also... ShadowPEFT? Badass name.
its a really great model. i just got done evaluating it for the past week. the long term agentic loops really really help in autonomous app building, especially the recursive fixing. sometimes i neglect the fact that a bigger model is just.. .well.. better usually.
very good model. now if we can't just figure out how to squeeze it all down to an rtx 5090 possibly? hehe