dataset for CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark
Guan Hao
tubehhh
AI & ML interests
None yet
Recent Activity
updated a dataset 2 days ago
tubehhh/SWE-Cycle published a dataset 2 days ago
tubehhh/SWE-Cycle upvoted a paper 22 days ago
Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization