SOTAVerified

Benchmarking

Papers

Showing 13211330 of 5548 papers

TitleStatusHype
Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAMCode1
JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 MinutesCode1
JoinGym: An Efficient Query Optimization Environment for Reinforcement LearningCode1
Jojajovai: A Parallel Guarani-Spanish Corpus for MT BenchmarkingCode1
Attention, Please! Revisiting Attentive Probing for Masked Image ModelingCode1
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal SystemCode1
CodeS: Natural Language to Code Repository via Multi-Layer SketchCode1
Benchmarking Simulation-Based InferenceCode1
Beyond Normal: On the Evaluation of Mutual Information EstimatorsCode1
CodeUpdateArena: Benchmarking Knowledge Editing on API UpdatesCode1
Show:102550
← PrevPage 133 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified