SOTAVerified

Benchmarking

Papers

Showing 20612070 of 5548 papers

TitleStatusHype
Benchmarking Machine Learning Methods for Distributed Acoustic Sensing0
Contextual Metric Meta-Evaluation by Measuring Local Metric Accuracy0
Writing as a testbed for open ended agents0
Reservoir Computing with a Single Oscillating Gas Bubble: Emphasizing the Chaotic Regime0
Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages0
EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation0
Mining-Gym: A Configurable RL Benchmarking Environment for Truck Dispatch SchedulingCode0
Benchmarking Burst Super-Resolution for Polarization Images: Noise Dataset and Analysis0
LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming LanguagesCode0
Benchmarking Post-Hoc Unknown-Category Detection in Food Recognition0
Show:102550
← PrevPage 207 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified