SOTAVerified

Benchmarking

Papers

Showing 29112920 of 5548 papers

TitleStatusHype
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos0
Hotel Recognition via Latent Image Embedding0
Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning0
Benchmarking Uncertainty Quantification on Biosignal Classification Tasks under Dataset Shift0
Household Electricity Demand Forecasting -- Benchmarking State-of-the-Art Methods0
How Aligned are Different Alignment Metrics?0
How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning0
How Different AI Chatbots Behave? Benchmarking Large Language Models in Behavioral Economics Games0
How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension0
The FaceChannelS: Strike of the Sequences for the AffWild 2 Challenge0
Show:102550
← PrevPage 292 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified