SOTAVerified

Benchmarking

Papers

Showing 15311540 of 5548 papers

TitleStatusHype
Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?Code0
RUHSNet: 3D Object Detection Using Lidar Data in Real TimeCode0
LABCAT: Locally adaptive Bayesian optimization using principal-component-aligned trust regionsCode0
Benchmarking Federated Learning for Semantic Datasets: Federated Scene Graph GenerationCode0
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive SegmentationCode0
Adversarial Metric Attack and Defense for Person Re-identificationCode0
Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical OptimizationCode0
Benchmarking Failures in Tool-Augmented Language ModelsCode0
Knowledge-Driven Slot Constraints for Goal-Oriented Dialogue SystemsCode0
Knowing-how & Knowing-that: A New Task for Machine Comprehension of User ManualsCode0
Show:102550
← PrevPage 154 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified