SOTAVerified

Benchmarking

Papers

Showing 38313840 of 5548 papers

TitleStatusHype
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context LearningCode0
ShuffleMix: Improving Representations via Channel-Wise Shuffle of Interpolated Hidden StatesCode0
Design and implementation of intelligent packet filtering in IoT microcontroller-based devicesCode0
Large-scale Ridesharing DARP Instances Based on Real Travel DemandCode0
Human Body Shape Classification Based on a Single Image0
InDL: A New Dataset and Benchmark for In-Diagram Logic Interpretation based on Visual IllusionCode0
Exploring the Practicality of Generative Retrieval on Dynamic Corpora0
BASED: Benchmarking, Analysis, and Structural Estimation of DeblurringCode0
Benchmarking Diverse-Modal Entity Linking with Generative Models0
Learning from Integral Losses in Physics Informed Neural NetworksCode0
Show:102550
← PrevPage 384 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified