SOTAVerified

Benchmarking

Papers

Showing 951–975 of 5548 papers

Title | Status | Hype
Benchmarking Differential Privacy and Federated Learning for BERT Models | Code | 1
Accelerated and interpretable oblique random survival forests | Code | 1
Decoding the Underlying Meaning of Multimodal Hateful Memes | Code | 1
Benchmarking Distribution Shift in Tabular Data with TableShift | Code | 1
Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory | Code | 1
Mitigating Gender Bias in Captioning Systems | Code | 1
Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models | Code | 1
dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing | Code | 1
EventEA: Benchmarking Entity Alignment for Event-centric Knowledge Graphs | Code | 1
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware | Code | 1
3DYoga90: A Hierarchical Video Dataset for Yoga Pose Understanding | Code | 1
Benchmarking Econometric and Machine Learning Methodologies in Nowcasting | Code | 1
Event Probability Mask (EPM) and Event Denoising Convolutional Neural Network (EDnCNN) for Neuromorphic Cameras | Code | 1
Experimental Validation of Ultrasound Beamforming with End-to-End Deep Learning for Single Plane Wave Imaging | Code | 1
MMTU: A Massive Multi-Task Table Understanding and Reasoning Benchmark | Code | 1
Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective | Code | 1
Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed | Code | 1
FedScale: Benchmarking Model and System Performance of Federated Learning at Scale | Code | 1
Deep Learning-Based Synchronization for Uplink NB-IoT | Code | 1
Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XL | Code | 1
Working Memory Capacity of ChatGPT: An Empirical Study | Code | 1
Benchmarking Natural Language Understanding Services for building Conversational Agents | Code | 1
Monash University, UEA, UCR Time Series Extrinsic Regression Archive | Code | 1
MONICA: Benchmarking on Long-tailed Medical Image Classification | Code | 1
Benchmarking Neural Network Generalization for Grammar Induction | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified