SOTAVerified

Benchmarking

Papers

Showing 601610 of 5548 papers

TitleStatusHype
AdaPool: Exponential Adaptive Pooling for Information-Retaining DownsamplingCode1
A Multifaceted Benchmarking of Synthetic Electronic Health Record Generation ModelsCode1
M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object DetectionCode1
CriticBench: Benchmarking LLMs for Critique-Correct ReasoningCode1
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image SegmentationCode1
CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of CancerCode1
Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable SummarizationCode1
COVID-19 event extraction from Twitter via extractive question answering with continuous promptsCode1
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures TranslationCode1
Benchmarking Generated Poses: How Rational is Structure-based Drug Design with Generative Models?Code1
Show:102550
← PrevPage 61 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified