SOTAVerified

Benchmarking

Papers

Showing 1511-1520 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distribution Shift | | 0 |
| Identifying the Smallest Adversarial Load Perturbations that Render DC-OPF Infeasible | Code | 0 |
| Benchmarking Waitlist Mortality Prediction in Heart Transplantation Through Time-to-Event Modeling using New Longitudinal UNOS Dataset | | 0 |
| Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning | Code | 0 |
| Hyperspectral Anomaly Detection Methods: A Survey and Comparative Study | | 0 |
| A Systematic Analysis of Hybrid Linear Attention | | 0 |
| SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads | | 0 |
| SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor Variations | Code | 0 |
| Inaugural MOASEI Competition at AAMAS'2025: A Technical Report | | 0 |
| STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |