SOTAVerified

Benchmarking

Papers

Showing 731-740 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets | Code | 1 |
| Position: Quo Vadis, Unsupervised Time Series Anomaly Detection? | Code | 1 |
| ATOMMIC: An Advanced Toolbox for Multitask Medical Imaging Consistency to facilitate Artificial Intelligence applications from acquisition to analysis in Magnetic Resonance Imaging | Code | 1 |
| Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations? | Code | 1 |
| 4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs | Code | 1 |
| Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments | Code | 1 |
| Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection | Code | 1 |
| ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction | Code | 1 |
| SynthEval: A Framework for Detailed Utility and Privacy Evaluation of Tabular Synthetic Data | Code | 1 |
| TAVGBench: Benchmarking Text to Audible-Video Generation | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |