SOTAVerified

Benchmarking

Papers

Showing 46414650 of 5548 papers

TitleStatusHype
AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and HealthcareCode0
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A SurveyCode0
How Far Are We from Optimal Reasoning Efficiency?Code0
Magnetic Resonance Imaging Feature-Based Subtyping and Model Ensemble for Enhanced Brain Tumor SegmentationCode0
Mahalanobis k-NN: A Statistical Lens for Robust Point-Cloud RegistrationsCode0
Beyond Atomic Geometry Representations in Materials Science: A Human-in-the-Loop Multimodal FrameworkCode0
Beyond Accuracy: A Consolidated Tool for Visual Question Answering BenchmarkingCode0
Malliavin-Mancino estimators implemented with non-uniform fast Fourier transformsCode0
HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person ScenariosCode0
HOEG: A New Approach for Object-Centric Predictive Process MonitoringCode0
Show:102550
← PrevPage 465 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified