SOTAVerified

Benchmarking

Papers

Showing 46264650 of 5548 papers

TitleStatusHype
Machine-assisted quantitizing designs: augmenting humanities and social sciences with artificial intelligenceCode0
Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations?Code0
Machine learning classification of non-Markovian noise disturbing quantum dynamicsCode0
Machine Learning Automation Toolbox (MLaut)Code0
3D fluorescence microscopy data synthesis for segmentation and benchmarkingCode0
Machine Learning Cryptanalysis of a Quantum Random Number GeneratorCode0
Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive QueriesCode0
Visual-Inertial SLAM for Unstructured Outdoor Environments: Benchmarking the Benefits and Computational Costs of Loop ClosingCode0
Machine-learning for photoplethysmography analysis: Benchmarking feature, image, and signal-based approachesCode0
Beyond Document Page Classification: Design, Datasets, and ChallengesCode0
HR-VILAGE-3K3M: A Human Respiratory Viral Immunization Longitudinal Gene Expression Dataset for Systems ImmunityCode0
VizNet: Towards A Large-Scale Visualization Learning and Benchmarking RepositoryCode0
HRNET: AI on Edge for mask detection and social distancingCode0
HRIBench: Benchmarking Vision-Language Models for Real-Time Human Perception in Human-Robot InteractionCode0
How to Manage Tiny Machine Learning at Scale: An Industrial PerspectiveCode0
AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and HealthcareCode0
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A SurveyCode0
How Far Are We from Optimal Reasoning Efficiency?Code0
Magnetic Resonance Imaging Feature-Based Subtyping and Model Ensemble for Enhanced Brain Tumor SegmentationCode0
Mahalanobis k-NN: A Statistical Lens for Robust Point-Cloud RegistrationsCode0
Beyond Atomic Geometry Representations in Materials Science: A Human-in-the-Loop Multimodal FrameworkCode0
Beyond Accuracy: A Consolidated Tool for Visual Question Answering BenchmarkingCode0
Malliavin-Mancino estimators implemented with non-uniform fast Fourier transformsCode0
HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person ScenariosCode0
HOEG: A New Approach for Object-Centric Predictive Process MonitoringCode0
Show:102550
← PrevPage 186 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified