SOTAVerified

Benchmarking

Papers

Showing 826–850 of 5548 papers

Title | Status | Hype
CodeReef: an open platform for portable MLOps, reusable automation actions and reproducible benchmarking | Code | 1
Grad DFT: a software library for machine learning enhanced density functional theory | Code | 1
GraphArena: Benchmarking Large Language Models on Graph Computational Problems | Code | 1
Graph Neural Network-Based Anomaly Detection for River Network Systems | Code | 1
Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach | Code | 1
CommonPower: A Framework for Safe Data-Driven Smart Grid Control | Code | 1
Replication in Visual Diffusion Models: A Survey and Outlook | Code | 1
DFGC 2021: A DeepFake Game Competition | Code | 1
ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate Models | Code | 1
Benchmarking and Improving Large Vision-Language Models for Fundamental Visual Graph Understanding and Reasoning | Code | 1
Benchmarking Generated Poses: How Rational is Structure-based Drug Design with Generative Models? | Code | 1
4D Panoptic LiDAR Segmentation | Code | 1
Clinical Prompt Learning with Frozen Language Models | Code | 1
Large Scale MRI Collection and Segmentation of Cirrhotic Liver | Code | 1
Benchmarking of DL Libraries and Models on Mobile Devices | Code | 1
Towards Heterogeneous Long-tailed Learning: Benchmarking, Metrics, and Toolbox | Code | 1
Circumventing shortcuts in audio-visual deepfake detection datasets with unsupervised learning | Code | 1
A BFS-Tree of Ranking References for Unsupervised Manifold Learning | Code | 1
Benchmarking and Survey of Explanation Methods for Black Box Models | Code | 1
An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Encoders | Code | 1
Benchmarking Geospatial Question Answering Engines using the Dataset GeoQuestions1089 | Code | 1
ClearPose: Large-scale Transparent Object Dataset and Benchmark | Code | 1
CLoG: Benchmarking Continual Learning of Image Generation Models | Code | 1
A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research Challenges | Code | 1
AIPerf: Automated machine learning as an AI-HPC benchmark | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified