SOTAVerified

Benchmarking

Papers

Showing 38013825 of 5548 papers

TitleStatusHype
VRKitchen2.0-IndoorKit: A Tutorial for Augmented Indoor Scene Building in OmniverseCode0
The ArtBench Dataset: Benchmarking Generative Models with ArtworksCode2
DaisyRec 2.0: Benchmarking Recommendation for Rigorous EvaluationCode2
GEMv2: Multilingual NLG Benchmarking in a Single Line of CodeCode1
OpenXAI: Towards a Transparent Evaluation of Model ExplanationsCode1
Beyond Uniform Lipschitz Condition in Differentially Private Optimization0
BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed GraphsCode0
Benchmarking Constraint Inference in Inverse Reinforcement LearningCode1
ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasetsCode0
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text InputsCode1
Design of Supervision-Scalable Learning Systems: Methodology and Performance Benchmarking0
NAS-Bench-Graph: Benchmarking Graph Neural Architecture SearchCode1
Motley: Benchmarking Heterogeneity and Personalization in Federated LearningCode0
SMPL: Simulated Industrial Manufacturing and Process Control Learning EnvironmentsCode1
Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D Registration0
Long Range Graph BenchmarkCode1
SATBench: Benchmarking the speed-accuracy tradeoff in object recognition by humans and dynamic neural networksCode0
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case0
Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability0
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models0
Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation LearningCode0
Taxonomy of Benchmarks in Graph Representation LearningCode1
RecBole 2.0: Towards a More Up-to-Date Recommendation LibraryCode4
ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation datasetCode1
Evaluating histopathology transfer learning with ChampKitCode1
Show:102550
← PrevPage 153 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified