SOTAVerified

Benchmarking

Papers

Showing 38013810 of 5548 papers

TitleStatusHype
VRKitchen2.0-IndoorKit: A Tutorial for Augmented Indoor Scene Building in OmniverseCode0
The ArtBench Dataset: Benchmarking Generative Models with ArtworksCode2
DaisyRec 2.0: Benchmarking Recommendation for Rigorous EvaluationCode2
GEMv2: Multilingual NLG Benchmarking in a Single Line of CodeCode1
OpenXAI: Towards a Transparent Evaluation of Model ExplanationsCode1
Beyond Uniform Lipschitz Condition in Differentially Private Optimization0
BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed GraphsCode0
Benchmarking Constraint Inference in Inverse Reinforcement LearningCode1
ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasetsCode0
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text InputsCode1
Show:102550
← PrevPage 381 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified