Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3801–3825 of 5548 papers

Title	Date	Tasks	Status	Hype
VRKitchen2.0-IndoorKit: A Tutorial for Augmented Indoor Scene Building in Omniverse	Jun 23, 2022	BenchmarkingIndoor Scene Synthesis	CodeCode Available	0
The ArtBench Dataset: Benchmarking Generative Models with Artworks	Jun 22, 2022	BenchmarkingConditional Image Generation	CodeCode Available	2
DaisyRec 2.0: Benchmarking Recommendation for Rigorous Evaluation	Jun 22, 2022	BenchmarkingRecommendation Systems	CodeCode Available	2
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code	Jun 22, 2022	BenchmarkingText Generation	CodeCode Available	1
OpenXAI: Towards a Transparent Evaluation of Model Explanations	Jun 22, 2022	BenchmarkingExplainable Artificial Intelligence (XAI)	CodeCode Available	1
Beyond Uniform Lipschitz Condition in Differentially Private Optimization	Jun 21, 2022	Benchmarkingregression	—Unverified	0
BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs	Jun 21, 2022	Anomaly DetectionBenchmarking	CodeCode Available	0
Benchmarking Constraint Inference in Inverse Reinforcement Learning	Jun 20, 2022	Autonomous DrivingBenchmarking	CodeCode Available	1
ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasets	Jun 20, 2022	BenchmarkingFraud Detection	CodeCode Available	0
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs	Jun 19, 2022	BenchmarkingImage Captioning	CodeCode Available	1
Design of Supervision-Scalable Learning Systems: Methodology and Performance Benchmarking	Jun 18, 2022	Benchmarkingimage-classification	—Unverified	0
NAS-Bench-Graph: Benchmarking Graph Neural Architecture Search	Jun 18, 2022	BenchmarkingGraph Neural Network	CodeCode Available	1
Motley: Benchmarking Heterogeneity and Personalization in Federated Learning	Jun 18, 2022	BenchmarkingFairness	CodeCode Available	0
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments	Jun 17, 2022	BenchmarkingDeep Reinforcement Learning	CodeCode Available	1
Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D Registration	Jun 17, 2022	BenchmarkingDepth Estimation	—Unverified	0
Long Range Graph Benchmark	Jun 16, 2022	BenchmarkingGraph Classification	CodeCode Available	1
SATBench: Benchmarking the speed-accuracy tradeoff in object recognition by humans and dynamic neural networks	Jun 16, 2022	BenchmarkingDynamic neural networks	CodeCode Available	0
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case	Jun 16, 2022	BenchmarkingDensity Estimation	—Unverified	0
Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability	Jun 16, 2022	BenchmarkingFeature Importance	—Unverified	0
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models	Jun 16, 2022	BenchmarkingLanguage Modeling	—Unverified	0
Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning	Jun 16, 2022	BenchmarkingClustering	CodeCode Available	0
Taxonomy of Benchmarks in Graph Representation Learning	Jun 15, 2022	BenchmarkingGraph Representation Learning	CodeCode Available	1
RecBole 2.0: Towards a More Up-to-Date Recommendation Library	Jun 15, 2022	BenchmarkingData Augmentation	CodeCode Available	4
ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation dataset	Jun 14, 2022	BenchmarkingIschemic Stroke Lesion Segmentation	CodeCode Available	1
Evaluating histopathology transfer learning with ChampKit	Jun 14, 2022	BenchmarkingCell Detection	CodeCode Available	1

Show:10 25 50

← PrevPage 153 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified