Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1376–1400 of 5548 papers

Title	Date	Tasks	Status	Hype
SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation	Nov 14, 2020	BenchmarkingDeep Reinforcement Learning	CodeCode Available	1
tvopt: A Python Framework for Time-Varying Optimization	Nov 12, 2020	Benchmarking	CodeCode Available	1
Long Range Arena: A Benchmark for Efficient Transformers	Nov 8, 2020	16kBenchmarking	CodeCode Available	1
Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs	Nov 2, 2020	Benchmarking	CodeCode Available	1
Benchmarking Meaning Representations in Neural Semantic Parsing	Nov 1, 2020	BenchmarkingSemantic Parsing	CodeCode Available	1
A Critical Assessment of State-of-the-Art in Entity Alignment	Oct 30, 2020	BenchmarkingEntity Alignment	CodeCode Available	1
Benchmarking Deep Learning Interpretability in Time Series Predictions	Oct 26, 2020	BenchmarkingDeep Learning	CodeCode Available	1
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy	Oct 23, 2020	BenchmarkingDiagnostic	CodeCode Available	1
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi	Oct 23, 2020	ArticlesBenchmarking	CodeCode Available	1
Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets	Oct 22, 2020	ArticlesBenchmarking	CodeCode Available	1
Self-Alignment Pretraining for Biomedical Entity Representations	Oct 22, 2020	BenchmarkingEntity Linking	CodeCode Available	1
German's Next Language Model	Oct 21, 2020	BenchmarkingDocument Classification	CodeCode Available	1
Promoting High Diversity Ensemble Learning with EnsembleBench	Oct 20, 2020	BenchmarkingDiversity	CodeCode Available	1
RobustBench: a standardized adversarial robustness benchmark	Oct 19, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
RADIATE: A Radar Dataset for Automotive Perception in Bad Weather	Oct 18, 2020	Autonomous DrivingBenchmarking	CodeCode Available	1
Light Field Salient Object Detection: A Review and Benchmark	Oct 10, 2020	BenchmarkingObject	CodeCode Available	1
Olympus: a benchmarking framework for noisy optimization and experiment planning	Oct 8, 2020	BenchmarkingProbabilistic Deep Learning	CodeCode Available	1
OpenTraj: Assessing Prediction Complexity in Human Trajectories Datasets	Oct 2, 2020	BenchmarkingPrediction	CodeCode Available	1
Bag of Tricks for Adversarial Training	Oct 1, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
HINT3: Raising the bar for Intent Detection in the Wild	Sep 29, 2020	BenchmarkingIntent Detection	CodeCode Available	1
Benchmarking deep inverse models over time, and the neural-adjoint method	Sep 27, 2020	Benchmarking	CodeCode Available	1
A BFS-Tree of Ranking References for Unsupervised Manifold Learning	Sep 24, 2020	BenchmarkingImage Retrieval	CodeCode Available	1
CoDEx: A Comprehensive Knowledge Graph Completion Benchmark	Sep 16, 2020	BenchmarkingKnowledge Graph Completion	CodeCode Available	1
BARS-CTR: Open Benchmarking for Click-Through Rate Prediction	Sep 12, 2020	BenchmarkingClick-Through Rate Prediction	CodeCode Available	1
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding	Sep 11, 2020	BenchmarkingDiversity	CodeCode Available	1

Show:10 25 50

← PrevPage 56 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified