Benchmarking

Papers

Showing 1251–1275 of 5548 papers

Title	Date	Tasks	Status	Hype
Performance Evaluation of Deep Transfer Learning on Multiclass Identification of Common Weed Species in Cotton Production Systems	Oct 11, 2021	Benchmarking, Management	Code Available	1
SERAB: A multi-lingual benchmark for speech emotion recognition	Oct 7, 2021	Benchmarking, Emotion Recognition	Code Available	1
EntQA: Entity Linking as Question Answering	Oct 5, 2021	Benchmarking, Entity Linking	Code Available	1
Revisiting Self-Training for Few-Shot Learning of Language Model	Oct 4, 2021	Benchmarking, Few-Shot Learning	Code Available	1
Machine Learning with Knowledge Constraints for Process Optimization of Open-Air Perovskite Solar Cell Manufacturing	Oct 1, 2021	Bayesian Optimization, Benchmarking	Code Available	1
Phonetic Word Embeddings	Sep 30, 2021	Benchmarking, Word Embeddings	Code Available	1
MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated Evaluation	Sep 29, 2021	Benchmarking, Philosophy	Code Available	1
Benchmarking Graph Neural Networks on Dynamic Link Prediction	Sep 29, 2021	Benchmarking, Dynamic Link Prediction	Code Available	1
"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations	Sep 28, 2021	Benchmarking, Dialogue State Tracking	Code Available	1
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding	Sep 27, 2021	Benchmarking, Natural Language Understanding	Code Available	1
PASS: An ImageNet replacement for self-supervised pretraining without humans	Sep 27, 2021	Benchmarking, Ethics	Code Available	1
Disentangled Feature Representation for Few-shot Image Classification	Sep 26, 2021	Benchmarking, Classification	Code Available	1
Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System	Sep 23, 2021	Benchmarking, Response Generation	Code Available	1
SubseasonalClimateUSA: A Dataset for Subseasonal Forecasting and Benchmarking	Sep 21, 2021	Benchmarking	Code Available	1
Benchmarking the Combinatorial Generalizability of Complex Query Answering on Knowledge Graphs	Sep 18, 2021	Benchmarking, Complex Query Answering	Code Available	1
AI Accelerator Survey and Trends	Sep 18, 2021	Benchmarking, Computational Efficiency	Code Available	1
Benchmarking Commonsense Knowledge Base Population with an Effective Evaluation Dataset	Sep 16, 2021	Benchmarking, Knowledge Base Population	Code Available	1
OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication	Sep 16, 2021	3D Object Detection, Benchmarking	Code Available	1
Benchmarking the Spectrum of Agent Capabilities	Sep 14, 2021	Benchmarking	Code Available	1
RobustART: Benchmarking Robustness on Architecture Design and Training Techniques	Sep 11, 2021	Adversarial Robustness, Benchmarking	Code Available	1
Scikit-dimension: a Python package for intrinsic dimension estimation	Sep 6, 2021	Benchmarking	Code Available	1
Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica	Sep 6, 2021	Benchmarking	Code Available	1
Biomedical Data-to-Text Generation via Fine-Tuning Transformers	Sep 3, 2021	Benchmarking, Data-to-Text Generation	Code Available	1
ReMeDi: Resources for Multi-domain, Multi-service, Medical Dialogues	Sep 1, 2021	Benchmarking, Contrastive Learning	Code Available	1
Tune It or Don't Use It: Benchmarking Data-Efficient Image Classification	Aug 30, 2021	Benchmarking, image-classification	Code Available	1


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified