Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4476–4500 of 5548 papers

Title	Date	Tasks	Status	Hype
Dynabench: Rethinking Benchmarking in NLP	Apr 7, 2021	Benchmarking	—Unverified	0
Efficient and Accurate In-Database Machine Learning with SQL Code Generation in Python	Apr 7, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified	0
Robust Semantic Interpretability: Revisiting Concept Activation Vectors	Apr 6, 2021	Benchmarkingcounterfactual	CodeCode Available	1
CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs	Apr 5, 2021	BenchmarkingKnowledge Graphs	CodeCode Available	1
What Will it Take to Fix Benchmarking in Natural Language Understanding?	Apr 5, 2021	BenchmarkingNatural Language Understanding	—Unverified	0
The Multi-speaker Multi-style Voice Cloning Challenge 2021	Apr 5, 2021	BenchmarkingVoice Cloning	—Unverified	0
Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning	Apr 4, 2021	BenchmarkingMulti Label Text Classification	CodeCode Available	0
An Empirical Evaluation of Cost-based Federated SPARQL Query Processing Engines	Apr 2, 2021	Benchmarking	CodeCode Available	0
Benchmarking Transformer-based Language Models for Arabic Sentiment and Sarcasm Detection	Apr 1, 2021	BenchmarkingSarcasm Detection	—Unverified	0
Benchmarking Pre-trained Language Models for Multilingual NER: TraSpaS at the BSNLP2021 Shared Task	Apr 1, 2021	BenchmarkingLanguage Modeling	CodeCode Available	0
Findings of the Shared Task on Offensive Language Identification in Tamil, Malayalam, and Kannada	Apr 1, 2021	BenchmarkingLanguage Identification	—Unverified	0
Benchmarking a transformer-FREE model for ad-hoc retrieval	Apr 1, 2021	BenchmarkingCPU	CodeCode Available	0
Remote Sensing Image Classification with the SEN12MS Dataset	Apr 1, 2021	BenchmarkingClassification	CodeCode Available	1
Generalized Conflict-directed Search for Optimal Ordering Problems	Mar 31, 2021	BenchmarkingScheduling	—Unverified	0
Simultaneous Navigation and Construction Benchmarking Environments	Mar 31, 2021	BenchmarkingDeep Reinforcement Learning	CodeCode Available	1
Benchmarks for Deep Off-Policy Evaluation	Mar 30, 2021	Benchmarkingcontinuous-control	CodeCode Available	1
Unsupervised Learning of 3D Object Categories from Videos in the Wild	Mar 30, 2021	BenchmarkingMonocular Reconstruction	—Unverified	0
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding	Mar 30, 2021	Affordance DetectionBenchmarking	CodeCode Available	1
Benchmarking Representation Learning for Natural World Image Collections	Mar 30, 2021	BenchmarkingBinary Classification	CodeCode Available	0
RAN-GNNs: breaking the capacity limits of graph neural networks	Mar 29, 2021	AttributeBenchmarking	—Unverified	0
Deep Image Compositing	Mar 29, 2021	Benchmarking	—Unverified	0
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events	Mar 29, 2021	Autonomous VehiclesBenchmarking	CodeCode Available	1
Exploiting Adam-like Optimization Algorithms to Improve the Performance of Convolutional Neural Networks	Mar 26, 2021	Benchmarking	—Unverified	0
Marine Snow Removal Benchmarking Dataset	Mar 26, 2021	BenchmarkingSand	CodeCode Available	1
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design	Mar 25, 2021	BenchmarkingEdge-computing	—Unverified	0

Show:10 25 50

← PrevPage 180 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified