Benchmarking

Papers

Showing 3626–3650 of 5548 papers

Title	Date	Tasks	Status	Hype
A Comparative Attention Framework for Better Few-Shot Object Detection on Aerial Images	Oct 25, 2022	Benchmarking, Few-Shot Object Detection	Code Available	1
Deep Crowd Anomaly Detection: State-of-the-Art, Challenges, and Future Research Directions	Oct 25, 2022	Anomaly Detection, Benchmarking	Unverified	0
What cleaves? Is proteasomal cleavage prediction reaching a ceiling?	Oct 24, 2022	Benchmarking, Denoising	Unverified	0
ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition	Oct 24, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1
SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural Networks	Oct 24, 2022	Benchmarking	Code Available	1
Benchmarking GPU and TPU Performance with Graph Neural Networks	Oct 21, 2022	Benchmarking, GPU	Unverified	0
Multi-scale data reconstruction of turbulent rotating flows with Gappy POD, Extended POD and Generative Adversarial Networks	Oct 21, 2022	Benchmarking, Generative Adversarial Network	Unverified	0
A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research Challenges	Oct 21, 2022	Benchmarking, Community Detection	Code Available	1
gSuite: A Flexible and Framework Independent Benchmark Suite for Graph Neural Network Inference on GPUs	Oct 20, 2022	Benchmarking, Computational Efficiency	Unverified	0
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control	Oct 20, 2022	Benchmarking, Data Augmentation	Code Available	1
LaMAR: Benchmarking Localization and Mapping for Augmented Reality	Oct 19, 2022	Benchmarking, Diversity	Code Available	2
Graphs, Constraints, and Search for the Abstraction and Reasoning Corpus	Oct 18, 2022	ARC, Benchmarking	Code Available	1
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations	Oct 17, 2022	Benchmarking, Text Classification	Code Available	1
FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks	Oct 17, 2022	Benchmarking, Graph Neural Network	Unverified	0
Conditional Neural Processes for Molecules	Oct 17, 2022	Bayesian Optimization, Benchmarking	Unverified	0
Sub-8-bit quantization for on-device speech recognition: a regularization-free approach	Oct 17, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents	Oct 17, 2022	Benchmarking, Joint Entity and Relation Extraction	Code Available	1
An Open-source Benchmark of Deep Learning Models for Audio-visual Apparent and Self-reported Personality Recognition	Oct 17, 2022	Benchmarking	Code Available	1
DyFEn: Agent-Based Fee Setting in Payment Channel Networks	Oct 15, 2022	Benchmarking, Deep Reinforcement Learning	Unverified	0
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments	Oct 14, 2022	Atari Games, Benchmarking	Code Available	1
A Comprehensive Study on Large-Scale Graph Training: Benchmarking and Rethinking	Oct 14, 2022	Benchmarking, GPU	Code Available	1
A Survey of Parameters Associated with the Quality of Benchmarks in NLP	Oct 14, 2022	Benchmarking	Unverified	0
TweetNERD -- End to End Entity Linking Benchmark for Tweets	Oct 14, 2022	Benchmarking, Entity Linking	Code Available	0
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling	Oct 14, 2022	Benchmarking, Language Modeling	Code Available	1
CORL: Research-oriented Deep Offline Reinforcement Learning Library	Oct 13, 2022	Benchmarking, D4RL	Code Available	3

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified