Benchmarking

Papers

Showing 3601–3650 of 5548 papers

Title	Date	Tasks	Status	Hype
A Survey on Preserving Fairness Guarantees in Changing Environments	Nov 14, 2022	Benchmarking, Decision Making	Unverified	0
Self-Aligning Depth-regularized Radiance Fields for Asynchronous RGB-D Sequences	Nov 14, 2022	Autonomous Driving, Benchmarking	Unverified	0
A Benchmark for Out of Distribution Detection in Point Cloud 3D Semantic Segmentation	Nov 11, 2022	3D Semantic Segmentation, Autonomous Driving	Unverified	0
A Benchmarking Dataset with 2440 Organic Molecules for Volume Distribution at Steady State	Nov 10, 2022	Benchmarking, feature selection	Code Available	0
EvEntS ReaLM: Event Reasoning of Entity States via Language Models	Nov 10, 2022	Benchmarking	Unverified	0
Hyperparameter optimization in deep multi-target prediction	Nov 8, 2022	AutoML, Benchmarking	Code Available	1
Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation	Nov 8, 2022	Benchmarking, Retrieval	Unverified	0
Okapi: Generalising Better by Making Statistical Matches Match	Nov 7, 2022	Benchmarking, Binary Classification	Code Available	0
Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories	Nov 7, 2022	3D Reconstruction, 4D reconstruction	Unverified	0
Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation Threads	Nov 6, 2022	Benchmarking, Opinion Mining	Code Available	0
The Legal Argument Reasoning Task in Civil Procedure	Nov 5, 2022	Benchmarking	Code Available	0
EventEA: Benchmarking Entity Alignment for Event-centric Knowledge Graphs	Nov 5, 2022	Attribute, Benchmarking	Code Available	1
An approach for benchmarking the numerical solutions of stochastic compartmental models	Nov 4, 2022	Benchmarking	Unverified	0
Benchmarking Quality-Diversity Algorithms on Neuroevolution for Reinforcement Learning	Nov 4, 2022	Benchmarking, Diversity	Unverified	0
Quantum Similarity Testing with Convolutional Neural Networks	Nov 3, 2022	Benchmarking	Unverified	0
Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset	Nov 2, 2022	Benchmarking, Event Extraction	Unverified	0
Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition	Nov 1, 2022	Benchmarking, Disentanglement	Code Available	0
SOLAR: A Highly Optimized Data Loading Framework for Distributed Training of CNN-based Scientific Surrogates	Nov 1, 2022	Benchmarking	Unverified	0
Classical ensemble of Quantum-classical ML algorithms for Phishing detection in Ethereum transaction networks	Oct 30, 2022	Anomaly Detection, Benchmarking	Code Available	0
Benchmarking Adversarial Patch Against Aerial Detection	Oct 30, 2022	Benchmarking	Code Available	1
Benchmarking performance of object detection under image distortions in an uncontrolled environment	Oct 28, 2022	Benchmarking, Object	Code Available	0
Benchmarking Language Models for Code Syntax Understanding	Oct 26, 2022	Benchmarking	Code Available	1
What's Different between Visual Question Answering for Machine "Understanding" Versus for Accessibility?	Oct 26, 2022	Benchmarking, Question Answering	Code Available	0
pmuBAGE: The Benchmarking Assortment of Generated PMU Data for Power System Events	Oct 25, 2022	Benchmarking	Code Available	0
CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and Summarization	Oct 25, 2022	Abstractive Text Summarization, Benchmarking	Code Available	0
A Comparative Attention Framework for Better Few-Shot Object Detection on Aerial Images	Oct 25, 2022	Benchmarking, Few-Shot Object Detection	Code Available	1
Deep Crowd Anomaly Detection: State-of-the-Art, Challenges, and Future Research Directions	Oct 25, 2022	Anomaly Detection, Benchmarking	Unverified	0
What cleaves? Is proteasomal cleavage prediction reaching a ceiling?	Oct 24, 2022	Benchmarking, Denoising	Unverified	0
ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition	Oct 24, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1
SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural Networks	Oct 24, 2022	Benchmarking	Code Available	1
Benchmarking GPU and TPU Performance with Graph Neural Networks	Oct 21, 2022	Benchmarking, GPU	Unverified	0
Multi-scale data reconstruction of turbulent rotating flows with Gappy POD, Extended POD and Generative Adversarial Networks	Oct 21, 2022	Benchmarking, Generative Adversarial Network	Unverified	0
A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research Challenges	Oct 21, 2022	Benchmarking, Community Detection	Code Available	1
gSuite: A Flexible and Framework Independent Benchmark Suite for Graph Neural Network Inference on GPUs	Oct 20, 2022	Benchmarking, Computational Efficiency	Unverified	0
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control	Oct 20, 2022	Benchmarking, Data Augmentation	Code Available	1
LaMAR: Benchmarking Localization and Mapping for Augmented Reality	Oct 19, 2022	Benchmarking, Diversity	Code Available	2
Graphs, Constraints, and Search for the Abstraction and Reasoning Corpus	Oct 18, 2022	ARC, Benchmarking	Code Available	1
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations	Oct 17, 2022	Benchmarking, Text Classification	Code Available	1
FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks	Oct 17, 2022	Benchmarking, Graph Neural Network	Unverified	0
Conditional Neural Processes for Molecules	Oct 17, 2022	Bayesian Optimization, Benchmarking	Unverified	0
Sub-8-bit quantization for on-device speech recognition: a regularization-free approach	Oct 17, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents	Oct 17, 2022	Benchmarking, Joint Entity and Relation Extraction	Code Available	1
An Open-source Benchmark of Deep Learning Models for Audio-visual Apparent and Self-reported Personality Recognition	Oct 17, 2022	Benchmarking	Code Available	1
DyFEn: Agent-Based Fee Setting in Payment Channel Networks	Oct 15, 2022	Benchmarking, Deep Reinforcement Learning	Unverified	0
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments	Oct 14, 2022	Atari Games, Benchmarking	Code Available	1
A Comprehensive Study on Large-Scale Graph Training: Benchmarking and Rethinking	Oct 14, 2022	Benchmarking, GPU	Code Available	1
A Survey of Parameters Associated with the Quality of Benchmarks in NLP	Oct 14, 2022	Benchmarking	Unverified	0
TweetNERD -- End to End Entity Linking Benchmark for Tweets	Oct 14, 2022	Benchmarking, Entity Linking	Code Available	0
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling	Oct 14, 2022	Benchmarking, Language Modeling	Code Available	1
CORL: Research-oriented Deep Offline Reinforcement Learning Library	Oct 13, 2022	Benchmarking, D4RL	Code Available	3

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified