Benchmarking

Papers

Title	Date	Tasks	Status	Hype
D2S: Document-to-Slide Generation Via Query-Based Text Summarization	May 8, 2021	Benchmarking, Long Form Question Answering	Code Available	1
Open Radar Initiative: Large Scale Dataset for Benchmarking of micro-Doppler Recognition Algorithms	May 7, 2021	Benchmarking	Code Available	1
dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing	Apr 27, 2021	Benchmarking, Retrieval	Code Available	1
2.5D Visual Relationship Detection	Apr 26, 2021	Benchmarking, Depth Estimation	Code Available	1
Knodle: Modular Weakly Supervised Learning with PyTorch	Apr 23, 2021	Benchmarking, BIG-bench Machine Learning	Code Available	1
Data Generating Process to Evaluate Causal Discovery Techniques for Time Series Data	Apr 16, 2021	Benchmarking, Causal Discovery	Code Available	1
Towards Standardising Reinforcement Learning Approaches for Production Scheduling Problems	Apr 16, 2021	Benchmarking, reinforcement-learning	Code Available	1
Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability	Apr 14, 2021	Benchmarking, Link Prediction	Code Available	1
Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm Optimization	Apr 13, 2021	Benchmarking, Metaheuristic Optimization	Code Available	1
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer	Apr 12, 2021	Benchmarking, Sentence	Code Available	1
Robust Semantic Interpretability: Revisiting Concept Activation Vectors	Apr 6, 2021	Benchmarking, counterfactual	Code Available	1
CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs	Apr 5, 2021	Benchmarking, Knowledge Graphs	Code Available	1
Remote Sensing Image Classification with the SEN12MS Dataset	Apr 1, 2021	Benchmarking, Classification	Code Available	1
Simultaneous Navigation and Construction Benchmarking Environments	Mar 31, 2021	Benchmarking, Deep Reinforcement Learning	Code Available	1
Benchmarks for Deep Off-Policy Evaluation	Mar 30, 2021	Benchmarking, continuous-control	Code Available	1
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding	Mar 30, 2021	Affordance Detection, Benchmarking	Code Available	1
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events	Mar 29, 2021	Autonomous Vehicles, Benchmarking	Code Available	1
Marine Snow Removal Benchmarking Dataset	Mar 26, 2021	Benchmarking, Sand	Code Available	1
Learning to Optimize: A Primer and A Benchmark	Mar 23, 2021	Benchmarking	Code Available	1
Neural Multi-Hop Reasoning With Logical Rules on Biomedical Knowledge Graphs	Mar 18, 2021	Benchmarking, Knowledge Graphs	Code Available	1
SHARP: Environment and Person Independent Activity Recognition with Commodity IEEE 802.11 Access Points	Mar 17, 2021	Activity Recognition, Benchmarking	Code Available	1
A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character Recognition	Mar 16, 2021	Benchmarking, Position	Code Available	1
The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation	Mar 15, 2021	Benchmarking, Domain Adaptation	Code Available	1
Recent Advances on Neural Network Pruning at Initialization	Mar 11, 2021	Benchmarking, Network Pruning	Code Available	1
A Computed Tomography Vertebral Segmentation Dataset with Anatomical Variations and Multi-Vendor Scanner Data	Mar 10, 2021	Anatomy, Benchmarking	Code Available	1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified