Benchmarking

Papers

Showing 5301–5325 of 5548 papers

Title	Date	Tasks	Status
Tackling the Story Ending Biases in The Story Cloze Test	Jul 1, 2018	Benchmarking, Cloze Test	Unverified
NEWS 2018 Whitepaper	Jul 1, 2018	Benchmarking, Machine Translation	Unverified
Benchmarking the Hill-Valley Evolutionary Algorithm for the GECCO 2018 Competition on Niching Methods Multimodal Optimization	Jun 30, 2018	Benchmarking	Code Available
Hyperspectral Image Dataset for Benchmarking on Salient Object Detection	Jun 29, 2018	Benchmarking, Object	Code Available
Learning a Saliency Evaluation Metric Using Crowdsourced Perceptual Judgments	Jun 27, 2018	Benchmarking	Unverified
Person Re-Identification in Identity Regression Space	Jun 25, 2018	Benchmarking, Incremental Learning	Unverified
End-to-End Neural Ranking for eCommerce Product Search: an application of task models and textual embeddings	Jun 19, 2018	Benchmarking	Unverified
The Neural Painter: Multi-Turn Image Generation	Jun 16, 2018	Benchmarking, Conditional Image Generation	Unverified
Real-time cryo-EM data pre-processing with Warp	Jun 14, 2018	Benchmarking, Image Reconstruction	Code Available
Benchmarking Evolutionary Algorithms For Single Objective Real-valued Constrained Optimization - A Critical Review	Jun 12, 2018	Benchmarking, Evolutionary Algorithms	Unverified
Deep Reinforcement Learning for General Video Game AI	Jun 6, 2018	Atari Games, Benchmarking	Code Available
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark	Jun 4, 2018	Benchmarking, BIG-bench Machine Learning	Unverified
Adversarial Reinforcement Learning Framework for Benchmarking Collision Avoidance Mechanisms in Autonomous Vehicles	Jun 4, 2018	Autonomous Navigation, Autonomous Vehicles	Unverified
A Dataset for Web-Scale Knowledge Base Population	Jun 3, 2018	Benchmarking, Knowledge Base Population	Code Available
CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization	Jun 1, 2018	Benchmarking, geo-localization	Code Available
A Report on the 2018 VUA Metaphor Detection Shared Task	Jun 1, 2018	Benchmarking	Unverified
Syntactically Aware Neural Architectures for Definition Extraction	Jun 1, 2018	Benchmarking, Binary Classification	Unverified
NengoDL: Combining deep learning and neuromorphic modelling methods	May 28, 2018	Benchmarking, Deep Learning	Code Available
Quantum classification of the MNIST dataset with Slow Feature Analysis	May 22, 2018	Benchmarking, Classification	Unverified
Simulation of Large Scale Neural Networks for Evaluation Applications	May 20, 2018	Benchmarking	Unverified
The EuroCity Persons Dataset: A Novel Benchmark for Object Detection	May 18, 2018	Benchmarking, Object	Unverified
Deep Nets: What have they ever done for Vision?	May 10, 2018	Benchmarking	Unverified
Comparative evaluation of instrument segmentation and tracking methods in minimally invasive surgery	May 7, 2018	Benchmarking, Segmentation	Unverified
Resource Interoperability for Sustainable Benchmarking: The Case of Events	May 1, 2018	Benchmarking	Code Available
Le benchmarking de la reconnaissance d'entités nommées pour le fran (Benchmarking for French NER)	May 1, 2018	Benchmarking, NER	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified