Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5501–5548 of 5548 papers

Title	Date	Tasks	Status	Hype
Using PCA to Efficiently Represent State Spaces	May 2, 2015	BenchmarkingDimensionality Reduction	—Unverified	0
Benchmarking SMT Performance for Farsi Using the TEP++ Corpus	May 1, 2015	BenchmarkingMachine Translation	—Unverified	0
A Collection of Challenging Optimization Problems in Science, Engineering and Economics	Apr 9, 2015	Benchmarking	—Unverified	0
Totally Corrective Boosting with Cardinality Penalization	Apr 7, 2015	BenchmarkingCombinatorial Optimization	—Unverified	0
Energy Management in Storage-Augmented, Grid-Connected Prosumer Buildings and Neighbourhoods Using a Modified Simulated Annealing Optimization	Mar 28, 2015	Benchmarkingenergy management	—Unverified	0
Benchmarking NLopt and state-of-art algorithms for Continuous Global Optimization via Hybrid IACO_R	Mar 11, 2015	Benchmarkingglobal-optimization	—Unverified	0
A Meta-Analysis of the Anomaly Detection Problem	Mar 3, 2015	Anomaly DetectionBenchmarking	CodeCode Available	0
Influence-Optimistic Local Values for Multiagent Planning --- Extended Version	Feb 18, 2015	BenchmarkingHeuristic Search	—Unverified	0
Fast, approximate kinetics of RNA folding	Jan 19, 2015	Benchmarking	—Unverified	0
A Dataset for Movie Description	Jan 12, 2015	BenchmarkingDescriptive	—Unverified	0
Salient Object Detection: A Benchmark	Jan 5, 2015	BenchmarkingObject	—Unverified	0
CIDEr: Consensus-based Image Description Evaluation	Nov 20, 2014	Action RecognitionAttribute	CodeCode Available	1
Enhanced Multiobjective Evolutionary Algorithm based on Decomposition for Solving the Unit Commitment Problem	Oct 16, 2014	Benchmarking	—Unverified	0
Introducing SLAMBench, a performance and accuracy benchmarking methodology for SLAM	Oct 8, 2014	Benchmarking	CodeCode Available	0
A Wild Bootstrap for Degenerate Kernel Tests	Aug 23, 2014	BenchmarkingTime Series	CodeCode Available	0
Designing labeled graph classifiers by exploiting the Rényi entropy of the dissimilarity representation	Aug 22, 2014	BenchmarkingClustering	—Unverified	0
Microtask crowdsourcing for disease mention annotation in PubMed abstracts	Aug 8, 2014	Benchmarking	—Unverified	0
The ACL RD-TEC: A Dataset for Benchmarking Terminology Extraction and Classification in Computational Linguistics	Aug 1, 2014	BenchmarkingGeneral Classification	—Unverified	0
Automated Machine Learning on Big Data using Stochastic Algorithm Tuning	Jul 30, 2014	Bayesian OptimisationBenchmarking	—Unverified	0
A CUDA-Based Real Parameter Optimization Benchmark	Jul 29, 2014	BenchmarkingCPU	—Unverified	0
Entropic one-class classifiers	Jul 28, 2014	Anomaly DetectionBenchmarking	—Unverified	0
Benchmarking Named Entity Disambiguation approaches for Streaming Graphs	Jul 14, 2014	BenchmarkingEntity Disambiguation	—Unverified	0
Projective simulation applied to the grid-world and the mountain-car problem	May 21, 2014	Benchmarkingreinforcement-learning	—Unverified	0
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web	May 1, 2014	BenchmarkingEntity Linking	—Unverified	0
Overview of Todai Robot Project and Evaluation Framework of its NLP-based Problem Solving	May 1, 2014	Benchmarking	—Unverified	0
Discosuite - A parser test suite for German discontinuous structures	May 1, 2014	BenchmarkingConstituency Parsing	—Unverified	0
Benchmarking Twitter Sentiment Analysis Tools	May 1, 2014	BenchmarkingDecision Making	—Unverified	0
Benchmarking of English-Hindi parallel corpora	May 1, 2014	BenchmarkingMachine Translation	—Unverified	0
Household Electricity Demand Forecasting -- Benchmarking State-of-the-Art Methods	Apr 1, 2014	BenchmarkingDemand Forecasting	—Unverified	0
MCL-3D: a database for stereoscopic image quality assessment using 2D-image-plus-depth source	Mar 23, 2014	BenchmarkingImage Quality Assessment	—Unverified	0
Fast and accurate alignment of long bisulfite-seq reads	Jan 6, 2014	Benchmarking	CodeCode Available	0
Solver Scheduling via Answer Set Programming	Jan 6, 2014	BenchmarkingScheduling	—Unverified	0
Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-Learn	Jan 1, 2014	AutoMLBenchmarking	CodeCode Available	0
Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing for Linking Identities	Oct 24, 2013	Benchmarking	—Unverified	0
Discriminative Link Prediction using Local Links, Node Features and Community Structure	Oct 17, 2013	BenchmarkingClustering	—Unverified	0
Joint multi-person detection and tracking from overlapping cameras	Jun 23, 2013	BenchmarkingHuman Detection	—Unverified	0
Hollywood 3D: Recognizing Actions in 3D Natural Scenes	Jun 1, 2013	Action RecognitionBenchmarking	—Unverified	0
A Lazy Man's Approach to Benchmarking: Semisupervised Classifier Evaluation and Recalibration	Jun 1, 2013	Benchmarking	—Unverified	0
Boundary Detection Benchmarking: Beyond F-Measures	Jun 1, 2013	BenchmarkingBoundary Detection	—Unverified	0
The Expressive Power of Word Embeddings	Jan 15, 2013	BenchmarkingSentence	—Unverified	0
The Arcade Learning Environment: An Evaluation Platform for General Agents	Jul 19, 2012	Atari GamesBenchmarking	CodeCode Available	0
Introducing a new benchmarked dataset for activity monitoring	Jun 18, 2012	BenchmarkingClassification	—Unverified	0
Parsing Any Domain English text to CoNLL dependencies	May 1, 2012	BenchmarkingDependency Parsing	—Unverified	0
Creating a Data Collection for Evaluating Rich Speech Retrieval	May 1, 2012	BenchmarkingRetrieval	—Unverified	0
Fast Labeling and Transcription with the Speechalyzer Toolkit	May 1, 2012	Audio ClassificationBenchmarking	—Unverified	0
Feature Selection and Classification of Hyperspectral Images With Support Vector Machines	Oct 15, 2007	BenchmarkingClassification	—Unverified	0
The DLV System for Knowledge Representation and Reasoning	Nov 4, 2002	Benchmarking	—Unverified	0
Building a Scalable and Interpretable Bayesian Deep Learning Framework for Quality Control of Free Form Surfaces	Apr 7, 1994	Active LearningBenchmarking	CodeCode Available	1

Show:10 25 50

← PrevPage 111 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified