SOTAVerified

Benchmarking

Papers

Showing 55015548 of 5548 papers

TitleStatusHype
Using PCA to Efficiently Represent State Spaces0
Benchmarking SMT Performance for Farsi Using the TEP++ Corpus0
A Collection of Challenging Optimization Problems in Science, Engineering and Economics0
Totally Corrective Boosting with Cardinality Penalization0
Energy Management in Storage-Augmented, Grid-Connected Prosumer Buildings and Neighbourhoods Using a Modified Simulated Annealing Optimization0
Benchmarking NLopt and state-of-art algorithms for Continuous Global Optimization via Hybrid IACO_R0
A Meta-Analysis of the Anomaly Detection ProblemCode0
Influence-Optimistic Local Values for Multiagent Planning --- Extended Version0
Fast, approximate kinetics of RNA folding0
A Dataset for Movie Description0
Salient Object Detection: A Benchmark0
CIDEr: Consensus-based Image Description EvaluationCode1
Enhanced Multiobjective Evolutionary Algorithm based on Decomposition for Solving the Unit Commitment Problem0
Introducing SLAMBench, a performance and accuracy benchmarking methodology for SLAMCode0
A Wild Bootstrap for Degenerate Kernel TestsCode0
Designing labeled graph classifiers by exploiting the Rényi entropy of the dissimilarity representation0
Microtask crowdsourcing for disease mention annotation in PubMed abstracts0
The ACL RD-TEC: A Dataset for Benchmarking Terminology Extraction and Classification in Computational Linguistics0
Automated Machine Learning on Big Data using Stochastic Algorithm Tuning0
A CUDA-Based Real Parameter Optimization Benchmark0
Entropic one-class classifiers0
Benchmarking Named Entity Disambiguation approaches for Streaming Graphs0
Projective simulation applied to the grid-world and the mountain-car problem0
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web0
Overview of Todai Robot Project and Evaluation Framework of its NLP-based Problem Solving0
Discosuite - A parser test suite for German discontinuous structures0
Benchmarking Twitter Sentiment Analysis Tools0
Benchmarking of English-Hindi parallel corpora0
Household Electricity Demand Forecasting -- Benchmarking State-of-the-Art Methods0
MCL-3D: a database for stereoscopic image quality assessment using 2D-image-plus-depth source0
Fast and accurate alignment of long bisulfite-seq readsCode0
Solver Scheduling via Answer Set Programming0
Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-LearnCode0
Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing for Linking Identities0
Discriminative Link Prediction using Local Links, Node Features and Community Structure0
Joint multi-person detection and tracking from overlapping cameras0
Hollywood 3D: Recognizing Actions in 3D Natural Scenes0
A Lazy Man's Approach to Benchmarking: Semisupervised Classifier Evaluation and Recalibration0
Boundary Detection Benchmarking: Beyond F-Measures0
The Expressive Power of Word Embeddings0
The Arcade Learning Environment: An Evaluation Platform for General AgentsCode0
Introducing a new benchmarked dataset for activity monitoring0
Parsing Any Domain English text to CoNLL dependencies0
Creating a Data Collection for Evaluating Rich Speech Retrieval0
Fast Labeling and Transcription with the Speechalyzer Toolkit0
Feature Selection and Classification of Hyperspectral Images With Support Vector Machines0
The DLV System for Knowledge Representation and Reasoning0
Building a Scalable and Interpretable Bayesian Deep Learning Framework for Quality Control of Free Form SurfacesCode1
Show:102550
← PrevPage 111 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified