SOTAVerified

Benchmarking

Papers

Showing 54015450 of 5548 papers

TitleStatusHype
Deep learning for extracting protein-protein interactions from biomedical literature0
CRNN: A Joint Neural Network for Redundancy DetectionCode0
Discovering Visual Concept Structure with Sparse and Incomplete Tags0
Classification and Retrieval of Digital Pathology Scans: A New Dataset0
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning0
WebVision Challenge: Visual Learning and Understanding With Web Data0
Saliency Benchmarking Made Easy: Separating Models, Maps and Metrics0
Reconstructing antibody repertoires from error-prone immunosequencing datasets0
Computer Vision for Autonomous Vehicles: Problems, Datasets and State of the Art0
LibOPT: An Open-Source Platform for Fast Prototyping Soft Optimization TechniquesCode0
Embodied Artificial Intelligence through Distributed Adaptive Control: An Integrated Framework0
A Comparison of Directional Distances for Hand Pose Estimation0
Benchmarking Joint Lexical and Syntactic Analysis on Multiword-Rich Data0
A Characterization Study of Arabic Twitter Data with a Benchmarking for State-of-the-Art Opinion Mining Models0
A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages0
Efficient Benchmarking of NLP APIs using Multi-armed Bandits0
Configurable 3D Scene Synthesis and 2D Image Rendering with Per-Pixel Ground Truth using Stochastic Grammars0
Efficient Benchmarking of Algorithm Configuration Procedures via Model-Based Surrogates0
Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network0
Efficient Processing of Deep Neural Networks: A Tutorial and Survey0
Multitask learning and benchmarking with clinical time series dataCode1
Computer Aided Detection of Anemia-like Pallor0
A New Evaluation Protocol and Benchmarking Results for Extendable Cross-media Retrieval0
Meet Spinky: An Open-Source Spindle and K-Complex Detection Toolbox Validated on the Open-Access Montreal Archive of Sleep Studies (MASS).Code0
PMLB: A Large Benchmark Suite for Machine Learning Evaluation and ComparisonCode0
A Dataset for Developing and Benchmarking Active Vision0
Support Vector Machines and generalisation in HEP0
FERA 2017 - Addressing Head Pose in the Third Facial Expression Recognition and Analysis Challenge0
MORSE: Semantic-ally Drive-n MORpheme SEgment-er0
The biglasso Package: A Memory- and Computation-Efficient Solver for Lasso Model Fitting with Big Data in RCode0
Deep Learning Logo Detection with Data Expansion by Synthesising Context0
Jointly learning heterogeneous features for rgb-d activity recognition0
Multiple Instance Learning: A Survey of Problem Characteristics and ApplicationsCode0
pke: an open source python-based keyphrase extraction toolkitCode0
MS MARCO: A Human Generated MAchine Reading COmprehension DatasetCode1
Person Re-Identification by Unsupervised Video Matching0
'Part'ly first among equals: Semantic part-based benchmarking for state-of-the-art object recognition systems0
CMOS based image cytometry for detection of phytoplankton in ballast water0
The Freiburg Groceries DatasetCode0
Benchmarking inverse statistical approaches for protein structure and design with exactly solvable models0
Benchmarking Quantum Hardware for Training of Fully Visible Boltzmann Machines0
XCSP3: An Integrated Format for Benchmarking Combinatorial Constrained Problems0
A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation0
A Benchmark Dataset and Saliency-guided Stacked Autoencoders for Video-based Salient Object Detection0
Word Embeddings for the Construction DomainCode0
Portfolio Benchmarking under Drawdown Constraint and Stochastic Sharpe Ratio0
Term-Class-Max-Support (TCMS): A Simple Text Document Categorization Approach Using Term-Class Relevance Measure0
There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error CorrectionCode0
Technical Report on the CleverHans v2.1.0 Adversarial Examples LibraryCode0
Estimating transmission from genetic and epidemiological data: a metric to compare transmission trees0
Show:102550
← PrevPage 109 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified