SOTAVerified

Benchmarking

Papers

Showing 44014425 of 5548 papers

TitleStatusHype
LAVIS: A Library for Language-Vision Intelligence0
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents0
LCFO: Long Context and Long Form Output Dataset and Benchmarking0
LEAF: A Benchmark for Federated Settings0
Leaf Segmentation and Counting with Deep Learning: on Model Certainty, Test-Time Augmentation, Trade-Offs0
Learning a CNN-based End-to-End Controller for a Formula SAE Racecar0
Learning a quantum computer's capability0
Learning a Representation with the Block-Diagonal Structure for Pattern Classification0
Learning a Saliency Evaluation Metric Using Crowdsourced Perceptual Judgments0
Learning Best Paths in Quantum Networks0
Learning Disentangled Audio Representations through Controlled Synthesis0
Learning Disentangled Speech Representations0
LABCAT: Locally adaptive Bayesian optimization using principal-component-aligned trust regionsCode0
SCoRE: Benchmarking Long-Chain Reasoning in Commonsense ScenariosCode0
Benchmark data and method for real-time people counting in cluttered scenes using depth sensorsCode0
Reassessing Layer Pruning in LLMs: New Insights and MethodsCode0
LaCViT: A Label-aware Contrastive Fine-tuning Framework for Vision TransformersCode0
Re-Benchmarking Pool-Based Active Learning for Binary ClassificationCode0
Knowledge Enhanced Conditional Imputation for Healthcare Time-seriesCode0
Selecting the motion ground truth for loose-fitting wearables: benchmarking optical MoCap methodsCode0
Knowledge-Driven Slot Constraints for Goal-Oriented Dialogue SystemsCode0
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM PipelinesCode0
Causality-enhanced Decision-Making for Autonomous Mobile Robots in Dynamic EnvironmentsCode0
Capsule Vision 2024 Challenge: Multi-Class Abnormality Classification for Video Capsule EndoscopyCode0
Language-based Image Colorization: A Benchmark and BeyondCode0
Show:102550
← PrevPage 177 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified