SOTAVerified

Benchmarking

Papers

Showing 32763300 of 5548 papers

TitleStatusHype
Large-scale Benchmarking of Metaphor-based Optimization Heuristics0
Benchmarking off-the-shelf statistical shape modeling tools in clinical applications0
Benchmarking Off-The-Shelf Solutions to Robotic Assembly Tasks0
Large-Scale Quantum Separability Through a Reproducible Machine Learning Lens0
Timing Excess Returns A cross-universe approach to alpha0
Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics0
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation0
Latent Variable Models for Visual Question Answering0
TinyML Platforms Benchmarking0
LAVIS: A Library for Language-Vision Intelligence0
Benchmarking of English-Hindi parallel corpora0
Benchmarking of eight recurrent neural network variants for breath phase and adventitious sound detection on a self-developed open-access lung sound database-HF_Lung_V10
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents0
Benchmarking of Different YOLO Models for CAPTCHAs Detection and Classification0
LCFO: Long Context and Long Form Output Dataset and Benchmarking0
Benchmarking of Deep Learning models on 2D Laminar Flow behind Cylinder0
LEAF: A Benchmark for Federated Settings0
Leaf Segmentation and Counting with Deep Learning: on Model Certainty, Test-Time Augmentation, Trade-Offs0
Labelling Vertebrae with 2D Reformations of Multidetector CT Images: An Adversarial Approach for Incorporating Prior Knowledge of Spine Anatomy0
Adversarial Learning for Supervised and Semi-supervised Relation Extraction in Biomedical Literature0
Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset0
TituLLMs: A Family of Bangla LLMs with Comprehensive Benchmarking0
Primender Sequence: A Novel Mathematical Construct for Testing Symbolic Inference and AI Reasoning0
Learning a CNN-based End-to-End Controller for a Formula SAE Racecar0
tmVar 3.0: an improved variant concept recognition and normalization tool0
Show:102550
← PrevPage 132 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified