SOTAVerified

Benchmarking

Papers

Showing 3401–3425 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data | | 0 |
| TOTOPO: Classifying univariate and multivariate time series with Topological Data Analysis | | 0 |
| LMFormer: Lane based Motion Prediction Transformer | | 0 |
| Benchmarking Modern Named Entity Recognition Techniques for Free-text Health Record De-identification | | 0 |
| LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs | | 0 |
| LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models | | 0 |
| Load-independent Metrics for Benchmarking Force Controllers | | 0 |
| Benchmarking Mobile Device Control Agents across Diverse Configurations | | 0 |
| Local Data Quantity-Aware Weighted Averaging for Federated Learning with Dishonest Clients | | 0 |
| XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis | | 0 |
| Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework | | 0 |
| Benchmarking Middle-Trained Language Models for Neural Search | | 0 |
| Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture | | 0 |
| Logically at Factify 2022: Multimodal Fact Verification | | 0 |
| Toward an ImageNet Library of Functions for Global Optimization Benchmarking | | 0 |
| Benchmarking Meta-heuristic Optimization | | 0 |
| Brittle Minds, Fixable Activations: Understanding Belief Representations in Language Models | | 0 |
| Toward end-to-end interpretable convolutional neural networks for waveform signals | | 0 |
| Benchmarking MedMNIST dataset on real quantum hardware | | 0 |
| Benchmarking Machine Translated Sentiment Analysis for Arabic Tweets | | 0 |
| Benchmarking Continuous Time Models for Predicting Multiple Sclerosis Progression | | 0 |
| Benchmarking Machine Learning Robustness in Covid-19 Spike Sequence Classification | | 0 |
| Benchmarking Machine Learning Models to Predict Corporate Bankruptcy | | 0 |
| LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | | 0 |
| Long Range Arena: A Benchmark for Efficient Transformers | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |