SOTAVerified

Benchmarking

Papers

Showing 33013325 of 5548 papers

TitleStatusHype
LOOPE: Learnable Optimal Patch Order in Positional Embeddings for Vision Transformers0
Optimizing with Low Budgets: a Comparison on the Black-box Optimization Benchmarking Suite and OpenAI Gym0
Low-Density 3D Point Cloud Classification0
Low Dynamic Range for RIS-aided Bistatic Integrated Sensing and Communication0
Low-resource Neural Machine Translation: Benchmarking State-of-the-art Transformer for Wolof<->French0
LSTM-based Whisper Detection0
LucidDreaming: Controllable Object-Centric 3D Generation0
LUND-PROBE -- LUND Prostate Radiotherapy Open Benchmarking and Evaluation dataset0
M3Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes0
MA-BBOB: A Problem Generator for Black-Box Optimization Using Affine Combinations and Shifts0
MA-BBOB: Many-Affine Combinations of BBOB Functions for Evaluating AutoML Approaches in Noiseless Numerical Black-Box Optimization Contexts0
Machine Generated Product Advertisements: Benchmarking LLMs Against Human Performance0
Machine Learning-Based Analysis of ECG and PCG Signals for Rheumatic Heart Disease Detection: A Scoping Review (2015-2025)0
Machine Learning for Identifying Grain Boundaries in Scanning Electron Microscopy (SEM) Images of Nanoparticle Superlattices0
Machine learning for modelling unstructured grid data in computational physics: a review0
Machine Learning for Ranking f-wave Extraction Methods in Single-Lead ECGs0
Uncertainty estimation of machine learning spatial precipitation predictions from satellite data0
Machine Vision based Sample-Tube Localization for Mars Sample Return0
Making Sense of Data in the Wild: Data Analysis Automation at Scale0
OrionBench: Benchmarking Time Series Generative Models in the Service of the End-User0
ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation0
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects0
Manual Verbalizer Enrichment for Few-Shot Text Classification0
Mapping global dynamics of benchmark creation and saturation in artificial intelligence0
Mapping Violence: Developing an Extensive Framework to Build a Bangla Sectarian Expression Dataset from Social Media Interactions0
Show:102550
← PrevPage 133 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified