SOTAVerified

Benchmarking

Papers

Showing 30513100 of 5548 papers

TitleStatusHype
ImputeGAP: A Comprehensive Library for Time Series Imputation0
Benchmarking Table Comprehension In The Wild0
InAttention: Linear Context Scaling for Transformers0
Inaugural MOASEI Competition at AAMAS'2025: A Technical Report0
INCLUSIFY: A benchmark and a model for gender-inclusive German0
The Partial Response Network: a neural network nomogram0
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding0
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages0
IndicSTR12: A Dataset for Indic Scene Text Recognition0
Benchmarking Systematic Relational Reasoning with Large Language and Reasoning Models0
A framework for benchmarking uncertainty in deep regression0
Individual Treatment Effect Estimation Through Controlled Neural Network Training in Two Stages0
The Pitfalls of Benchmarking in Algorithm Selection: What We Are Getting Wrong0
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP0
Benchmarking symbolic regression constant optimization schemes0
Benchmarking Surrogate-Assisted Genetic Recommender Systems0
Benchmarking Super-Resolution Algorithms on Real Data0
Influence-Optimistic Local Values for Multiagent Planning --- Extended Version0
InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation0
Benchmarking Sub-Genre Classification For Mainstage Dance Music0
InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference0
InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management0
Benchmarking state-of-the-art gradient boosting algorithms for classification0
Benchmarking State-of-the-Art Deep Learning Software Tools0
Benchmarking Spiking Neural Network Learning Methods with Varying Locality0
Benchmarking sparse system identification with low-dimensional chaos0
InLUT3D: Challenging real indoor dataset for point cloud analysis0
A Framework for Benchmarking Real-Time Embedded Object Detection0
Benchmarking SMT Performance for Farsi Using the TEP++ Corpus0
Benchmarking Smoothness and Reducing High-Frequency Oscillations in Continuous Control Policies0
In-situ process monitoring and adaptive quality enhancement in laser additive manufacturing: a critical review0
Benchmarking Single-Image Reflection Removal Algorithms0
The Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design0
InstructEval: Systematic Evaluation of Instruction Selection Methods0
Benchmarking simulated and physical quantum processing units using quantum and hybrid algorithms0
Benchmarking Shadow Removal for Facial Landmark Detection and Beyond0
Benchmarking Sensitivity of Continual Graph Learning for Skeleton-Based Action Recognition0
Integrated Benchmarking and Design for Reproducible and Accessible Evaluation of Robotic Agents0
Integrated Sensing and Communication enabled Multiple Base Stations Cooperative UAV Detection0
Integrated Super-resolution Sensing and Symbiotic Communication with 3D Sparse MIMO for Low-Altitude UAV Swarm0
Integrating Dynamic Correlation Shifts and Weighted Benchmarking in Extreme Value Analysis0
Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation0
Thermal Image-based Fault Diagnosis in Induction Machines via Self-Organized Operational Neural Networks0
Integration of Regularized l1 Tracking and Instance Segmentation for Video Object Tracking0
Intelligence at the Extreme Edge: A Survey on Reformable TinyML0
Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method0
A Large-Scale Analysis on Self-Supervised Video Representation Learning0
InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation0
Benchmarking Scientific Image Forgery Detectors0
Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam0
Show:102550
← PrevPage 62 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified