SOTAVerified

Benchmarking

Papers

Showing 38013825 of 5548 papers

TitleStatusHype
Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge0
Towards Visual Text Grounding of Multimodal Large Language Model0
Near-Term Quantum Computing Techniques: Variational Quantum Algorithms, Error Mitigation, Circuit Compilation, Benchmarking and Classical Simulation0
Benchmarking deep generative models for diverse antibody sequence design0
Benchmarking Deep Facial Expression Recognition: An Extensive Protocol with Balanced Dataset in the Wild0
Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models0
NeIn: Telling What You Don't Want0
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks0
TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning0
Benchmarking Deep AUROC Optimization: Loss Functions and Algorithmic Choices0
Benchmarking Deepart Detection0
Benchmarking Decoupled Neural Interfaces with Synthetic Gradients0
NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods0
Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches0
Benchmarking data encoding methods in Quantum Machine Learning0
Adaptive Epidemic Forecasting and Community Risk Evaluation of COVID-190
Hyperparameter optimization with REINFORCE and Transformers0
Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation0
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models0
Benchmarking Data-driven Automatic Text Simplification for German0
Neural Network Approach for Non-Markovian Dissipative Dynamics of Many-Body Open Quantum Systems0
Tracking Everything in Robotic-Assisted Surgery0
GIM: Gaussian Isolation Machines0
Neural Networks for Fast Optimisation in Model Predictive Control: A Review0
Benchmarking Cross-Domain Audio-Visual Deception Detection0
Show:102550
← PrevPage 153 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified