SOTAVerified

Benchmarking

Papers

Showing 38013850 of 5548 papers

TitleStatusHype
Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge0
Towards Visual Text Grounding of Multimodal Large Language Model0
Near-Term Quantum Computing Techniques: Variational Quantum Algorithms, Error Mitigation, Circuit Compilation, Benchmarking and Classical Simulation0
Benchmarking deep generative models for diverse antibody sequence design0
Benchmarking Deep Facial Expression Recognition: An Extensive Protocol with Balanced Dataset in the Wild0
Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models0
NeIn: Telling What You Don't Want0
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks0
TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning0
Benchmarking Deep AUROC Optimization: Loss Functions and Algorithmic Choices0
Benchmarking Deepart Detection0
Benchmarking Decoupled Neural Interfaces with Synthetic Gradients0
NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods0
Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches0
Benchmarking data encoding methods in Quantum Machine Learning0
Adaptive Epidemic Forecasting and Community Risk Evaluation of COVID-190
Hyperparameter optimization with REINFORCE and Transformers0
Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation0
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models0
Benchmarking Data-driven Automatic Text Simplification for German0
Neural Network Approach for Non-Markovian Dissipative Dynamics of Many-Body Open Quantum Systems0
Tracking Everything in Robotic-Assisted Surgery0
GIM: Gaussian Isolation Machines0
Neural Networks for Fast Optimisation in Model Predictive Control: A Review0
Benchmarking Cross-Domain Audio-Visual Deception Detection0
Benchmarking Counterfactual Interpretability in Deep Learning Models for Time Series Classification0
Neural Text Generation: Past, Present and Beyond0
Benchmarking Convolutional Neural Network and Graph Neural Network based Surrogate Models on a Real-World Car External Aerodynamics Dataset0
Benchmarking Conventional Vision Models on Neuromorphic Fall Detection and Action Recognition Dataset0
Benchmarking Conventional and Learned Video Codecs with a Low-Delay Configuration0
Adaptive Deep Kernel Learning0
Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network0
Towards Self-adaptive Mutation in Evolutionary Multi-Objective Algorithms0
Adaptive Control of an Inverted Pendulum by a Reinforcement Learning-based LQR Method0
Benchmarking Continual Learning from Cognitive Perspectives0
Training Mixed-Domain Translation Models via Federated Learning0
New Loss Functions for Fast Maximum Inner Product Search0
NEWS 2018 Whitepaper0
Benchmarking Constraint-Based Bayesian Structure Learning Algorithms: Role of Network Topology0
Training neural mapping schemes for satellite altimetry with simulation data0
NEWTS: A Corpus for News Topic-Focused Summarization0
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction0
Next-generation MRD assays: do we have the tools to evaluate them properly?0
Benchmarking confound regression strategies for the control of motion artifact in studies of functional connectivity0
NL2KQL: From Natural Language to Kusto Query0
Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E50
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise0
NLPre: a revised approach towards language-centric benchmarking of Natural Language Preprocessing systems0
A CUDA-Based Real Parameter Optimization Benchmark0
Benchmarking Collaborative Learning Methods Cost-Effectiveness for Prostate Segmentation0
Show:102550
← PrevPage 77 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified