SOTAVerified

Benchmarking

Papers

Showing 31013125 of 5548 papers

TitleStatusHype
Benchmarking Sample Selection Strategies for Batch Reinforcement Learning0
InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset0
InterLoc: LiDAR-based Intersection Localization using Road Segmentation with Automated Evaluation Method0
InternalInspector I^2: Robust Confidence Estimation in LLMs through Internal States0
Interpretable Feature Construction for Time Series Extrinsic Regression0
Interpretable graph-based models on multimodal biomedical data integration: A technical review and benchmarking0
Interpretable machine learning applied to on-farm biosecurity and porcine reproductive and respiratory syndrome virus0
Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation0
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition0
The Role of Local Intrinsic Dimensionality in Benchmarking Nearest Neighbor Search0
Benchmarking Robustness of Deep Reinforcement Learning approaches to Online Portfolio Management0
Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation0
Intrinsic uncertainties and where to find them0
Introducing a new benchmarked dataset for activity monitoring0
Introducing CausalBench: A Flexible Benchmark Framework for Causal Analysis and Machine Learning0
Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval0
Introducing RezoJDM16k: a French KnowledgeGraph DataSet for Link Prediction0
7th AI Driving Olympics: 1st Place Report for Panoptic Tracking0
Benchmarking Robustness of AI-Enabled Multi-sensor Fusion Systems: Challenges and Opportunities0
Introduction to Voice Presentation Attack Detection and Recent Advances0
Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts0
InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences0
A Framework for Benchmarking and Aligning Task-Planning Safety in LLM-Based Embodied Agents0
Investigating Deep-Learning NLP for Automating the Extraction of Oncology Efficacy Endpoints from Scientific Literature0
Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings0
Show:102550
← PrevPage 125 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified