SOTAVerified

Benchmarking

Papers

Showing 2326–2350 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Benchmarking YOLOv8 for Optimal Crack Detection in Civil Infrastructure | | 0 |
| AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs | | 0 |
| Benchmarking XAI Explanations with Human-Aligned Evaluations | | 0 |
| A critical look at the current train/test split in machine learning | | 0 |
| Forecasting NIFTY 50 benchmark Index using Seasonal ARIMA time series models | | 0 |
| FORLAPS: An Innovative Data-Driven Reinforcement Learning Approach for Prescriptive Process Monitoring | | 0 |
| Found in Translation: Measuring Multilingual LLM Consistency as Simple as Translate then Evaluate | | 0 |
| Benchmarking with MIMIC-IV, an irregular, spare clinical time series dataset | | 0 |
| A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval | | 0 |
| Alpha Excel Benchmark | | 0 |
| Benchmarking Waitlist Mortality Prediction in Heart Transplantation Through Time-to-Event Modeling using New Longitudinal UNOS Dataset | | 0 |
| Benchmarking VLMs' Reasoning About Persuasive Atypical Images | | 0 |
| A Bayesian Committee Machine Potential for Oxygen-containing Organic Compounds | | 0 |
| Benchmarking Visual-Inertial Deep Multimodal Fusion for Relative Pose Regression and Odometry-aided Absolute Pose Regression | | 0 |
| AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels | | 0 |
| Benchmarking Vision Language Models on German Factual Data | | 0 |
| Auto-tuning TensorFlow Threading Model for CPU Backend | | 0 |
| ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis | | 0 |
| Benchmarking Vision Language Models for Cultural Understanding | | 0 |
| ALP: Action-Aware Embodied Learning for Perception | | 0 |
| Autoregressive Stochastic Clock Jitter Compensation in Analog-to-Digital Converters | | 0 |
| A critical analysis of metrics used for measuring progress in artificial intelligence | | 0 |
| Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving | | 0 |
| Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments | | 0 |
| Benchmarking Video Frame Interpolation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |