SOTAVerified

Benchmarking

Papers

Showing 42764300 of 5548 papers

TitleStatusHype
AutoLay: Benchmarking amodal layout estimation for autonomous driving0
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case0
Python Random Graph Generator0
Q2SAR: A Quantum Multiple Kernel Learning Approach for Drug Discovery0
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs0
AutoAI-TS: AutoAI for Time Series Forecasting0
QDA^2: A principled approach to automatically annotating charge stability diagrams0
A Universal Protocol to Benchmark Camera Calibration for Sports0
A Unified Taylor Framework for Revisiting Attribution Methods0
A Complementarity Analysis of the COCO Benchmark Problems and Artificially Generated Problems0
QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges0
A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation0
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning0
QSAM-Net: Rain streak removal by quaternion neural network with self-attention module0
Decoding Intelligence: A Framework for Certifying Knowledge Comprehension in LLMs0
QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation0
Unbounded Bayesian Optimization via Regularization0
Qualitative Insights Tool (QualIT): LLM Enhanced Topic Modeling0
Quality Assessment of Low Light Restored Images: A Subjective Study and an Unsupervised Model0
Quality Assured: Rethinking Annotation Strategies in Imaging AI0
Quality at the Tail of Machine Learning Inference0
Uncertainty estimation for Cross-dataset performance in Trajectory prediction0
A Unified Study of Machine Learning Explanation Evaluation Metrics0
QuantBench: Benchmarking AI Methods for Quantitative Investment0
Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling0
Show:102550
← PrevPage 172 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified