SOTAVerified

Benchmarking

Papers

Showing 42264250 of 5548 papers

TitleStatusHype
Privacy Protection in Street-View Panoramas using Depth and Multi-View Imagery0
Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs0
Probabilistic Robustness in Deep Learning: A Concise yet Comprehensive Guide0
ProBench: Benchmarking Large Language Models in Competitive Programming0
UCLID-Net: Single View Reconstruction in Object Space0
UDTIRI: An Online Open-Source Intelligent Road Inspection Benchmark Suite0
A Comprehensive Multi-Illuminant Dataset for Benchmarking of the Intrinsic Image Algorithms0
Automatic vehicle trajectory data reconstruction at scale0
Problem-solving benefits of down-sampled lexicase selection0
Automatic Target Recognition on Synthetic Aperture Radar Imagery: A Survey0
Procedural Content Generation: Better Benchmarks for Transfer Reinforcement Learning0
Procedural Generalization by Planning with Self-Supervised World Models0
UGSL: A Unified Framework for Benchmarking Graph Structure Learning0
ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions0
Profit: Benchmarking Personalization and Robustness Trade-off in Federated Prompt Tuning0
Progressive Class-level Distillation0
Progressive Multi-view Human Mesh Recovery with Self-Supervision0
Progressive with Purpose: Guiding Progressive Inpainting DNNs through Context and Structure0
Projective simulation applied to the grid-world and the mountain-car problem0
Project MPG: towards a generalized performance benchmark for LLM capabilities0
Automatic segmenting teeth in X-ray images: Trends, a novel data set, benchmarking and future perspectives0
Prompting ChatGPT for Chinese Learning as L2: A CEFR and EBCL Level Study0
Prompting Scientific Names for Zero-Shot Species Recognition0
Automatic Microprocessor Performance Bug Detection0
Prompt Sketching for Large Language Models0
Show:102550
← PrevPage 170 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified