SOTAVerified

Benchmarking

Papers

Showing 4961–4970 of 5548 papers

Title | Status | Hype
A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems | Code | 0
SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Code | 0
Separating form and meaning: Using self-consistency to quantify task understanding across multiple senses | Code | 0
Unsupervised Novelty Detection Methods Benchmarking with Wavelet Decomposition | Code | 0
Evaluating Shallow and Deep Neural Networks for Network Intrusion Detection Systems in Cyber Security | Code | 0
Transparent and Scrutable Recommendations Using Natural Language User Profiles | Code | 0
SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor Variations | Code | 0
SensorBench: Benchmarking LLMs in Coding-Based Sensor Processing | Code | 0
A Comprehensive Summarization and Evaluation of Feature Refinement Modules for CTR Prediction | Code | 0
Navigating Out-of-Distribution Electricity Load Forecasting during COVID-19: Benchmarking energy load forecasting models without and with continual learning | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified