SOTAVerified

Benchmarking

Papers

Showing 3476-3500 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| OrionBench: Benchmarking Time Series Generative Models in the Service of the End-User | | 0 |
| A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks | | 0 |
| Benchmarking LLM Code Generation for Audio Programming with Visual Dataflow Languages | | 0 |
| Benchmarking LiDAR Sensors for Development and Evaluation of Automotive Perception | | 0 |
| Towards Benchmarking and Evaluating Deepfake Detection | | 0 |
| ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation | | 0 |
| MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects | | 0 |
| Deep Patent Landscaping Model Using Transformer and Graph Embedding | | 0 |
| Manual Verbalizer Enrichment for Few-Shot Text Classification | | 0 |
| Towards Benchmarking Explainable Artificial Intelligence Methods | | 0 |
| Mapping global dynamics of benchmark creation and saturation in artificial intelligence | | 0 |
| Mapping Violence: Developing an Extensive Framework to Build a Bangla Sectarian Expression Dataset from Social Media Interactions | | 0 |
| Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR | | 0 |
| Towards Benchmarking Scene Background Initialization | | 0 |
| MarineGym: A High-Performance Reinforcement Learning Platform for Underwater Robotics | | 0 |
| Benchmarking Lexical Simplification Systems | | 0 |
| Towards Benchmarking the Utility of Explanations for Model Debugging | | 0 |
| WER We Stand: Benchmarking Urdu ASR Models | | 0 |
| Benchmarking Learnt Radio Localisation under Distribution Shift | | 0 |
| Benchmarking learned non-Cartesian k-space trajectories and reconstruction networks | | 0 |
| Match Stereo Videos via Bidirectional Alignment | | 0 |
| MaterioMiner -- An ontology-based text mining dataset for extraction of process-structure-property entities | | 0 |
| PINNs for Medical Image Analysis: A Survey | | 0 |
| (N,K)-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model | | 0 |
| Benchmarking learned algorithms for computed tomography image reconstruction tasks | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |