SOTAVerified

Benchmarking

Papers

Showing 36263650 of 5548 papers

TitleStatusHype
A Comparative Attention Framework for Better Few-Shot Object Detection on Aerial ImagesCode1
Deep Crowd Anomaly Detection: State-of-the-Art, Challenges, and Future Research Directions0
What cleaves? Is proteasomal cleavage prediction reaching a ceiling?0
ESB: A Benchmark For Multi-Domain End-to-End Speech RecognitionCode1
SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural NetworksCode1
Benchmarking GPU and TPU Performance with Graph Neural Networks0
Multi-scale data reconstruction of turbulent rotating flows with Gappy POD, Extended POD and Generative Adversarial Networks0
A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research ChallengesCode1
gSuite: A Flexible and Framework Independent Benchmark Suite for Graph Neural Network Inference on GPUs0
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator ControlCode1
LaMAR: Benchmarking Localization and Mapping for Augmented RealityCode2
Graphs, Constraints, and Search for the Abstraction and Reasoning CorpusCode1
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylationsCode1
FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks0
Conditional Neural Processes for Molecules0
Sub-8-bit quantization for on-device speech recognition: a regularization-free approach0
KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial DocumentsCode1
An Open-source Benchmark of Deep Learning Models for Audio-visual Apparent and Self-reported Personality RecognitionCode1
DyFEn: Agent-Based Fee Setting in Payment Channel Networks0
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based EnvironmentsCode1
A Comprehensive Study on Large-Scale Graph Training: Benchmarking and RethinkingCode1
A Survey of Parameters Associated with the Quality of Benchmarks in NLP0
TweetNERD -- End to End Entity Linking Benchmark for TweetsCode0
CAB: Comprehensive Attention Benchmarking on Long Sequence ModelingCode1
CORL: Research-oriented Deep Offline Reinforcement Learning LibraryCode3
Show:102550
← PrevPage 146 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified