SOTAVerified

Benchmarking

Papers

Showing 50015025 of 5548 papers

TitleStatusHype
Enhancing 3D-Air Signature by Pen Tip Tail Trajectory Awareness: Dataset and Featuring by Novel Spatio-temporal CNNCode0
Neurological Prognostication of Post-Cardiac-Arrest Coma Patients Using EEG Data: A Dynamic Survival Analysis Framework with Competing RisksCode0
Asynchronous Batch Bayesian Optimization with Pipelining Evaluations for Experimental Resourcex2013constrained ConditionsCode0
NeuroMorse: A Temporally Structured Dataset For Neuromorphic ComputingCode0
NeuroSim V1.5: Improved Software Backbone for Benchmarking Compute-in-Memory Accelerators with Device and Circuit-level Non-idealitiesCode0
EnergyStar++: Towards more accurate and explanatory building energy benchmarkingCode0
Accelerating Large-Scale Inference with Anisotropic Vector QuantizationCode0
A survey of probabilistic generative frameworks for molecular simulationsCode0
Benchmarking neural embeddings for link prediction in knowledge graphs under semantic and structural changesCode0
EmProx: Neural Network Performance Estimation For Neural Architecture SearchCode0
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual UpdatesCode0
A comparison of translation performance between DeepL and SupertextCode0
Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation FrameworkCode0
Benchmarking Multimodal CoT Reward Model Stepwise by Visual ProgramCode0
Benchmarking Machine Translation with Cultural AwarenessCode0
Benchmarking Multilabel Topic Classification in the Kyrgyz LanguageCode0
Unsupervised Tracklet Person Re-IdentificationCode0
Empirical Study of Off-Policy Policy Evaluation for Reinforcement LearningCode0
TMPNN: High-Order Polynomial Regression Based on Taylor Map FactorizationCode0
Nmbr9 as a Constraint Programming ChallengeCode0
EFSA: Towards Event-Level Financial Sentiment AnalysisCode0
Efficient, Uncertainty-based Moderation of Neural Networks Text ClassifiersCode0
Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human DigitizationCode0
Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific LeaderboardsCode0
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop ReasoningCode0
Show:102550
← PrevPage 201 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified