SOTAVerified

Benchmarking

Papers

Showing 2951–2975 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Demographic Parity: Mitigating Biases in Real-World Data | | 0 |
| NLPBench: Evaluating Large Language Models on Solving NLP Problems | Code | 1 |
| A Content-Driven Micro-Video Recommendation Dataset at Scale | Code | 2 |
| Unified Long-Term Time-Series Forecasting Benchmark | Code | 1 |
| Node-Aligned Graph-to-Graph (NAG2G): Elevating Template-Free Deep Learning Approaches in Single-Step Retrosynthesis | Code | 1 |
| Advancing The Rate-Distortion-Computation Frontier For Neural Image Compression | | 0 |
| A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning | Code | 2 |
| Thalamic nuclei segmentation from T_1-weighted MRI: unifying and benchmarking state-of-the-art methods with young and old cohorts | | 0 |
| On quantifying and improving realism of images generated with diffusion | | 0 |
| Optimization Techniques for a Physical Model of Human Vocalisation | | 0 |
| Benchmarking Local Robustness of High-Accuracy Binary Neural Networks for Enhanced Traffic Sign Recognition | Code | 1 |
| Efficient Pauli channel estimation with logarithmic quantum memory | | 0 |
| Machine-assisted quantitizing designs: augmenting humanities and social sciences with artificial intelligence | Code | 0 |
| Categorization and analysis of 14 computational methods for estimating cell potency from single-cell RNA-seq data | | 0 |
| Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Shape Reconstruction | Code | 1 |
| VisionKG: Unleashing the Power of Visual Datasets via Knowledge Graph | | 0 |
| Grad DFT: a software library for machine learning enhanced density functional theory | Code | 1 |
| Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data | | 0 |
| Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts | | 0 |
| Benchmarking quantized LLaMa-based models on the Brazilian Secondary School Exam | | 0 |
| Prompt Tuned Embedding Classification for Multi-Label Industry Sector Allocation | Code | 1 |
| Multimodal Deep Learning for Scientific Imaging Interpretation | | 0 |
| On the relationship between Benchmarking, Standards and Certification in Robotics and AI | | 0 |
| Towards Effective Disambiguation for Machine Translation with Large Language Models | | 0 |
| An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |