SOTAVerified

Benchmarking

Papers

Showing 40264050 of 5548 papers

TitleStatusHype
SUTD-PRCM Dataset and Neural Architecture Search Approach for Complex Metasurface Design0
Measuring CLEVRness: Blackbox testing of Visual Reasoning Models0
Benchmarking Generative Latent Variable Models for SpeechCode0
Evaluating Feature Attribution Methods in the Image DomainCode0
Benchmarking the Linear Algebra Awareness of TensorFlow and PyTorchCode0
How to Manage Tiny Machine Learning at Scale: An Industrial PerspectiveCode0
Rethinking Pareto Frontier for Performance Evaluation of Deep Neural Networks0
MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution ImageryCode1
Benchmarking missing-values approaches for predictive models on health databasesCode0
On loss functions and evaluation metrics for music source separation0
Benchmarking of DL Libraries and Models on Mobile DevicesCode1
Benchmarking Online Sequence-to-Sequence and Character-based Handwriting Recognition from IMU-Enhanced Pens0
Benchmarking Robot Manipulation with the Rubik's Cube0
MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training ConflictsCode1
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training BenchmarkCode0
Dual Task Framework for Improving Persona-grounded Dialogue Dataset0
High Fidelity RF Clutter Modeling and Simulation0
Lightweight Jet Reconstruction and Identification as an Object Detection Task0
BIQ2021: A Large-Scale Blind Image Quality Assessment Database0
ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core LearningCode1
Comparative Study Between Distance Measures On Supervised Optimum-Path Forest ClassificationCode0
What are the best systems? New perspectives on NLP BenchmarkingCode1
RECOVER: sequential model optimization platform for combination drug repurposing identifies novel synergistic compounds in vitroCode1
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm ConfigurationCode0
Benchmarking and Analyzing Point Cloud Classification under CorruptionsCode1
Show:102550
← PrevPage 162 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified