SOTAVerified

Benchmarking

Papers

Showing 2976–3000 of 5548 papers

Title | Status | Hype
Training neural mapping schemes for satellite altimetry with simulation data | - | 0
SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction | - | 0
The Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design | - | 0
Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool | - | 0
Exploration of TPUs for AI Applications | - | 0
Anchor Points: Benchmarking Models with Much Fewer Examples | Code | 0
M3Dsynth: A dataset of medical 3D images with AI-generated local manipulations | Code | 0
Leveraging Contextual Information for Effective Entity Salience Detection | - | 0
Benchmarking machine learning models for quantum state classification | - | 0
VerilogEval: Evaluating Large Language Models for Verilog Code Generation | Code | 2
So you think you can track? | - | 0
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish | Code | 0
An Image Dataset for Benchmarking Recommender Systems with Raw Pixels | Code | 1
AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving | - | 0
Unveiling the potential of large language models in generating semantic and cross-language clones | - | 0
Formalizing Multimedia Recommendation through Multimodal Deep Learning | Code | 1
FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions | Code | 1
RecAD: Towards A Unified Library for Recommender Attack and Defense | Code | 1
Navigating Out-of-Distribution Electricity Load Forecasting during COVID-19: Benchmarking energy load forecasting models without and with continual learning | Code | 0
DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation | - | 0
PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips | Code | 2
Using representation balancing to learn conditional-average dose responses from clustered data | Code | 0
Better Practices for Domain Adaptation | - | 0
Evaluation of large language models for discovery of gene set function | Code | 1
Neural Networks for Fast Optimisation in Model Predictive Control: A Review | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified