SOTAVerified

Benchmarking

Papers

Showing 36513675 of 5548 papers

TitleStatusHype
Towards Effective Disambiguation for Machine Translation with Large Language Models0
An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum DisorderCode0
SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction0
Training neural mapping schemes for satellite altimetry with simulation data0
The Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design0
Exploration of TPUs for AI Applications0
Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool0
Anchor Points: Benchmarking Models with Much Fewer ExamplesCode0
M3Dsynth: A dataset of medical 3D images with AI-generated local manipulationsCode0
Benchmarking machine learning models for quantum state classification0
Leveraging Contextual Information for Effective Entity Salience Detection0
So you think you can track?0
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on TurkishCode0
Unveiling the potential of large language models in generating semantic and cross-language clones0
AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving0
Navigating Out-of-Distribution Electricity Load Forecasting during COVID-19: Benchmarking energy load forecasting models without and with continual learningCode0
DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation0
Better Practices for Domain Adaptation0
Using representation balancing to learn conditional-average dose responses from clustered dataCode0
Are SNNs Truly Energy-efficient? - A Hardware Perspective0
Neural Networks for Fast Optimisation in Model Predictive Control: A Review0
AGIBench: A Multi-granularity, Multimodal, Human-referenced, Auto-scoring Benchmark for Large Language Models0
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking0
Hybrid data driven/thermal simulation model for comfort assessment0
Transfer Learning between Motor Imagery Datasets using Deep Learning -- Validation of Framework and Comparison of DatasetsCode0
Show:102550
← PrevPage 147 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified