SOTAVerified

Benchmarking

Papers

Showing 2551-2560 of 5548 papers

Title | Status | Hype
Recognizing Object Affordances to Support Scene Reasoning for Manipulation Tasks | Code | 0
FR-MRInet: A Deep Convolutional Encoder-Decoder for Brain Tumor Segmentation with Relu-RGB and Sliding-window | Code | 0
Detecting critical treatment effect bias in small subgroups | Code | 0
From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language Representation | Code | 0
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN Parameters | Code | 0
Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems | Code | 0
Benchmarking Reinforcement Learning Algorithms on Real-World Robots | Code | 0
Forecasting time series with constraints | Code | 0
Affine Non-negative Collaborative Representation Based Pattern Classification | Code | 0
DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified