SOTAVerified

Benchmarking

Papers

Showing 48014825 of 5548 papers

TitleStatusHype
FR-MRInet: A Deep Convolutional Encoder-Decoder for Brain Tumor Segmentation with Relu-RGB and Sliding-windowCode0
AdamZ: An Enhanced Optimisation Method for Neural Network TrainingCode0
MLPerf Training BenchmarkCode0
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm ConfigurationCode0
Benchmarking Spurious Bias in Few-Shot Image ClassifiersCode0
FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question AnsweringCode0
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN ParametersCode0
MMCoQA: Conversational Question Answering over Text, Tables, and ImagesCode0
Forecasting time series with constraintsCode0
Action-conditioned Benchmarking of Robotic Video Prediction Models: a Comparative StudyCode0
Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and ChallengesCode0
Forecasting Future International Events: A Reliable Dataset for Text-Based Event ModelingCode0
Benchmarking Single Image Dehazing and BeyondCode0
VRKitchen2.0-IndoorKit: A Tutorial for Augmented Indoor Scene Building in OmniverseCode0
One Law, Many Languages: Benchmarking Multilingual Legal Reasoning for Judicial SupportCode0
Forecasting Across Time Series Databases using Recurrent Neural Networks on Groups of Similar Series: A Clustering ApproachCode0
fMRI-S4: learning short- and long-range dynamic fMRI dependencies using 1D Convolutions and State Space ModelsCode0
Scaling and Benchmarking Self-Supervised Visual Representation LearningCode0
Scaling Compute Is Not All You Need for Adversarial RobustnessCode0
Scaling Up Resonate-and-Fire Networks for Fast Deep LearningCode0
Universal Music Representations? Evaluating Foundation Models on World Music CorporaCode0
MM-Soc: Benchmarking Multimodal Large Language Models in Social Media PlatformsCode0
Fluorescence Reference Target Quantitative Analysis LibraryCode0
FLsim: A Modular and Library-Agnostic Simulation Framework for Federated LearningCode0
FlowCyt: A Comparative Study of Deep Learning Approaches for Multi-Class Classification in Flow Cytometry BenchmarkingCode0
Show:102550
← PrevPage 193 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified