SOTAVerified

Benchmarking

Papers

Showing 31263150 of 5548 papers

TitleStatusHype
Structural Property Prediction0
Performance Modeling of Data Storage Systems using Generative ModelsCode0
Unsupervised Spectral Demosaicing with Lightweight Spectral Attention Networks0
ClimateLearn: Benchmarking Machine Learning for Weather and Climate ModelingCode2
OpenSiteRec: An Open Dataset for Site Recommendation0
A Synthetic Benchmarking Pipeline to Compare Camera Calibration Algorithms0
Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity0
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency0
InstructEval: Systematic Evaluation of Instruction Selection Methods0
Learning Environment Models with Continuous Stochastic Dynamics0
Benchmarking Large Language Model Capabilities for Conditional Generation0
Principles and Guidelines for Evaluating Social Robot Navigation Algorithms0
Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors0
Uncovering the Limits of Machine Learning for Automatic Vulnerability DetectionCode1
Benchmarking Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity0
Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specifc Knowledge Injection0
Emotion Analysis of Tweets Banning Education in Afghanistan0
Paradigm Shift in Sustainability Disclosure Analysis: Empowering Stakeholders with CHATREPORT, a Language Model-Based Tool0
Pulse Shape-Aided Multipath Delay Estimation for Fine-Grained WiFi Sensing0
Benchmarking Stroke Forecasting with Stroke-Level Badminton Dataset0
Enhancing Navigation Benchmarking and Perception Data Generation for Row-based Crops in Simulation0
SCENEREPLICA: Benchmarking Real-World Robot Manipulation by Creating Replicable ScenesCode1
InterCode: Standardizing and Benchmarking Interactive Coding with Execution FeedbackCode2
Improving Reference-based Distinctive Image Captioning with Contrastive Rewards0
Hybrid Precoder and Combiner Designs for Decentralized Parameter Estimation in mmWave MIMO Wireless Sensor Networks0
Show:102550
← PrevPage 126 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified