SOTAVerified

Benchmarking

Papers

Showing 19511975 of 5548 papers

TitleStatusHype
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image ClassificationCode0
Impact of ImageNet Model Selection on Domain AdaptationCode0
Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations?Code0
Immunofluorescence Capillary Imaging Segmentation: Cases StudyCode0
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair PredictionCode0
Beyond Document Page Classification: Design, Datasets, and ChallengesCode0
A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep LearningCode0
BASED: Benchmarking, Analysis, and Structural Estimation of DeblurringCode0
Neural Style Transfer Improves 3D Cardiovascular MR Image Segmentation on Inconsistent DataCode0
Beyond Atomic Geometry Representations in Materials Science: A Human-in-the-Loop Multimodal FrameworkCode0
Beyond Accuracy: A Consolidated Tool for Visual Question Answering BenchmarkingCode0
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity LearningCode0
Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation ThreadsCode0
Benchmarking Failures in Tool-Augmented Language ModelsCode0
IOLBENCH: Benchmarking LLMs on Linguistic ReasoningCode0
PartNet: A Large-scale Benchmark for Fine-grained and Hierarchical Part-level 3D Object UnderstandingCode0
Illuminating the Diversity-Fitness Trade-Off in Black-Box OptimizationCode0
Better Late Than Never: Formulating and Benchmarking Recommendation EditingCode0
Better force fields start with better data -- A data set of cation dipeptide interactionsCode0
BanglaNLP at BLP-2023 Task 2: Benchmarking different Transformer Models for Sentiment Analysis of Bangla Social Media PostsCode0
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive SegmentationCode0
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual IllusionsCode0
IHCV: Discovery of Hidden Time-Dependent Control Variables in Non-Linear Dynamical SystemsCode0
Performance Modeling of Data Storage Systems using Generative ModelsCode0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)Code0
Show:102550
← PrevPage 79 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified