SOTAVerified

Benchmarking

Papers

Showing 19011925 of 5548 papers

TitleStatusHype
Improve Machine Learning carbon footprint using Parquet dataset format and Mixed Precision training for regression models -- Part IICode0
CREPO: An Open Repository to Benchmark Credal Network AlgorithmsCode0
Improving the Perturbation-Based Explanation of Deepfake Detectors Through the Use of Adversarially-Generated SamplesCode0
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image ClassificationCode0
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair PredictionCode0
Beemo: Benchmark of Expert-edited Machine-generated OutputsCode0
Critical review of conformational B-cell epitope prediction methodsCode0
Bias Reduction via Cooperative Bargaining in Synthetic Graph Dataset GenerationCode0
AdamZ: An Enhanced Optimisation Method for Neural Network TrainingCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation ThreadsCode0
Bias Analysis and Mitigation in the Evaluation of Authorship VerificationCode0
BED: Bi-Encoder-Based Detectors for Out-of-Distribution DetectionCode0
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity LearningCode0
Immunofluorescence Capillary Imaging Segmentation: Cases StudyCode0
BEARD: Benchmarking the Adversarial Robustness for Dataset DistillationCode0
AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and HealthcareCode0
Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation LearningCode0
Impact of ImageNet Model Selection on Domain AdaptationCode0
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual IllusionsCode0
Benchmarking Automated Clinical Language Simplification: Dataset, Algorithm, and EvaluationCode0
Beyond Slow Signs in High-fidelity Model ExtractionCode0
Illuminating the Diversity-Fitness Trade-Off in Black-Box OptimizationCode0
Answer Consolidation: Formulation and BenchmarkingCode0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)Code0
Show:102550
← PrevPage 77 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified