SOTAVerified

Benchmarking

Papers

Showing 2001-2050 of 5548 papers

Title | Status | Hype
Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification models -- Part I | Code | 0
Improve Machine Learning carbon footprint using Parquet dataset format and Mixed Precision training for regression models -- Part II | Code | 0
Action-conditioned Benchmarking of Robotic Video Prediction Models: a Comparative Study | Code | 0
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction | Code | 0
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image Classification | Code | 0
Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation Threads | Code | 0
Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture | Code | 0
A Meta-Analysis of the Anomaly Detection Problem | Code | 0
Benchmarking framework for machine learning classification from fNIRS data | Code | 0
Deep Affinity Network for Multiple Object Tracking | Code | 0
Benchmarks for Graph Embedding Evaluation | Code | 0
Benchmarking Framework for Performance-Evaluation of Causal Inference Analysis | Code | 0
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset | Code | 0
deepCR: Cosmic Ray Rejection with Deep Learning | Code | 0
Immunofluorescence Capillary Imaging Segmentation: Cases Study | Code | 0
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari | Code | 0
Impact of ImageNet Model Selection on Domain Adaptation | Code | 0
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge | Code | 0
Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets | Code | 0
Deepened Graph Auto-Encoders Help Stabilize and Enhance Link Prediction | Code | 0
AlphaZip: Neural Network-Enhanced Lossless Text Compression | Code | 0
Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study | Code | 0
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning | Code | 0
A Wild Bootstrap for Degenerate Kernel Tests | Code | 0
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applications | Code | 0
Question-Answering Dense Video Events | Code | 0
Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary Dropouts | Code | 0
Benchmarking White Blood Cell Classification Under Domain Shift | Code | 0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C) | Code | 0
Illuminating the Diversity-Fitness Trade-Off in Black-Box Optimization | Code | 0
IHCV: Discovery of Hidden Time-Dependent Control Variables in Non-Linear Dynamical Systems | Code | 0
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions | Code | 0
Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Code | 0
Identifying and Benchmarking Natural Out-of-Context Prediction Problems | Code | 0
Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels | Code | 0
RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions | Code | 0
IdeaBench: Benchmarking Large Language Models for Research Idea Generation | Code | 0
Identifying Money Laundering Subgraphs on the Blockchain | Code | 0
IceBench: A Benchmark for Deep Learning based Sea Ice Type Classification | Code | 0
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Code | 0
Identifying the Smallest Adversarial Load Perturbations that Render DC-OPF Infeasible | Code | 0
Benchmarking Unsupervised Strategies for Anomaly Detection in Multivariate Time Series | Code | 0
Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-Learn | Code | 0
Benchmarking Unsupervised Online IDS for Masquerade Attacks in CAN | Code | 0
Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding | Code | 0
ACCESS DENIED INC: The First Benchmark Environment for Sensitivity Awareness | Code | 0
AutoMIR: Effective Zero-Shot Medical Information Retrieval without Relevance Labels | Code | 0
Hyperbolic Benchmarking Unveils Network Topology-Feature Relationship in GNN Performance | Code | 0
Hyperparameter-Free Losses for Model-Based Monocular Reconstruction | Code | 0
Hybrid Machine Learning Models of Classifying Residential Requests for Smart Dispatching | Code | 0
Page 41 of 111

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified