SOTAVerified

Benchmarking

Papers

Showing 2151–2200 of 5548 papers

Title | Status | Hype
Illuminating the Diversity-Fitness Trade-Off in Black-Box Optimization | Code | 0
AI Sound Recognition on Asthma Medication Adherence: Evaluation With the RDA Benchmark Suite | Code | 0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C) | Code | 0
IHCV: Discovery of Hidden Time-Dependent Control Variables in Non-Linear Dynamical Systems | Code | 0
Benchmarking Retinal Blood Vessel Segmentation Models for Cross-Dataset and Cross-Disease Generalization | Code | 0
Asynchronous Batch Bayesian Optimization with Pipelining Evaluations for Experimental Resource-constrained Conditions | Code | 0
Benchmarking Representation Learning for Natural World Image Collections | Code | 0
Identifying the Smallest Adversarial Load Perturbations that Render DC-OPF Infeasible | Code | 0
Identifying Money Laundering Subgraphs on the Blockchain | Code | 0
Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation Threads | Code | 0
Benchmarking Reinforcement Learning Algorithms on Real-World Robots | Code | 0
Hyperspectral Image Dataset for Benchmarking on Salient Object Detection | Code | 0
Hyperparameter-Free Losses for Model-Based Monocular Reconstruction | Code | 0
Benchmarking quantum machine learning kernel training for classification tasks | Code | 0
Hyperbolic Benchmarking Unveils Network Topology-Feature Relationship in GNN Performance | Code | 0
Benchmarking Quantum Reinforcement Learning | Code | 0
Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-Learn | Code | 0
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Code | 0
HuSc3D: Human Sculpture dataset for 3D object reconstruction | Code | 0
Hybrid Machine Learning Models of Classifying Residential Requests for Smart Dispatching | Code | 0
A Comprehensive Comparison of Multi-Dimensional Image Denoising Methods | Code | 0
Hybrid Random Features | Code | 0
HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models | Code | 0
HRNET: AI on Edge for mask detection and social distancing | Code | 0
HR-VILAGE-3K3M: A Human Respiratory Viral Immunization Longitudinal Gene Expression Dataset for Systems Immunity | Code | 0
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish | Code | 0
How to Manage Tiny Machine Learning at Scale: An Industrial Perspective | Code | 0
Benchmarking Probabilistic Deep Learning Methods for License Plate Recognition | Code | 0
Benchmarking pre-trained text embedding models in aligning built asset information | Code | 0
Benchmarking Pre-trained Language Models for Multilingual NER: TraSpaS at the BSNLP2021 Shared Task | Code | 0
How Far Are We from Optimal Reasoning Efficiency? | Code | 0
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey | Code | 0
A survey of probabilistic generative frameworks for molecular simulations | Code | 0
Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis | Code | 0
Benchmarking Positional Encodings for GNNs and Graph Transformers | Code | 0
Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection | Code | 0
HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person Scenarios | Code | 0
HRIBench: Benchmarking Vision-Language Models for Real-Time Human Perception in Human-Robot Interaction | Code | 0
IceBench: A Benchmark for Deep Learning based Sea Ice Type Classification | Code | 0
Benchmarking Popular Classification Models' Robustness to Random and Targeted Corruptions | Code | 0
Benchmarking Perturbation-based Saliency Maps for Explaining Atari Agents | Code | 0
Benchmarking person re-identification datasets and approaches for practical real-world implementations | Code | 0
Benchmarking performance of object detection under image distortions in an uncontrolled environment | Code | 0
High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets | Code | 0
High-Dynamic-Range Imaging for Cloud Segmentation | Code | 0
Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus | Code | 0
Benchmarking Pathology Foundation Models: Adaptation Strategies and Scenarios | Code | 0
AstroVision: Towards Autonomous Feature Detection and Description for Missions to Small Bodies Using Deep Learning | Code | 0
Hi-EF: Benchmarking Emotion Forecasting in Human-interaction | Code | 0
Benchmarking Parameter Control Methods in Differential Evolution for Mixed-Integer Black-Box Optimization | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified