Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2001–2050 of 5548 papers

Title	Date	Tasks	Status	Score
Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification models -- Part I	Sep 12, 2024	BenchmarkingCPU	CodeCode Available	5
Improve Machine Learning carbon footprint using Parquet dataset format and Mixed Precision training for regression models -- Part II	Sep 17, 2024	BenchmarkingDescriptive	CodeCode Available	5
Action-conditioned Benchmarking of Robotic Video Prediction Models: a Comparative Study	Oct 7, 2019	BenchmarkingPrediction	CodeCode Available	5
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction	Oct 20, 2021	BenchmarkingLanguage Modeling	CodeCode Available	5
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image Classification	Apr 23, 2024	BenchmarkingHyperspectral Image Classification	CodeCode Available	5
Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation Threads	Nov 6, 2022	BenchmarkingOpinion Mining	CodeCode Available	5
Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture	Jun 10, 2024	BenchmarkingDecoder	CodeCode Available	5
A Meta-Analysis of the Anomaly Detection Problem	Mar 3, 2015	Anomaly DetectionBenchmarking	CodeCode Available	5
Benchmarking framework for machine learning classification from fNIRS data	Mar 3, 2023	BenchmarkingBrain Computer Interface	CodeCode Available	5
Deep Affinity Network for Multiple Object Tracking	Oct 28, 2018	BenchmarkingMultiple Object Tracking	CodeCode Available	5
Benchmarks for Graph Embedding Evaluation	Aug 19, 2019	BenchmarkingGraph Embedding	CodeCode Available	5
Benchmarking Framework for Performance-Evaluation of Causal Inference Analysis	Feb 14, 2018	BenchmarkingCausal Inference	CodeCode Available	5
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset	Mar 9, 2023	BenchmarkingDeep Learning	CodeCode Available	5
deepCR: Cosmic Ray Rejection with Deep Learning	Jul 22, 2019	BenchmarkingCPU	CodeCode Available	5
Immunofluorescence Capillary Imaging Segmentation: Cases Study	Jul 14, 2022	BenchmarkingImage Segmentation	CodeCode Available	5
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari	Feb 24, 2018	Atari GamesBenchmarking	CodeCode Available	5
Impact of ImageNet Model Selection on Domain Adaptation	Feb 6, 2020	BenchmarkingDomain Adaptation	CodeCode Available	5
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge	Jun 17, 2025	BenchmarkingRetrieval	CodeCode Available	5
Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets	Oct 23, 2017	BenchmarkingBIG-bench Machine Learning	CodeCode Available	5
Deepened Graph Auto-Encoders Help Stabilize and Enhance Link Prediction	Mar 21, 2021	BenchmarkingClustering	CodeCode Available	5
AlphaZip: Neural Network-Enhanced Lossless Text Compression	Sep 23, 2024	BenchmarkingData Compression	CodeCode Available	5
Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study	Mar 15, 2024	Benchmarking	CodeCode Available	5
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning	Sep 30, 2024	BenchmarkingDisparity Estimation	CodeCode Available	5
A Wild Bootstrap for Degenerate Kernel Tests	Aug 23, 2014	BenchmarkingTime Series	CodeCode Available	5
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applications	Jan 19, 2023	BenchmarkingGPU	CodeCode Available	5
Question-Answering Dense Video Events	Sep 6, 2024	BenchmarkingQuestion Answering	CodeCode Available	5
Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary Dropouts	Mar 9, 2023	Benchmarking	CodeCode Available	5
Benchmarking White Blood Cell Classification Under Domain Shift	Mar 3, 2023	BenchmarkingClassification	CodeCode Available	5
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)	Oct 6, 2022	Benchmarking	CodeCode Available	5
Illuminating the Diversity-Fitness Trade-Off in Black-Box Optimization	Aug 29, 2024	BenchmarkingDiversity	CodeCode Available	5
IHCV: Discovery of Hidden Time-Dependent Control Variables in Non-Linear Dynamical Systems	Apr 5, 2023	Benchmarking	CodeCode Available	5
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions	Dec 11, 2024	BenchmarkingQuestion Answering	CodeCode Available	5
Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Jun 11, 2024	BenchmarkingContrastive Learning	CodeCode Available	5
Identifying and Benchmarking Natural Out-of-Context Prediction Problems	Oct 25, 2021	Benchmarking	CodeCode Available	5
Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels	Nov 21, 2024	BenchmarkingMachine Translation	CodeCode Available	5
RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions	Jan 1, 2025	Benchmarking	CodeCode Available	5
IdeaBench: Benchmarking Large Language Models for Research Idea Generation	Oct 31, 2024	Benchmarkingscientific discovery	CodeCode Available	5
Identifying Money Laundering Subgraphs on the Blockchain	Oct 10, 2024	Benchmarking	CodeCode Available	5
IceBench: A Benchmark for Deep Learning based Sea Ice Type Classification	Mar 22, 2025	BenchmarkingClassification	CodeCode Available	5
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs	Feb 25, 2024	BenchmarkingChatbot	CodeCode Available	5
Identifying the Smallest Adversarial Load Perturbations that Render DC-OPF Infeasible	Jul 10, 2025	Adversarial AttackBenchmarking	CodeCode Available	5
Benchmarking Unsupervised Strategies for Anomaly Detection in Multivariate Time Series	Jun 25, 2025	Anomaly DetectionBenchmarking	CodeCode Available	5
Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-Learn	Jan 1, 2014	AutoMLBenchmarking	CodeCode Available	5
Benchmarking Unsupervised Online IDS for Masquerade Attacks in CAN	Jun 19, 2024	BenchmarkingIntrusion Detection	CodeCode Available	5
Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding	Sep 9, 2020	BenchmarkingClustering	CodeCode Available	5
ACCESS DENIED INC: The First Benchmark Environment for Sensitivity Awareness	Jun 1, 2025	BenchmarkingManagement	CodeCode Available	5
AutoMIR: Effective Zero-Shot Medical Information Retrieval without Relevance Labels	Oct 26, 2024	BenchmarkingInformation Retrieval	CodeCode Available	5
Hyperbolic Benchmarking Unveils Network Topology-Feature Relationship in GNN Performance	Jun 4, 2024	BenchmarkingDrug Discovery	CodeCode Available	5
Hyperparameter-Free Losses for Model-Based Monocular Reconstruction	Aug 16, 2019	3D ReconstructionBenchmarking	CodeCode Available	5
Hybrid Machine Learning Models of Classifying Residential Requests for Smart Dispatching	Dec 22, 2019	BenchmarkingBIG-bench Machine Learning	CodeCode Available	5

Show:10 25 50

← PrevPage 41 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified