Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3751–3800 of 5548 papers

Title	Date	Tasks	Status
Multimodal Information Retrieval for Open World with Edit Distance Weak Supervision	Jun 25, 2025	BenchmarkingInformation Retrieval	—Unverified
Benchmarking Edge Computing Devices for Grape Bunches and Trunks Detection using Accelerated Object Detection Single Shot MultiBox Deep Learning Models	Nov 21, 2022	BenchmarkingEdge-computing	—Unverified
Benchmarking Edge AI Platforms for High-Performance ML Inference	Sep 23, 2024	BenchmarkingCPU	—Unverified
Quantum Similarity Testing with Convolutional Neural Networks	Nov 3, 2022	Benchmarking	—Unverified
Benchmarking Domain Randomisation for Visual Sim-to-Real Transfer	Nov 13, 2020	BenchmarkingPose Estimation	—Unverified
Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes	Aug 1, 2021	BenchmarkingBinary Classification	—Unverified
Multi-Modal Three-Stream Network for Action Recognition	Sep 8, 2019	Action ClassificationAction Recognition	—Unverified
MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation	Dec 7, 2020	BenchmarkingObject	—Unverified
Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems	May 21, 2025	BenchmarkingMath	—Unverified
LadderMIL: Multiple Instance Learning with Coarse-to-Fine Self-Distillation	Feb 4, 2025	BenchmarkingClassification	—Unverified
Towards Stable 3D Object Detection	Jul 5, 2024	3D Object DetectionAutonomous Driving	—Unverified
Benchmarking Domain Generalization on EEG-based Emotion Recognition	Apr 18, 2022	BenchmarkingDomain Adaptation	—Unverified
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks	Feb 21, 2023	Benchmarking	—Unverified
MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts	Jun 18, 2024	ArticlesBenchmarking	—Unverified
AT-Drone: Benchmarking Adaptive Teaming in Multi-Drone Pursuit	Feb 13, 2025	BenchmarkingEdge-computing	—Unverified
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing	Dec 27, 2022	BenchmarkingSemantic Parsing	—Unverified
Benchmarking Diverse-Modal Entity Linking with Generative Models	May 27, 2023	BenchmarkingDecoder	—Unverified
Benchmarking Discrete Optimization Heuristics with IOHprofiler	Dec 19, 2019	Benchmarking	—Unverified
Non-linear Multitask Learning with Deep Gaussian Processes	May 29, 2019	BenchmarkingGaussian Processes	—Unverified
Benchmarking Differential Evolution on a Quantum Simulator	Nov 6, 2023	BenchmarkingEvolutionary Algorithms	—Unverified
Adaptive Gradient Methods with Local Guarantees	Mar 2, 2022	Benchmarking	—Unverified
Benchmarking Denoising Algorithms with Real Photographs	Jul 5, 2017	BenchmarkingDenoising	—Unverified
Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking	Jun 10, 2024	BenchmarkingEconometrics	—Unverified
Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception?	Dec 7, 2023	BenchmarkingDiversity	—Unverified
Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery	Dec 3, 2022	BenchmarkingRepresentation Learning	—Unverified
MUPAX: Multidimensional Problem Agnostic eXplainable AI	Jul 17, 2025	Anatomical Landmark DetectionAudio Classification	—Unverified
Benchmarking Defeasible Reasoning with Large Language Models -- Initial Experiments and Future Directions	Oct 16, 2024	Benchmarking	—Unverified
Benchmarking Deep Trackers on Aerial Videos	Mar 24, 2021	AttributeBenchmarking	—Unverified
MVS^2: Deep Unsupervised Multi-view Stereo with Multi-View Symmetry	Aug 30, 2019	Benchmarking	—Unverified
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks	Jun 24, 2023	BenchmarkingHate Speech Detection	—Unverified
N^2: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion	Jun 4, 2025	BenchmarkingCausal Inference	—Unverified
NABU - Multilingual Graph-based Neural RDF Verbalizer	Sep 16, 2020	BenchmarkingDecoder	—Unverified
Towards Toxic Positivity Detection	Jul 1, 2022	BenchmarkingClassification	—Unverified
Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series	Nov 8, 2018	BenchmarkingDecision Making	—Unverified
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics	Jan 11, 2022	BenchmarkingDeep Reinforcement Learning	—Unverified
Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices	Sep 25, 2024	Autonomous VehiclesBenchmarking	—Unverified
Benchmarking deep learning models for bearing fault diagnosis using the CWRU dataset: A multi-label approach	Jul 19, 2024	BenchmarkingBinary Classification	—Unverified
NAS-Bench-Zero: A Large Scale Dataset for Understanding Zero-Shot Neural Architecture Search	Sep 29, 2021	BenchmarkingNeural Architecture Search	—Unverified
Benchmarking Deep Learning Frameworks for Automated Diagnosis of Ocular Toxoplasmosis: A Comprehensive Approach to Classification and Segmentation	May 18, 2023	BenchmarkingDiagnostic	—Unverified
NA-SODINN: a deep learning algorithm for exoplanet image detection based on residual noise regimes	Feb 6, 2023	BenchmarkingSpecificity	—Unverified
NativQA: Multilingual Culturally-Aligned Natural Query for LLMs	Jul 13, 2024	BenchmarkingQuestion Answering	—Unverified
Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition	Dec 12, 2023	BenchmarkingDeep Learning	—Unverified
Natural Disasters Detection in Social Media and Satellite imagery: a survey	Jan 14, 2019	Benchmarking	—Unverified
Benchmarking Deep Learning-Based Methods for Irradiance Nowcasting with Sky Images	Mar 27, 2025	Benchmarking	—Unverified
Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages	Apr 23, 2021	BenchmarkingDeception Detection	—Unverified
NATURAL PLAN: Benchmarking LLMs on Natural Language Planning	Jun 6, 2024	BenchmarkingScheduling	—Unverified
Nature-Inspired Optimization Algorithms: Challenges and Open Problems	Mar 8, 2020	Benchmarking	—Unverified
NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation	May 20, 2025	Autonomous NavigationBenchmarking	—Unverified
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts	Aug 1, 2021	BenchmarkingBinary Classification	—Unverified
Benchmarking Deep Learning Architectures for Urban Vegetation Point Cloud Semantic Segmentation from MLS	Jun 17, 2023	BenchmarkingSegmentation	—Unverified

Show:10 25 50

← PrevPage 76 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified