Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3776–3800 of 5548 papers

Title	Date	Tasks	Status
MUPAX: Multidimensional Problem Agnostic eXplainable AI	Jul 17, 2025	Anatomical Landmark DetectionAudio Classification	—Unverified
Benchmarking Defeasible Reasoning with Large Language Models -- Initial Experiments and Future Directions	Oct 16, 2024	Benchmarking	—Unverified
Benchmarking Deep Trackers on Aerial Videos	Mar 24, 2021	AttributeBenchmarking	—Unverified
MVS^2: Deep Unsupervised Multi-view Stereo with Multi-View Symmetry	Aug 30, 2019	Benchmarking	—Unverified
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks	Jun 24, 2023	BenchmarkingHate Speech Detection	—Unverified
N^2: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion	Jun 4, 2025	BenchmarkingCausal Inference	—Unverified
NABU - Multilingual Graph-based Neural RDF Verbalizer	Sep 16, 2020	BenchmarkingDecoder	—Unverified
Towards Toxic Positivity Detection	Jul 1, 2022	BenchmarkingClassification	—Unverified
Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series	Nov 8, 2018	BenchmarkingDecision Making	—Unverified
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics	Jan 11, 2022	BenchmarkingDeep Reinforcement Learning	—Unverified
Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices	Sep 25, 2024	Autonomous VehiclesBenchmarking	—Unverified
Benchmarking deep learning models for bearing fault diagnosis using the CWRU dataset: A multi-label approach	Jul 19, 2024	BenchmarkingBinary Classification	—Unverified
NAS-Bench-Zero: A Large Scale Dataset for Understanding Zero-Shot Neural Architecture Search	Sep 29, 2021	BenchmarkingNeural Architecture Search	—Unverified
Benchmarking Deep Learning Frameworks for Automated Diagnosis of Ocular Toxoplasmosis: A Comprehensive Approach to Classification and Segmentation	May 18, 2023	BenchmarkingDiagnostic	—Unverified
NA-SODINN: a deep learning algorithm for exoplanet image detection based on residual noise regimes	Feb 6, 2023	BenchmarkingSpecificity	—Unverified
NativQA: Multilingual Culturally-Aligned Natural Query for LLMs	Jul 13, 2024	BenchmarkingQuestion Answering	—Unverified
Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition	Dec 12, 2023	BenchmarkingDeep Learning	—Unverified
Natural Disasters Detection in Social Media and Satellite imagery: a survey	Jan 14, 2019	Benchmarking	—Unverified
Benchmarking Deep Learning-Based Methods for Irradiance Nowcasting with Sky Images	Mar 27, 2025	Benchmarking	—Unverified
Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages	Apr 23, 2021	BenchmarkingDeception Detection	—Unverified
NATURAL PLAN: Benchmarking LLMs on Natural Language Planning	Jun 6, 2024	BenchmarkingScheduling	—Unverified
Nature-Inspired Optimization Algorithms: Challenges and Open Problems	Mar 8, 2020	Benchmarking	—Unverified
NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation	May 20, 2025	Autonomous NavigationBenchmarking	—Unverified
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts	Aug 1, 2021	BenchmarkingBinary Classification	—Unverified
Benchmarking Deep Learning Architectures for Urban Vegetation Point Cloud Semantic Segmentation from MLS	Jun 17, 2023	BenchmarkingSegmentation	—Unverified

Show:10 25 50

← PrevPage 152 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified