Benchmarking

Papers

Title	Date	Tasks	Status	Hype
AERF: Adaptive ensemble random fuzzy algorithm for anomaly detection in cloud computing	Jan 9, 2023	Anomaly Detection, Benchmarking	Unverified	0
Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture	Jan 9, 2023	Avg, Benchmarking	Unverified	0
"It's a Match!" -- A Benchmark of Task Affinity Scores for Joint Learning	Jan 7, 2023	Benchmarking, Multi-Task Learning	Unverified	0
The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed Manipulation	Jan 6, 2023	Benchmarking, Crop Classification	Code Available	1
The Evolutionary Computation Methods No One Should Use	Jan 5, 2023	Benchmarking	Unverified	0
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions	Jan 5, 2023	Articles, Benchmarking	Code Available	0
Trace Encoding in Process Mining: a survey and benchmarking	Jan 5, 2023	Benchmarking, Predictive Process Monitoring	Code Available	1
HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset	Jan 3, 2023	Benchmarking, Computed Tomography (CT)	Unverified	0
Improving Sequential Recommendation Models with an Enhanced Loss Function	Jan 3, 2023	Benchmarking, Recommendation Systems	Code Available	0
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise	Jan 3, 2023	Benchmarking, Classification	Unverified	0
Benchmarking the Robustness of LiDAR Semantic Segmentation Models	Jan 3, 2023	Autonomous Driving, Benchmarking	Code Available	2
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation	Jan 3, 2023	Benchmarking, Few-shot Instance Segmentation	Code Available	1
SQAD: Automatic Smartphone Camera Quality Assessment and Benchmarking	Jan 1, 2023	Benchmarking	Code Available	1
Tree Instance Segmentation With Temporal Contour Graph	Jan 1, 2023	Benchmarking, Instance Segmentation	Unverified	0
Benchmarking Robustness of 3D Object Detection to Common Corruptions	Jan 1, 2023	3D Object Detection, Autonomous Driving	Code Available	1
MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs	Jan 1, 2023	Benchmarking, GPU	Code Available	1
Comparison of tree-based ensemble algorithms for merging satellite and earth-observed precipitation data at the daily time scale	Dec 31, 2022	Benchmarking, regression	Unverified	0
4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Dec 31, 2022	Autonomous Driving, Benchmarking	Unverified	0
Biologically Plausible Learning on Neuromorphic Hardware Architectures	Dec 29, 2022	Benchmarking, Quantization	Unverified	0
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing	Dec 27, 2022	Benchmarking, Semantic Parsing	Unverified	0
AER: Auto-Encoder with Regression for Time Series Anomaly Detection	Dec 27, 2022	Anomaly Detection, Benchmarking	Code Available	3
Quality at the Tail of Machine Learning Inference	Dec 25, 2022	Autonomous Driving, Benchmarking	Unverified	0
Benchmarking Machine Learning Models to Predict Corporate Bankruptcy	Dec 22, 2022	Benchmarking	Unverified	0
Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method	Dec 22, 2022	4k, 8k	Code Available	2
A Seven-Layer Model for Standardising AI Fairness Assessment	Dec 21, 2022	Benchmarking, Fairness	Unverified	0
Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias	Dec 20, 2022	Benchmarking	Code Available	0
Distributed Software-Defined Network Architecture for Smart Grid Resilience to Denial-of-Service Attacks	Dec 20, 2022	Benchmarking	Unverified	0
AI applications in forest monitoring need remote sensing benchmark datasets	Dec 20, 2022	Benchmarking	Unverified	0
Benchmarking person re-identification datasets and approaches for practical real-world implementations	Dec 20, 2022	Benchmarking, Pedestrian Detection	Code Available	0
A Comprehensive Study of the Robustness for LiDAR-based 3D Object Detectors against Adversarial Attacks	Dec 20, 2022	3D Object Detection, Benchmarking	Code Available	1
AnyTOD: A Programmable Task-Oriented Dialog System	Dec 20, 2022	Benchmarking, Language Modeling	Unverified	0
Benchmarking Spatial Relationships in Text-to-Image Generation	Dec 20, 2022	Benchmarking, Image Generation	Code Available	1
Trial-Based Dominance Enables Non-Parametric Tests to Compare both the Speed and Accuracy of Stochastic Optimizers	Dec 19, 2022	Benchmarking, Stochastic Optimization	Unverified	0
GiCCS: A German in-Context Conversational Similarity Benchmark	Dec 16, 2022	Benchmarking, Semantic Textual Similarity	Unverified	0
Biomedical image analysis competitions: The state of current participation practice	Dec 16, 2022	Benchmarking, Survey	Unverified	0
Automatic vehicle trajectory data reconstruction at scale	Dec 15, 2022	Benchmarking, vehicle detection	Unverified	0
Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift	Dec 15, 2022	Benchmarking, Image Captioning	Code Available	1
Benchmarking Large Language Models for Automated Verilog RTL Code Generation	Dec 13, 2022	Benchmarking, Code Generation	Code Available	1
Mind the Retrosynthesis Gap: Bridging the divide between Single-step and Multi-step Retrosynthesis Prediction	Dec 12, 2022	Benchmarking, Multi-step retrosynthesis	Unverified	0
PyPop7: A Pure-Python Library for Population-Based Black-Box Optimization	Dec 12, 2022	Benchmarking, Evolutionary Algorithms	Code Available	2
On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline	Dec 12, 2022	Benchmarking, Data Augmentation	Code Available	1
Momentum Contrastive Pre-training for Question Answering	Dec 12, 2022	Benchmarking, Contrastive Learning	Unverified	0
Progressive Multi-view Human Mesh Recovery with Self-Supervision	Dec 10, 2022	Benchmarking, Diversity	Unverified	0
Ego-Body Pose Estimation via Ego-Head Pose Estimation	Dec 9, 2022	Benchmarking, Disentanglement	Code Available	1
On Distribution Grid Optimal Power Flow Development and Integration	Dec 9, 2022	Benchmarking	Unverified	0
Benchmarking Self-Supervised Learning on Diverse Pathology Datasets	Dec 9, 2022	Benchmarking, Classification	Code Available	1
Is Bio-Inspired Learning Better than Backprop? Benchmarking Bio Learning vs. Backprop	Dec 9, 2022	Benchmarking	Unverified	0
Model-based trajectory stitching for improved behavioural cloning and its applications	Dec 8, 2022	Behavioural cloning, Benchmarking	Unverified	0
CODEBench: A Neural Architecture and Hardware Accelerator Co-Design Framework	Dec 7, 2022	Benchmarking	Code Available	1
An open unified deep graph learning framework for discovering drug leads	Dec 6, 2022	Benchmarking, Drug Discovery	Code Available	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified