Benchmarking

Papers


Showing 4001–4025 of 5548 papers

Title	Date	Tasks	Status
Hawk: An Industrial-strength Multi-label Document Classifier	Jan 15, 2023	Benchmarking, Document Classification	Unverified
Benchmarking Robustness in Neural Radiance Fields	Jan 10, 2023	Benchmarking, Camera Calibration	Unverified
Evaluating the Transferability of Machine-Learned Force Fields for Material Property Modeling	Jan 10, 2023	Benchmarking, Graph Neural Network	Code Available
Critical review of conformational B-cell epitope prediction methods	Jan 10, 2023	Benchmarking, Drug Design	Code Available
Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture	Jan 9, 2023	Avg, Benchmarking	Unverified
AERF: Adaptive ensemble random fuzzy algorithm for anomaly detection in cloud computing	Jan 9, 2023	Anomaly Detection, Benchmarking	Unverified
"It's a Match!" -- A Benchmark of Task Affinity Scores for Joint Learning	Jan 7, 2023	Benchmarking, Multi-Task Learning	Unverified
The Evolutionary Computation Methods No One Should Use	Jan 5, 2023	Benchmarking	Unverified
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions	Jan 5, 2023	Articles, Benchmarking	Code Available
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise	Jan 3, 2023	Benchmarking, Classification	Unverified
HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset	Jan 3, 2023	Benchmarking, Computed Tomography (CT)	Unverified
Improving Sequential Recommendation Models with an Enhanced Loss Function	Jan 3, 2023	Benchmarking, Recommendation Systems	Code Available
Tree Instance Segmentation With Temporal Contour Graph	Jan 1, 2023	Benchmarking, Instance Segmentation	Unverified
Comparison of tree-based ensemble algorithms for merging satellite and earth-observed precipitation data at the daily time scale	Dec 31, 2022	Benchmarking, regression	Unverified
4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Dec 31, 2022	Autonomous Driving, Benchmarking	Unverified
Biologically Plausible Learning on Neuromorphic Hardware Architectures	Dec 29, 2022	Benchmarking, Quantization	Unverified
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing	Dec 27, 2022	Benchmarking, Semantic Parsing	Unverified
Quality at the Tail of Machine Learning Inference	Dec 25, 2022	Autonomous Driving, Benchmarking	Unverified
Benchmarking Machine Learning Models to Predict Corporate Bankruptcy	Dec 22, 2022	Benchmarking	Unverified
A Seven-Layer Model for Standardising AI Fairness Assessment	Dec 21, 2022	Benchmarking, Fairness	Unverified
Distributed Software-Defined Network Architecture for Smart Grid Resilience to Denial-of-Service Attacks	Dec 20, 2022	Benchmarking	Unverified
AI applications in forest monitoring need remote sensing benchmark datasets	Dec 20, 2022	Benchmarking	Unverified
AnyTOD: A Programmable Task-Oriented Dialog System	Dec 20, 2022	Benchmarking, Language Modeling	Unverified
Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias	Dec 20, 2022	Benchmarking	Code Available
Benchmarking person re-identification datasets and approaches for practical real-world implementations	Dec 20, 2022	Benchmarking, Pedestrian Detection	Code Available



Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified