Benchmarking

Papers


Showing 4001–4050 of 5548 papers

Title	Date	Tasks	Status
Hawk: An Industrial-strength Multi-label Document Classifier	Jan 15, 2023	Benchmarking, Document Classification	Unverified
Benchmarking Robustness in Neural Radiance Fields	Jan 10, 2023	Benchmarking, Camera Calibration	Unverified
Evaluating the Transferability of Machine-Learned Force Fields for Material Property Modeling	Jan 10, 2023	Benchmarking, Graph Neural Network	Code Available
Critical review of conformational B-cell epitope prediction methods	Jan 10, 2023	Benchmarking, Drug Design	Code Available
Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture	Jan 9, 2023	Avg, Benchmarking	Unverified
AERF: Adaptive ensemble random fuzzy algorithm for anomaly detection in cloud computing	Jan 9, 2023	Anomaly Detection, Benchmarking	Unverified
"It's a Match!" -- A Benchmark of Task Affinity Scores for Joint Learning	Jan 7, 2023	Benchmarking, Multi-Task Learning	Unverified
The Evolutionary Computation Methods No One Should Use	Jan 5, 2023	Benchmarking	Unverified
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions	Jan 5, 2023	Articles, Benchmarking	Code Available
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise	Jan 3, 2023	Benchmarking, Classification	Unverified
HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset	Jan 3, 2023	Benchmarking, Computed Tomography (CT)	Unverified
Improving Sequential Recommendation Models with an Enhanced Loss Function	Jan 3, 2023	Benchmarking, Recommendation Systems	Code Available
Tree Instance Segmentation With Temporal Contour Graph	Jan 1, 2023	Benchmarking, Instance Segmentation	Unverified
Comparison of tree-based ensemble algorithms for merging satellite and earth-observed precipitation data at the daily time scale	Dec 31, 2022	Benchmarking, regression	Unverified
4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Dec 31, 2022	Autonomous Driving, Benchmarking	Unverified
Biologically Plausible Learning on Neuromorphic Hardware Architectures	Dec 29, 2022	Benchmarking, Quantization	Unverified
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing	Dec 27, 2022	Benchmarking, Semantic Parsing	Unverified
Quality at the Tail of Machine Learning Inference	Dec 25, 2022	Autonomous Driving, Benchmarking	Unverified
Benchmarking Machine Learning Models to Predict Corporate Bankruptcy	Dec 22, 2022	Benchmarking	Unverified
A Seven-Layer Model for Standardising AI Fairness Assessment	Dec 21, 2022	Benchmarking, Fairness	Unverified
Distributed Software-Defined Network Architecture for Smart Grid Resilience to Denial-of-Service Attacks	Dec 20, 2022	Benchmarking	Unverified
AI applications in forest monitoring need remote sensing benchmark datasets	Dec 20, 2022	Benchmarking	Unverified
AnyTOD: A Programmable Task-Oriented Dialog System	Dec 20, 2022	Benchmarking, Language Modeling	Unverified
Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias	Dec 20, 2022	Benchmarking	Code Available
Benchmarking person re-identification datasets and approaches for practical real-world implementations	Dec 20, 2022	Benchmarking, Pedestrian Detection	Code Available
Trial-Based Dominance Enables Non-Parametric Tests to Compare both the Speed and Accuracy of Stochastic Optimizers	Dec 19, 2022	Benchmarking, Stochastic Optimization	Unverified
GiCCS: A German in-Context Conversational Similarity Benchmark	Dec 16, 2022	Benchmarking, Semantic Textual Similarity	Unverified
Biomedical image analysis competitions: The state of current participation practice	Dec 16, 2022	Benchmarking, Survey	Unverified
Automatic vehicle trajectory data reconstruction at scale	Dec 15, 2022	Benchmarking, vehicle detection	Unverified
Momentum Contrastive Pre-training for Question Answering	Dec 12, 2022	Benchmarking, Contrastive Learning	Unverified
Mind the Retrosynthesis Gap: Bridging the divide between Single-step and Multi-step Retrosynthesis Prediction	Dec 12, 2022	Benchmarking, Multi-step retrosynthesis	Unverified
Progressive Multi-view Human Mesh Recovery with Self-Supervision	Dec 10, 2022	Benchmarking, Diversity	Unverified
Is Bio-Inspired Learning Better than Backprop? Benchmarking Bio Learning vs. Backprop	Dec 9, 2022	Benchmarking	Unverified
On Distribution Grid Optimal Power Flow Development and Integration	Dec 9, 2022	Benchmarking	Unverified
Model-based trajectory stitching for improved behavioural cloning and its applications	Dec 8, 2022	Behavioural cloning, Benchmarking	Unverified
An open unified deep graph learning framework for discovering drug leads	Dec 6, 2022	Benchmarking, Drug Discovery	Code Available
Benchmarking AutoML algorithms on a collection of synthetic classification problems	Dec 6, 2022	AutoML, Benchmarking	Code Available
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation	Dec 5, 2022	Benchmarking, Binary Classification	Unverified
INCLUSIFY: A benchmark and a model for gender-inclusive German	Dec 5, 2022	Benchmarking	Unverified
DFEE: Interactive DataFlow Execution and Evaluation Kit	Dec 4, 2022	Benchmarking, Scheduling	Code Available
Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery	Dec 3, 2022	Benchmarking, Representation Learning	Unverified
Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking	Dec 2, 2022	Benchmarking, Information Retrieval	Unverified
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search	Dec 1, 2022	Benchmarking, GPU	Code Available
Device Modeling Bias in ReRAM-based Neural Network Simulations	Nov 29, 2022	Benchmarking	Unverified
BBOB Instance Analysis: Landscape Properties and Algorithm Performance across Problem Instances	Nov 29, 2022	Benchmarking	Unverified
A Boosting Approach to Constructing an Ensemble Stack	Nov 28, 2022	Benchmarking, Ensemble Learning	Unverified
Tackling Visual Control via Multi-View Exploration Maximization	Nov 28, 2022	Benchmarking, Reinforcement Learning (RL)	Unverified
Predicting Football Match Outcomes with eXplainable Machine Learning and the Kelly Index	Nov 28, 2022	Benchmarking	Unverified
Benchmarking simulated and physical quantum processing units using quantum and hybrid algorithms	Nov 28, 2022	Benchmarking	Unverified
Efficient Demand Response Location Targeting for Price Spike Mitigation by Exploiting Price-demand Relationship	Nov 27, 2022	Benchmarking	Unverified


Page 81 of 111

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified