SOTAVerified

Benchmarking

Papers

Showing 40014050 of 5548 papers

TitleStatusHype
Hawk: An Industrial-strength Multi-label Document Classifier0
Benchmarking Robustness in Neural Radiance Fields0
Evaluating the Transferability of Machine-Learned Force Fields for Material Property ModelingCode0
Critical review of conformational B-cell epitope prediction methodsCode0
Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture0
AERF: Adaptive ensemble random fuzzy algorithm for anomaly detection in cloud computing0
"It's a Match!" -- A Benchmark of Task Affinity Scores for Joint Learning0
The Evolutionary Computation Methods No One Should Use0
ANNA: Abstractive Text-to-Image Synthesis with Filtered News CaptionsCode0
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise0
HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset0
Improving Sequential Recommendation Models with an Enhanced Loss FunctionCode0
Tree Instance Segmentation With Temporal Contour Graph0
Comparison of tree-based ensemble algorithms for merging satellite and earth-observed precipitation data at the daily time scale0
4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions0
Biologically Plausible Learning on Neuromorphic Hardware Architectures0
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing0
Quality at the Tail of Machine Learning Inference0
Benchmarking Machine Learning Models to Predict Corporate Bankruptcy0
A Seven-Layer Model for Standardising AI Fairness Assessment0
Distributed Software-Defined Network Architecture for Smart Grid Resilience to Denial-of-Service Attacks0
AI applications in forest monitoring need remote sensing benchmark datasets0
AnyTOD: A Programmable Task-Oriented Dialog System0
Causally Testing Gender Bias in LLMs: A Case Study on Occupational BiasCode0
Benchmarking person re-identification datasets and approaches for practical real-world implementationsCode0
Trial-Based Dominance Enables Non-Parametric Tests to Compare both the Speed and Accuracy of Stochastic Optimizers0
GiCCS: A German in-Context Conversational Similarity Benchmark0
Biomedical image analysis competitions: The state of current participation practice0
Automatic vehicle trajectory data reconstruction at scale0
Momentum Contrastive Pre-training for Question Answering0
Mind the Retrosynthesis Gap: Bridging the divide between Single-step and Multi-step Retrosynthesis Prediction0
Progressive Multi-view Human Mesh Recovery with Self-Supervision0
Is Bio-Inspired Learning Better than Backprop? Benchmarking Bio Learning vs. Backprop0
On Distribution Grid Optimal Power Flow Development and Integration0
Model-based trajectory stitching for improved behavioural cloning and its applications0
An open unified deep graph learning framework for discovering drug leadsCode0
Benchmarking AutoML algorithms on a collection of synthetic classification problemsCode0
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation0
INCLUSIFY: A benchmark and a model for gender-inclusive German0
DFEE: Interactive DataFlow Execution and Evaluation KitCode0
Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery0
Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking0
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture SearchCode0
Device Modeling Bias in ReRAM-based Neural Network Simulations0
BBOB Instance Analysis: Landscape Properties and Algorithm Performance across Problem Instances0
A Boosting Approach to Constructing an Ensemble Stack0
Tackling Visual Control via Multi-View Exploration Maximization0
Predicting Football Match Outcomes with eXplainable Machine Learning and the Kelly Index0
Benchmarking simulated and physical quantum processing units using quantum and hybrid algorithms0
Efficient Demand Response Location Targeting for Price Spike Mitigation by Exploiting Price-demand Relationship0
Show:102550
← PrevPage 81 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified