SOTAVerified

Benchmarking

Papers

Showing 3701–3725 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Application of Machine Learning for Online Reputation Systems | | 0 |
| FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN Parameters | Code | 0 |
| Improving plant disease classification by adaptive minimal ensembling | | 0 |
| Benchmarking Multimodal Variational Autoencoders: CdSprites+ Dataset and Toolkit | Code | 1 |
| RF Fingerprinting Needs Attention: Multi-task Approach for Real-World WiFi and Bluetooth | | 0 |
| Low Complexity Hybrid Beamforming for mmWave Full-Duplex Integrated Access and Backhaul | Code | 0 |
| Structural Bias for Aspect Sentiment Triplet Extraction | Code | 1 |
| nnOOD: A Framework for Benchmarking Self-supervised Anomaly Localisation Methods | Code | 1 |
| Complexity of Representations in Deep Learning | | 0 |
| An evaluation framework for comparing causal inference models | | 0 |
| AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels | | 0 |
| Hardware-aware mobile building block evaluation for computer vision | | 0 |
| Benchmarking Human Face Similarity Using Identical Twins | | 0 |
| TEP-GNN: Accurate Execution Time Prediction of Functional Tests using Graph Neural Networks | | 0 |
| Towards Benchmarking Explainable Artificial Intelligence Methods | | 0 |
| Bugs in the Data: How ImageNet Misrepresents Biodiversity | Code | 0 |
| StEduCov: An Explored and Benchmarked Dataset on Stance Detection in Tweets towards Online Education during COVID-19 Pandemic | | 0 |
| MechProNet: Machine Learning Prediction of Mechanical Properties in Metal Additive Manufacturing | | 0 |
| SIM2E: Benchmarking the Group Equivariant Capability of Correspondence Matching Algorithms | | 0 |
| A biologically-inspired multi-modal evaluation of molecular generative machine learning | | 0 |
| Wildfire Forecasting with Satellite Images and Deep Generative Model | | 0 |
| Benchmarking Compositionality with Formal Languages | Code | 1 |
| MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation | Code | 2 |
| The Low Emission Oil&Gas Open (LEOGO) Reference Platform of an Off-Grid Energy System for Renewable Integration Studies | | 0 |
| Unsupervised machine learning approach for building composite indicators with fuzzy metrics | | 0 |
Page 149 of 222

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |