SOTAVerified

Benchmarking

Papers

Showing 41014150 of 5548 papers

TitleStatusHype
FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks0
Conditional Neural Processes for Molecules0
DyFEn: Agent-Based Fee Setting in Payment Channel Networks0
A Survey of Parameters Associated with the Quality of Benchmarks in NLP0
TweetNERD -- End to End Entity Linking Benchmark for TweetsCode0
Benchmarking Long-tail Generalization with Likelihood SplitsCode0
OpenOOD: Benchmarking Generalized Out-of-Distribution DetectionCode0
Simulated Contextual Bandits for Personalization Tasks from Recommendation DatasetsCode0
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems0
Vote'n'Rank: Revision of Benchmarking with Social Choice TheoryCode0
A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing0
Quantifying Social Biases Using Templates is Unreliable0
Are All Steps Equally Important? Benchmarking Essentiality Detection of Events0
Is margin all you need? An extensive empirical study of active learning on tabular data0
A Theory of Dynamic Benchmarks0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)Code0
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data0
MEDFAIR: Benchmarking Fairness for Medical ImagingCode0
Detection and Evaluation of Clusters within Sequential Data0
A Framework for Large Scale Synthetic Graph Dataset Generation0
Benchmarking Learnt Radio Localisation under Distribution Shift0
The current state of single-cell proteomics data analysisCode0
DELAD: Deep Landweber-guided deconvolution with Hessian and sparse prior0
Benchmarking Learning Efficiency in Deep Reservoir ComputingCode0
Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video GroundingCode0
Deep Feature Selection Using a Novel Complementary Feature Mask0
Feature Encodings for Gradient Boosting with Automunge0
Removal of Ocular Artifacts in EEG Using Deep Learning0
How Good Is Neural Combinatorial Optimization? A Systematic Evaluation on the Traveling Salesman Problem0
Periodic Extrapolative Generalisation in Neural NetworksCode0
Progressive with Purpose: Guiding Progressive Inpainting DNNs through Context and Structure0
Benchmarking Apache Spark and Hadoop MapReduce on Big Data ClassificationCode0
Benchmarking energy consumption and latency for neuromorphic computing in condensed matter and particle physics0
FACT: Learning Governing Abstractions Behind Integer Sequences0
Feature embedding in click-through rate predictionCode0
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning0
Skills and Liquidity Barriers to Youth Employment: Medium-term Evidence from a Cash Benchmarking Experiment in Rwanda0
LAVIS: A Library for Language-Vision Intelligence0
Is Synthetic Dataset Reliable for Benchmarking Generalizable Person Re-Identification?0
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning0
Application of Machine Learning for Online Reputation Systems0
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN ParametersCode0
Improving plant disease classification by adaptive minimal ensembling0
RF Fingerprinting Needs Attention: Multi-task Approach for Real-World WiFi and Bluetooth0
Low Complexity Hybrid Beamforming for mmWave Full-Duplex Integrated Access and BackhaulCode0
Complexity of Representations in Deep Learning0
An evaluation framework for comparing causal inference models0
AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels0
Hardware-aware mobile building block evaluation for computer vision0
Benchmarking Human Face Similarity Using Identical Twins0
Show:102550
← PrevPage 83 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified