SOTAVerified

Benchmarking

Papers

Showing 13011325 of 5548 papers

TitleStatusHype
Benchmarking Differential Privacy and Federated Learning for BERT ModelsCode1
You are AllSet: A Multiset Function Framework for Hypergraph Neural NetworksCode1
Synthetic Benchmarks for Scientific Research in Explainable Machine LearningCode1
Mutual-Information Based Few-Shot ClassificationCode1
Underwater Image Restoration via Contrastive Learning and a Real-world DatasetCode1
Intrinsic Image HarmonizationCode1
Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic TestingCode1
Understanding and Evaluating Racial Biases in Image CaptioningCode1
Selection of Source Images Heavily Influences the Effectiveness of Adversarial AttacksCode1
Online Learning with Optimism and DelayCode1
Shades of BLEU, Flavours of Success: The Case of MultiWOZCode1
Signals to Spikes for Neuromorphic Regulated Reservoir Computing and EMG Hand Gesture RecognitionCode1
Benchmarking Bias Mitigation Algorithms in Representation Learning through Fairness MetricsCode1
RobustNav: Towards Benchmarking Robustness in Embodied NavigationCode1
EXPObench: Benchmarking Surrogate-based Optimisation Algorithms on Expensive Black-box FunctionsCode1
The Medkit-Learn(ing) Environment: Medical Decision Modelling through SimulationCode1
DFGC 2021: A DeepFake Game CompetitionCode1
FedScale: Benchmarking Model and System Performance of Federated Learning at ScaleCode1
Benchmarking the Performance of Bayesian Optimization across Multiple Experimental Materials Science DomainsCode1
Anabranch Network for Camouflaged Object SegmentationCode1
Multimodal Fusion via Teacher-Student Network for Indoor Action RecognitionCode1
DACBench: A Benchmark Library for Dynamic Algorithm ConfigurationCode1
Best practices for constructing, preparing, and evaluating protein-ligand binding affinity benchmarksCode1
A Reinforcement Learning Environment for Multi-Service UAV-enabled Wireless SystemsCode1
AnomalyHop: An SSL-based Image Anomaly Localization MethodCode1
Show:102550
← PrevPage 53 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified