SOTAVerified

Benchmarking

Papers

Showing 3676–3700 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| FOR-instance: a UAV laser scanning benchmark dataset for semantic and instance segmentation of individual trees | | 0 |
| Holistic Dynamic Frequency Transformer for Image Fusion and Exposure Correction | | 0 |
| FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large Language Models in Federated Learning | | 0 |
| NeMig -- A Bilingual News Collection and Knowledge Graph about Migration | Code | 0 |
| Can humans help BERT gain "confidence"? | | 0 |
| Benchmarking Robustness and Generalization in Multi-Agent Systems: A Case Study on Neural MMO | | 0 |
| Benchmarking Multilabel Topic Classification in the Kyrgyz Language | Code | 0 |
| Speech Self-Supervised Representations Benchmarking: a Case for Larger Probing Heads | | 0 |
| Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models | | 0 |
| Beyond Document Page Classification: Design, Datasets, and Challenges | Code | 0 |
| Finding the Perfect Fit: Applying Regression Models to ClimateBench v1.0 | Code | 0 |
| Benchmarking Causal Study to Interpret Large Language Models for Source Code | | 0 |
| Efficient Benchmarking of Language Models | | 0 |
| Benchmarking Domain Adaptation for Chemical Processes on the Tennessee Eastman Process | Code | 0 |
| Beyond MD17: the reactive xxMD dataset | Code | 0 |
| Expecting The Unexpected: Towards Broad Out-Of-Distribution Detection | Code | 0 |
| UGSL: A Unified Framework for Benchmarking Graph Structure Learning | | 0 |
| Measuring the Effect of Causal Disentanglement on the Adversarial Robustness of Neural Network Models | | 0 |
| Neurological Prognostication of Post-Cardiac-Arrest Coma Patients Using EEG Data: A Dynamic Survival Analysis Framework with Competing Risks | Code | 0 |
| Benchmarking Adversarial Robustness of Compressed Deep Learning Models | | 0 |
| A Survey on Model Compression for Large Language Models | | 0 |
| IoT Data Trust Evaluation via Machine Learning | Code | 0 |
| Benchmarking Scalable Epistemic Uncertainty Quantification in Organ Segmentation | Code | 0 |
| Deep Neural Operator Driven Real Time Inference for Nuclear Systems to Enable Digital Twin Solutions | | 0 |
| Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields | | 0 |
Page 148 of 222

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |