SOTAVerified

Benchmarking

Papers

Showing 801825 of 5548 papers

TitleStatusHype
A SWAT-based Reinforcement Learning Framework for Crop ManagementCode1
Continual Learning with Foundation Models: An Empirical Study of Latent ReplayCode1
FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World ConditionsCode1
ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate ModelsCode1
Benchmarking AI scientists in omics data-driven biological researchCode1
FTNet: Feature Transverse Network for Thermal Image Semantic SegmentationCode1
Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization CorrelationsCode1
Benchmarking Algorithms for Federated Domain GeneralizationCode1
Benchmarking Algorithms for Submodular Optimization Problems Using IOHProfilerCode1
GAMA: a General Automated Machine learning AssistantCode1
ClearPose: Large-scale Transparent Object Dataset and BenchmarkCode1
Coarse-to-Fine Q-attention with Learned Path RankingCode1
Benchmarking and Analysis of Unsupervised Object Segmentation from Real-world Single ImagesCode1
Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized CodebaseCode1
Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond AlgorithmsCode1
A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular DataCode1
Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative AgentsCode1
Benchmarking and Analyzing Point Cloud Classification under CorruptionsCode1
Benchmarking and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial ExamplesCode1
Generative and reproducible benchmarks for comprehensive evaluation of machine learning classifiersCode1
Generative Evaluation of Complex Reasoning in Large Language ModelsCode1
Generative Wind Power Curve Modeling Via Machine Vision: A Self-learning Deep Convolutional Network Based MethodCode1
Depth-Driven Geometric Prompt Learning for Laparoscopic Liver Landmark DetectionCode1
Evaluating Multimodal Representations on Visual Semantic Textual SimilarityCode1
CHILI: Chemically-Informed Large-scale Inorganic Nanomaterials Dataset for Advancing Graph Machine LearningCode1
Show:102550
← PrevPage 33 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified