SOTAVerified

Benchmarking

Papers

Showing 12761300 of 5548 papers

TitleStatusHype
A framework for benchmarking clustering algorithmsCode1
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity QuantificationCode1
Hyperparameter optimization in deep multi-target predictionCode1
Comprehensive benchmarking of large language models for RNA secondary structure predictionCode1
Combinatorial Optimization with Policy Adaptation using Latent Space SearchCode1
Benchmarking Relief-Based Feature Selection Methods for Bioinformatics Data MiningCode1
Image Colorization: A Survey and DatasetCode1
Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantificationCode1
A SWAT-based Reinforcement Learning Framework for Crop ManagementCode1
AirSim Drone Racing LabCode1
Comics Datasets Framework: Mix of Comics datasets for detection benchmarkingCode1
Benchmarking Simulation-Based InferenceCode1
A Comprehensive Overview of Large Language ModelsCode1
A framework for benchmarking class-out-of-distribution detection and its application to ImageNetCode1
CombiBench: Benchmarking LLM Capability for Combinatorial MathematicsCode1
Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban IntersectionCode1
Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation ModelCode1
A Systematic Benchmarking Analysis of Transfer Learning for Medical Image AnalysisCode1
Improving and Benchmarking Offline Reinforcement Learning AlgorithmsCode1
CovDocker: Benchmarking Covalent Drug Design with Tasks, Datasets, and SolutionsCode1
DataRec: A Python Library for Standardized and Reproducible Data Management in Recommender SystemsCode1
CodeUpdateArena: Benchmarking Knowledge Editing on API UpdatesCode1
Geometric Deep Learning for Structure-Based Drug Design: A SurveyCode1
CodeS: Natural Language to Code Repository via Multi-Layer SketchCode1
CoDEx: A Comprehensive Knowledge Graph Completion BenchmarkCode1
Show:102550
← PrevPage 52 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified