SOTAVerified

Benchmarking

Papers

Showing 25512560 of 5548 papers

TitleStatusHype
The ParClusterers Benchmark Suite (PCBS): A Fine-Grained Analysis of Scalable Graph Clustering0
WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery Benchmarking0
BEARD: Benchmarking the Adversarial Robustness for Dataset DistillationCode0
A survey of probabilistic generative frameworks for molecular simulationsCode0
HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere0
Anomaly Detection in Large-Scale Cloud Systems: An Industry Case and DatasetCode0
A Survey on Vision Autoregressive Model0
Evaluating the Generation of Spatial Relations in Text and Image Generative Models0
BuckTales : A multi-UAV dataset for multi-object tracking and re-identification of wild antelopes0
Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context EvaluationCode0
Show:102550
← PrevPage 256 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified