SOTAVerified

Benchmarking

Papers

Showing 1331-1340 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere | | 0 |
| A Survey on Vision Autoregressive Model | | 0 |
| FM-TS: Flow Matching for Time Series Generation | Code | 1 |
| Evaluating the Generation of Spatial Relations in Text and Image Generative Models | | 0 |
| Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation | Code | 0 |
| General Geospatial Inference with a Population Dynamics Foundation Model | Code | 3 |
| BuckTales : A multi-UAV dataset for multi-object tracking and re-identification of wild antelopes | | 0 |
| Benchmarking LLMs' Judgments with No Gold Standard | Code | 0 |
| Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantification | Code | 1 |
| MolMiner: Towards Controllable, 3D-Aware, Fragment-Based Molecular Design | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |