SOTAVerified

Benchmarking

Papers

Showing 4921-4930 of 5548 papers

Title | Status | Hype
Can LLMs perform structured graph reasoning? | Code | 0
Attention-based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object Detectors | Code | 0
Exploring Model-based Planning with Policy Networks | Code | 0
Exploring Context Generalizability in Citywide Crowd Mobility Prediction: An Analytic Framework and Benchmark | Code | 0
Multimodal Multi-User Surface Recognition with the Kernel Two-Sample Test | Code | 0
Exploiting Out-of-Domain Parallel Data through Multilingual Transfer Learning for Low-Resource Neural Machine Translation | Code | 0
Zero-shot generation of synthetic neurosurgical data with large language models | Code | 0
Benchmarking Pathology Foundation Models: Adaptation Strategies and Scenarios | Code | 0
Three Revisits to Node-Level Graph Anomaly Detection: Outliers, Message Passing and Hyperbolic Neural Networks | Code | 0
Multiple Instance Learning: A Survey of Problem Characteristics and Applications | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified