SOTAVerified

Benchmarking

Papers

Showing 42114220 of 5548 papers

TitleStatusHype
Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models0
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks0
TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning0
Tracking Everything in Robotic-Assisted Surgery0
Training Mixed-Domain Translation Models via Federated Learning0
Training neural mapping schemes for satellite altimetry with simulation data0
Training Transformers with Enforced Lipschitz Constants0
Trajectory Normalized Gradients for Distributed Optimization0
TRAM: Benchmarking Temporal Reasoning for Large Language Models0
Transactive Local Energy Markets Enable Community-Level Resource Coordination Using Individual Rewards0
Show:102550
← PrevPage 422 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified