SOTAVerified

Benchmarking

Papers

Showing 4226-4250 of 5548 papers

Title | Status | Hype
Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning | | 0
Translation Canvas: An Explainable Interface to Pinpoint and Analyze Translation Systems | | 0
TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation | | 0
TransOpt: Transformer-based Representation Learning for Optimization Problem Classification | | 0
TransportationGames: Benchmarking Transportation Knowledge of (Multimodal) Large Language Models | | 0
Treatment Learning Causal Transformer for Noisy Image Classification | | 0
Tree Instance Segmentation With Temporal Contour Graph | | 0
Trial-Based Dominance Enables Non-Parametric Tests to Compare both the Speed and Accuracy of Stochastic Optimizers | | 0
Trident: Efficient 4PC Framework for Privacy Preserving Machine Learning | | 0
TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images | | 0
Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms | | 0
True Online TD-Replan(lambda) Achieving Planning through Replaying | | 0
Trust but Verify: Programmatic VLM Evaluation in the Wild | | 0
TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations | | 0
Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data | | 0
U2-BENCH: Benchmarking Large Vision-Language Models on Ultrasound Understanding | | 0
UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning | | 0
UAV Immersive Video Streaming: A Comprehensive Survey, Benchmarking, and Open Challenges | | 0
UCCIX: Irish-eXcellence Large Language Model | | 0
UCLID-Net: Single View Reconstruction in Object Space | | 0
UDTIRI: An Online Open-Source Intelligent Road Inspection Benchmark Suite | | 0
UGSL: A Unified Framework for Benchmarking Graph Structure Learning | | 0
UKAN: Unbound Kolmogorov-Arnold Network Accompanied with Accelerated Library | | 0
Unbounded Bayesian Optimization via Regularization | | 0
Uncertainty estimation for Cross-dataset performance in Trajectory prediction | | 0
Page 170 of 222

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified