SOTAVerified

Benchmarking

Papers

Showing 42514275 of 5548 papers

TitleStatusHype
Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling0
Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI0
Understanding Foundation Models: Are We Back in 1924?0
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems0
Understanding Recurrent Neural Architectures by Analyzing and Synthesizing Long Distance Dependencies in Benchmark Sequential Datasets0
Understanding the Limits of Lifelong Knowledge Editing in LLMs0
Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective0
Understanding the User: An Intent-Based Ranking Dataset0
Uniform Discretized Integrated Gradients: An effective attribution based method for explaining large language models0
Unifying Few- and Zero-Shot Egocentric Action Recognition0
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers0
Uni-Render: A Unified Accelerator for Real-Time Rendering Across Diverse Neural Renderers0
Unitail: Detecting, Reading, and Matching in Retail Scene0
Unleashing OpenTitan's Potential: a Silicon-Ready Embedded Secure Element for Root of Trust and Cryptographic Offloading0
Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research0
Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering0
Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering0
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI0
UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images0
Unsupervised Deep Epipolar Flow for Stationary or Dynamic Scenes0
Unsupervised Feature Learning for Environmental Sound Classification Using Weighted Cycle-Consistent Generative Adversarial Network0
Unsupervised Hierarchical Grouping of Knowledge Graph Entities0
Unsupervised Learning of 3D Object Categories from Videos in the Wild0
Unsupervised machine learning approach for building composite indicators with fuzzy metrics0
Unsupervised Person Re-identification by Deep Learning Tracklet Association0
Show:102550
← PrevPage 171 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified