SOTAVerified

Benchmarking

Papers

Showing 4251–4300 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling | | 0 |
| Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI | | 0 |
| Understanding Foundation Models: Are We Back in 1924? | | 0 |
| Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems | | 0 |
| Understanding Recurrent Neural Architectures by Analyzing and Synthesizing Long Distance Dependencies in Benchmark Sequential Datasets | | 0 |
| Understanding the Limits of Lifelong Knowledge Editing in LLMs | | 0 |
| Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective | | 0 |
| Understanding the User: An Intent-Based Ranking Dataset | | 0 |
| Uniform Discretized Integrated Gradients: An effective attribution based method for explaining large language models | | 0 |
| Unifying Few- and Zero-Shot Egocentric Action Recognition | | 0 |
| UniIR: Training and Benchmarking Universal Multimodal Information Retrievers | | 0 |
| Uni-Render: A Unified Accelerator for Real-Time Rendering Across Diverse Neural Renderers | | 0 |
| Unitail: Detecting, Reading, and Matching in Retail Scene | | 0 |
| Unleashing OpenTitan's Potential: a Silicon-Ready Embedded Secure Element for Root of Trust and Cryptographic Offloading | | 0 |
| Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research | | 0 |
| Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering | | 0 |
| Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering | | 0 |
| UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | | 0 |
| UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images | | 0 |
| Unsupervised Deep Epipolar Flow for Stationary or Dynamic Scenes | | 0 |
| Unsupervised Feature Learning for Environmental Sound Classification Using Weighted Cycle-Consistent Generative Adversarial Network | | 0 |
| Unsupervised Hierarchical Grouping of Knowledge Graph Entities | | 0 |
| Unsupervised Learning of 3D Object Categories from Videos in the Wild | | 0 |
| Unsupervised machine learning approach for building composite indicators with fuzzy metrics | | 0 |
| Unsupervised Person Re-identification by Deep Learning Tracklet Association | | 0 |
| Unsupervised Single Image Deraining with Self-supervised Constraints | | 0 |
| Unsupervised Spectral Demosaicing with Lightweight Spectral Attention Networks | | 0 |
| Unsupervised Synthetic Image Refinement via Contrastive Learning and Consistent Semantic-Structural Constraints | | 0 |
| Unveiling the potential of large language models in generating semantic and cross-language clones | | 0 |
| UPREVE: An End-to-End Causal Discovery Benchmarking System | | 0 |
| Urania: Differentially Private Insights into AI Use | | 0 |
| UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces | | 0 |
| Use of Deep Neural Networks for Uncertain Stress Functions with Extensions to Impact Mechanics | | 0 |
| User Profile with Large Language Models: Construction, Updating, and Benchmarking | | 0 |
| Using Affine Combinations of BBOB Problems for Performance Assessment | | 0 |
| Using generative adversarial networks to synthesize artificial financial datasets | | 0 |
| Using Multi-Temporal Sentinel-1 and Sentinel-2 data for water bodies mapping | | 0 |
| Using Neural Architecture Search for Improving Software Flaw Detection in Multimodal Deep Learning Models | | 0 |
| Using PCA to Efficiently Represent State Spaces | | 0 |
| Using Regular Languages to Explore the Representational Capacity of Recurrent Neural Architectures | | 0 |
| Using Well-Understood Single-Objective Functions in Multiobjective Black-Box Optimization Test Suites | | 0 |
| uTHCD: A New Benchmarking for Tamil Handwritten OCR | | 0 |
| Utility-Optimized Synthesis of Differentially Private Location Traces | | 0 |
| Validation of neural spike sorting algorithms without ground-truth information | | 0 |
| Value-at-Risk-Based Portfolio Insurance: Performance Evaluation and Benchmarking Against CPPI in a Markov-Modulated Regime-Switching Market | | 0 |
| Varco Arena: A Tournament Approach to Reference-Free Benchmarking Large Language Models | | 0 |
| Variational Laplace for Bayesian neural networks | | 0 |
| Variational Quantum Circuits Enhanced Generative Adversarial Network | | 0 |
| Parametrized quantum policies for reinforcement learning | | 0 |
| Policy Gradients using Variational Quantum Circuits | | 0 |
Page 86 of 111

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |