SOTAVerified

Natural Language Queries

Papers

Showing 81-90 of 337 papers

| Title | Status | Hype |
|---|---|---|
| A Modular Multitask Reasoning Framework Integrating Spatio-temporal Models and LLMs | | 0 |
| Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation | | 0 |
| Improving Personalized Search with Regularized Low-Rank Parameter Updates | Code | 0 |
| MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding | | 0 |
| Technical Report for Argoverse2 Scenario Mining Challenges on Iterative Error Correction and Spatially-Aware Prompting | | 0 |
| DGMO: Training-Free Audio Source Separation through Diffusion-Guided Mask Optimization | | 0 |
| ACCESS DENIED INC: The First Benchmark Environment for Sensitivity Awareness | Code | 0 |
| A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy | | 0 |
| MGS3: A Multi-Granularity Self-Supervised Code Search Framework | | 0 |
| CoRet: Improved Retriever for Code Editing | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | EgoVideo | R@1 Mean(0.3 and 0.5) | 23.68 | | Unverified |
| 2 | DeCafNet-100% | R@1 Mean(0.3 and 0.5) | 18.86 | | Unverified |
| 3 | DeCafNet-50% | R@1 Mean(0.3 and 0.5) | 17.93 | | Unverified |
| 4 | RGNet | R@1 Mean(0.3 and 0.5) | 16.55 | | Unverified |
| 5 | DeCafNet-50% (no NaQ) | R@1 Mean(0.3 and 0.5) | 15.32 | | Unverified |
| 6 | InternVideo | R@1 Mean(0.3 and 0.5) | 13.26 | | Unverified |
| 7 | EgoVLPv2 | R@1 IoU=0.3 | 12.95 | | Unverified |
| 8 | UniMD+Sync. | R@1 Mean(0.3 and 0.5) | 12.11 | | Unverified |
| 9 | ReLER@ZJU-Alibaba | R@1 Mean(0.3 and 0.5) | 10.52 | | Unverified |
| 10 | EgoVLP | R@1 Mean(0.3 and 0.5) | 8.35 | | Unverified |