SOTAVerified

Natural Language Queries

Papers

Showing 150 of 337 papers

TitleStatusHype
SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding0
Towards Probabilistic Question Answering Over Tabular Data0
A Modular Multitask Reasoning Framework Integrating Spatio-temporal Models and LLMs0
Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation0
Improving Personalized Search with Regularized Low-Rank Parameter UpdatesCode0
Technical Report for Argoverse2 Scenario Mining Challenges on Iterative Error Correction and Spatially-Aware Prompting0
MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding0
SEED: Enhancing Text-to-SQL Performance and Practical Usability Through Automatic Evidence GenerationCode1
OSGNet @ Ego4D Episodic Memory Challenge 2025Code1
DGMO: Training-Free Audio Source Separation through Diffusion-Guided Mask Optimization0
DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing ScenesCode2
A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy0
ACCESS DENIED INC: The First Benchmark Environment for Sensitivity AwarenessCode0
MGS3: A Multi-Granularity Self-Supervised Code Search Framework0
CoRet: Improved Retriever for Code Editing0
StreamLink: Large-Language-Model Driven Distributed Data Engineering System0
Text-Queried Audio Source Separation via Hierarchical Modeling0
Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework0
RefAV: Towards Planning-Centric Scenario MiningCode1
Structuring the Unstructured: A Multi-Agent System for Extracting and Querying Financial KPIs and Guidance0
LLM-Powered Agents for Navigating Venice's Historical Cadastre0
DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long VideosCode1
CRAFT: Training-Free Cascaded Retrieval for Tabular QA0
Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex0
RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation0
Knowledge Graph Based Repository-Level Code Generation0
RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs0
VTS-LLM: Domain-Adaptive LLM Agent for Enhancing Awareness in Vessel Traffic Services through Natural LanguageCode0
Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs0
Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection0
Bridging the Semantic Gaps: Improving Medical VQA Consistency with LLM-Augmented Question Sets0
FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment0
Automated Construction of a Knowledge Graph of Nuclear Fusion Energy for Effective Elicitation and Retrieval of Information0
Zero-Shot Cross-Domain Code Search without Fine-TuningCode1
GeoRAG: A Question-Answering Approach from a Geographical Perspective0
FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal ReasoningCode2
EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing0
Retrieving Time-Series Differences Using Natural Language Queries0
GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics0
Enhancing Retrieval Systems with Inference-Time Logical Reasoning0
nvBench 2.0: A Benchmark for Natural Language to Visualization under Ambiguity0
Datrics Text2SQL. A Framework for Natural Language to SQL Query GenerationCode2
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method0
SQLCritic: Correcting Text-to-SQL Generation via Clause-wise Critic0
Improving Access to Trade and Investment Information in Thailand through Intelligent Document Retrieval0
DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL0
AILS-NTUA at SemEval-2025 Task 8: Language-to-Code prompting and Error Fixing for Tabular Question AnsweringCode0
Time-MQA: Time Series Multi-Task Question Answering with Context Enhancement0
QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries0
A Socratic RAG Approach to Connect Natural Language Queries on Research Topics with Knowledge Organization Systems0
Show:102550
← PrevPage 1 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EgoVideoR@1 Mean(0.3 and 0.5)23.68Unverified
2DeCafNet-100%R@1 Mean(0.3 and 0.5)18.86Unverified
3DeCafNet-50%R@1 Mean(0.3 and 0.5)17.93Unverified
4RGNetR@1 Mean(0.3 and 0.5)16.55Unverified
5DeCafNet-50% (no NaQ)R@1 Mean(0.3 and 0.5)15.32Unverified
6InternVideoR@1 Mean(0.3 and 0.5)13.26Unverified
7EgoVLPv2R@1 IoU=0.312.95Unverified
8UniMD+Sync.R@1 Mean(0.3 and 0.5)12.11Unverified
9ReLER@ZJU-AlibabaR@1 Mean(0.3 and 0.5)10.52Unverified
10EgoVLPR@1 Mean(0.3 and 0.5)8.35Unverified