SOTAVerified

Natural Language Queries

Papers

Showing 2650 of 337 papers

TitleStatusHype
Knowledge Graph Based Repository-Level Code Generation0
RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs0
VTS-LLM: Domain-Adaptive LLM Agent for Enhancing Awareness in Vessel Traffic Services through Natural LanguageCode0
Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs0
Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection0
Bridging the Semantic Gaps: Improving Medical VQA Consistency with LLM-Augmented Question Sets0
FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment0
Automated Construction of a Knowledge Graph of Nuclear Fusion Energy for Effective Elicitation and Retrieval of Information0
Zero-Shot Cross-Domain Code Search without Fine-TuningCode1
GeoRAG: A Question-Answering Approach from a Geographical Perspective0
FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal ReasoningCode2
EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing0
Retrieving Time-Series Differences Using Natural Language Queries0
GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics0
Enhancing Retrieval Systems with Inference-Time Logical Reasoning0
nvBench 2.0: A Benchmark for Natural Language to Visualization under Ambiguity0
Datrics Text2SQL. A Framework for Natural Language to SQL Query GenerationCode2
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method0
SQLCritic: Correcting Text-to-SQL Generation via Clause-wise Critic0
Improving Access to Trade and Investment Information in Thailand through Intelligent Document Retrieval0
DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL0
AILS-NTUA at SemEval-2025 Task 8: Language-to-Code prompting and Error Fixing for Tabular Question AnsweringCode0
Time-MQA: Time Series Multi-Task Question Answering with Context Enhancement0
QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries0
A Socratic RAG Approach to Connect Natural Language Queries on Research Topics with Knowledge Organization Systems0
Show:102550
← PrevPage 2 of 14Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EgoVideoR@1 Mean(0.3 and 0.5)23.68Unverified
2DeCafNet-100%R@1 Mean(0.3 and 0.5)18.86Unverified
3DeCafNet-50%R@1 Mean(0.3 and 0.5)17.93Unverified
4RGNetR@1 Mean(0.3 and 0.5)16.55Unverified
5DeCafNet-50% (no NaQ)R@1 Mean(0.3 and 0.5)15.32Unverified
6InternVideoR@1 Mean(0.3 and 0.5)13.26Unverified
7EgoVLPv2R@1 IoU=0.312.95Unverified
8UniMD+Sync.R@1 Mean(0.3 and 0.5)12.11Unverified
9ReLER@ZJU-AlibabaR@1 Mean(0.3 and 0.5)10.52Unverified
10EgoVLPR@1 Mean(0.3 and 0.5)8.35Unverified