SOTAVerified

Natural Language Queries

Papers

Showing 150 of 337 papers

TitleStatusHype
A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?Code5
OpenAGI: When LLM Meets Domain ExpertsCode4
Separate Anything You DescribeCode3
UniMD: Towards Unifying Moment Retrieval and Temporal Action DetectionCode2
Query2CAD: Generating CAD models using natural language queriesCode2
DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing ScenesCode2
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQLCode2
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an AgentCode2
FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal ReasoningCode2
Datrics Text2SQL. A Framework for Natural Language to SQL Query GenerationCode2
Query-Dependent Video Representation for Moment Retrieval and Highlight DetectionCode2
Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge GraphsCode2
TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight DetectionCode2
Egocentric Video-Language PretrainingCode2
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature FieldsCode2
TableQuery: Querying tabular data with natural languageCode2
UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight DetectionCode2
EgoVideo: Exploring Egocentric Foundation Model and Downstream AdaptationCode2
E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQLCode2
Pseudo-Q: Generating Pseudo Language Queries for Visual GroundingCode1
PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal ModelsCode1
QVHighlights: Detecting Moments and Highlights in Videos via Natural Language QueriesCode1
NaQ: Leveraging Narrations as Queries to Supervise Episodic MemoryCode1
Multi-modal Transformer for Video RetrievalCode1
Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question AnsweringCode1
Neural Code Search Revisited: Enhancing Code Snippet Retrieval through Natural Language IntentCode1
Audio Retrieval with Natural Language QueriesCode1
Audio Retrieval with Natural Language Queries: A Benchmark StudyCode1
MUSE: Mamba is Efficient Multi-scale Learner for Text-video RetrievalCode1
OSGNet @ Ego4D Episodic Memory Challenge 2025Code1
R^3-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQLCode1
Inductive Entity Representations from Text via Link PredictionCode1
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on TablesCode1
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D ChallengesCode1
Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in VideosCode1
BookSQL: A Large Scale Text-to-SQL Dataset for Accounting DomainCode1
ESTER: A Machine Reading Comprehension Dataset for Event Semantic Relation ReasoningCode1
GraphSearchNet: Enhancing GNNs via Capturing Global Dependencies for Semantic Code SearchCode1
Joint Moment Retrieval and Highlight Detection Via Natural Language QueriesCode1
Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image SearchCode1
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the BackboneCode1
Enhancing Network Management Using Code Generated by Large Language ModelsCode1
GroundNLQ @ Ego4D Natural Language Queries Challenge 2023Code1
How Much Knowledge Can You Pack Into the Parameters of a Language Model?Code1
CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical ReasoningCode1
Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base EmbeddingsCode1
Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal GroundingCode1
Detecting Moments and Highlights in Videos via Natural Language QueriesCode1
Entity-aware Transformers for Entity SearchCode1
CoSQA: 20,000+ Web Queries for Code Search and Question AnsweringCode1
Show:102550
← PrevPage 1 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EgoVideoR@1 Mean(0.3 and 0.5)23.68Unverified
2DeCafNet-100%R@1 Mean(0.3 and 0.5)18.86Unverified
3DeCafNet-50%R@1 Mean(0.3 and 0.5)17.93Unverified
4RGNetR@1 Mean(0.3 and 0.5)16.55Unverified
5DeCafNet-50% (no NaQ)R@1 Mean(0.3 and 0.5)15.32Unverified
6InternVideoR@1 Mean(0.3 and 0.5)13.26Unverified
7EgoVLPv2R@1 IoU=0.312.95Unverified
8UniMD+Sync.R@1 Mean(0.3 and 0.5)12.11Unverified
9ReLER@ZJU-AlibabaR@1 Mean(0.3 and 0.5)10.52Unverified
10EgoVLPR@1 Mean(0.3 and 0.5)8.35Unverified