| A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going? | Aug 9, 2024 | Natural Language Queries, Text to SQL | Code Available | 5 |
| OpenAGI: When LLM Meets Domain Experts | Apr 10, 2023 | Benchmarking, Natural Language Queries | Code Available | 4 |
| Separate Anything You Describe | Aug 9, 2023 | Audio Source Separation, Natural Language Queries | Code Available | 3 |
| DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes | Jun 2, 2025 | Natural Language Queries, Navigate | Code Available | 2 |
| FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning | Apr 1, 2025 | Audio-visual Question Answering, Audio-Visual Question Answering (AVQA) | Code Available | 2 |
| Datrics Text2SQL. A Framework for Natural Language to SQL Query Generation | Mar 15, 2025 | Natural Language Queries, RAG | Code Available | 2 |
| SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL | Feb 17, 2025 | Few-Shot Learning, Heuristic Search | Code Available | 2 |
| E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQL | Sep 25, 2024 | Natural Language Queries, Text to SQL | Code Available | 2 |
| Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs | Aug 6, 2024 | Knowledge Graphs, Natural Language Queries | Code Available | 2 |
| EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation | Jun 26, 2024 | Action Anticipation, Action Recognition | Code Available | 2 |
| Query2CAD: Generating CAD models using natural language queries | May 31, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection | Apr 7, 2024 | Action Detection, Moment Queries | Code Available | 2 |
| TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection | Jan 4, 2024 | Highlight Detection, Moment Retrieval | Code Available | 2 |
| Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Dec 6, 2023 | 3DGS, 3D scene Editing | Code Available | 2 |
| LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent | Sep 21, 2023 | 3D visual grounding, Language Modeling | Code Available | 2 |
| Query-Dependent Video Representation for Moment Retrieval and Highlight Detection | Mar 24, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 |
| Egocentric Video-Language Pretraining | Jun 3, 2022 | Action Recognition, Contrastive Learning | Code Available | 2 |
| UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection | Mar 23, 2022 | Decoder, Highlight Detection | Code Available | 2 |
| TableQuery: Querying tabular data with natural language | Jan 27, 2022 | Deep Learning, Natural Language Queries | Code Available | 2 |
| SEED: Enhancing Text-to-SQL Performance and Practical Usability Through Automatic Evidence Generation | Jun 9, 2025 | Natural Language Queries, Text to SQL | Code Available | 1 |
| OSGNet @ Ego4D Episodic Memory Challenge 2025 | Jun 4, 2025 | Moment Queries, Natural Language Queries | Code Available | 1 |
| RefAV: Towards Planning-Centric Scenario Mining | May 27, 2025 | Autonomous Vehicles, Motion Planning | Code Available | 1 |
| DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos | May 22, 2025 | Natural Language Moment Retrieval, Natural Language Queries | Code Available | 1 |
| Zero-Shot Cross-Domain Code Search without Fine-Tuning | Apr 10, 2025 | Code Search, Natural Language Queries | Code Available | 1 |
| TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models | Dec 7, 2024 | Chatbot, Natural Language Queries | Code Available | 1 |
| CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning | Oct 14, 2024 | Math, Mathematical Reasoning | Code Available | 1 |
| Saliency-Guided DETR for Moment Retrieval and Highlight Detection | Oct 2, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 |
| RoundTable: Leveraging Dynamic Schema and Contextual Autocomplete for Enhanced Query Precision in Tabular Question Answering | Aug 22, 2024 | Natural Language Queries, Question Answering | Code Available | 1 |
| MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval | Aug 20, 2024 | Mamba, Natural Language Queries | Code Available | 1 |
| H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables | Jun 29, 2024 | Fact Verification, Mathematical Reasoning | Code Available | 1 |
| BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain | Jun 12, 2024 | Natural Language Queries, Text to SQL | Code Available | 1 |
| Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering | Apr 18, 2024 | Audio-visual Question Answering, Audio-Visual Question Answering (AVQA) | Code Available | 1 |
| S3LLM: Large-Scale Scientific Software Understanding with LLMs using Source, Metadata, and Document | Mar 15, 2024 | Natural Language Queries, RAG | Code Available | 1 |
| PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models | Mar 13, 2024 | Image Retrieval, Natural Language Queries | Code Available | 1 |
| Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search | Jan 9, 2024 | Code Generation, Code Search | Code Available | 1 |
| RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos | Dec 11, 2023 | Natural Language Moment Retrieval, Natural Language Queries | Code Available | 1 |
| R^3-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL | Nov 3, 2023 | Knowledge Graphs, Natural Language Queries | Code Available | 1 |
| Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving | Oct 3, 2023 | Autonomous Driving, Decision Making | Code Available | 1 |
| Enhancing Network Management Using Code Generated by Large Language Models | Aug 11, 2023 | Management, Natural Language Queries | Code Available | 1 |
| EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone | Jul 11, 2023 | Action Recognition, Moment Queries | Code Available | 1 |
| GroundNLQ @ Ego4D Natural Language Queries Challenge 2023 | Jun 27, 2023 | Natural Language Queries | Code Available | 1 |
| Backdooring Neural Code Search | May 27, 2023 | Autonomous Driving, Code Search | Code Available | 1 |
| Joint Moment Retrieval and Highlight Detection Via Natural Language Queries | May 8, 2023 | Decoder, Highlight Detection | Code Available | 1 |
| V3CTRON \| Data Retrieval & Access System For Flexible Semantic Search & Retrieval Of Proprietary Document Collections Using Natural Language Queries. | Apr 26, 2023 | Conversational Search, Information Retrieval | Code Available | 1 |
| NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory | Jan 2, 2023 | Data Augmentation, Natural Language Queries | Code Available | 1 |
| InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges | Nov 17, 2022 | Future Hand Prediction, Moment Queries | Code Available | 1 |
| YORO -- Lightweight End to End Visual Grounding | Nov 15, 2022 | Natural Language Queries, Visual Grounding | Code Available | 1 |
| ReaRev: Adaptive Reasoning for Question Answering over Knowledge Graphs | Oct 24, 2022 | Graph Question Answering, Knowledge Graphs | Code Available | 1 |
| Weakly-Supervised Temporal Article Grounding | Oct 22, 2022 | All, Articles | Code Available | 1 |
| SpCQL: A Semantic Parsing Dataset for Converting Natural Language into Cypher | Oct 17, 2022 | Natural Language Queries, Semantic Parsing | Code Available | 1 |