| RefAV: Towards Planning-Centric Scenario Mining | May 27, 2025 | Autonomous VehiclesMotion Planning | CodeCode Available | 1 | 5 |
| CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning | Oct 14, 2024 | MathMathematical Reasoning | CodeCode Available | 1 | 5 |
| Joint Moment Retrieval and Highlight Detection Via Natural Language Queries | May 8, 2023 | DecoderHighlight Detection | CodeCode Available | 1 | 5 |
| InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges | Nov 17, 2022 | Future Hand PredictionMoment Queries | CodeCode Available | 1 | 5 |
| R^3-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL | Nov 3, 2023 | Knowledge GraphsNatural Language Queries | CodeCode Available | 1 | 5 |
| Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal Grounding | Apr 4, 2022 | cross-modal alignmentNatural Language Queries | CodeCode Available | 1 | 5 |
| Audio Retrieval with Natural Language Queries: A Benchmark Study | Dec 17, 2021 | AudioCapsAudio captioning | CodeCode Available | 1 | 5 |
| QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries | Jul 20, 2021 | Highlight DetectionMoment Retrieval | CodeCode Available | 1 | 5 |
| ReaRev: Adaptive Reasoning for Question Answering over Knowledge Graphs | Oct 24, 2022 | Graph Question AnsweringKnowledge Graphs | CodeCode Available | 1 | 5 |
| Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering | Apr 18, 2024 | Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA) | CodeCode Available | 1 | 5 |
| ReLER@ZJU-Alibaba Submission to the Ego4D Natural Language Queries Challenge 2022 | Jul 1, 2022 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval | Aug 20, 2024 | MambaNatural Language Queries | CodeCode Available | 1 | 5 |
| Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding | Mar 16, 2022 | Language ModellingNatural Language Queries | CodeCode Available | 1 | 5 |
| DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos | May 22, 2025 | Natural Language Moment RetrievalNatural Language Queries | CodeCode Available | 1 | 5 |
| PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models | Mar 13, 2024 | Image RetrievalNatural Language Queries | CodeCode Available | 1 | 5 |
| CoSQA: 20,000+ Web Queries for Code Search and Question Answering | May 27, 2021 | Code SearchContrastive Learning | CodeCode Available | 1 | 5 |
| Backdooring Neural Code Search | May 27, 2023 | Autonomous DrivingCode Search | CodeCode Available | 1 | 5 |
| ESTER: A Machine Reading Comprehension Dataset for Event Semantic Relation Reasoning | Apr 16, 2021 | Machine Reading ComprehensionNatural Language Queries | CodeCode Available | 1 | 5 |
| Dense Regression Network for Video Grounding | Apr 7, 2020 | Natural Language Moment RetrievalNatural Language Queries | CodeCode Available | 1 | 5 |
| Detecting Moments and Highlights in Videos via Natural Language Queries | Dec 1, 2021 | DecoderMoment Retrieval | CodeCode Available | 1 | 5 |
| EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone | Jul 11, 2023 | Action RecognitionMoment Queries | CodeCode Available | 1 | 5 |
| OSGNet @ Ego4D Episodic Memory Challenge 2025 | Jun 4, 2025 | Moment QueriesNatural Language Queries | CodeCode Available | 1 | 5 |
| Retrieving Complex Tables with Multi-Granular Graph Representation Learning | May 4, 2021 | Graph Representation LearningNatural Language Queries | CodeCode Available | 1 | 5 |
| SpCQL: A Semantic Parsing Dataset for Converting Natural Language into Cypher | Oct 17, 2022 | Natural Language QueriesSemantic Parsing | CodeCode Available | 1 | 5 |
| V3CTRON | Data Retrieval & Access System For Flexible Semantic Search & Retrieval Of Proprietary Document Collections Using Natural Language Queries. | Apr 26, 2023 | Conversational SearchInformation Retrieval | CodeCode Available | 1 | 5 |