| H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables | Jun 29, 2024 | Fact Verification, Mathematical Reasoning | Code Available | 1 | 5 |
| DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos | May 22, 2025 | Natural Language Moment Retrieval, Natural Language Queries | Code Available | 1 | 5 |
| Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering | Apr 18, 2024 | Audio-visual Question Answering, Audio-Visual Question Answering (AVQA) | Code Available | 1 | 5 |
| Detecting Moments and Highlights in Videos via Natural Language Queries | Dec 1, 2021 | Decoder, Moment Retrieval | Code Available | 1 | 5 |
| Joint Moment Retrieval and Highlight Detection Via Natural Language Queries | May 8, 2023 | Decoder, Highlight Detection | Code Available | 1 | 5 |
| MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval | Aug 20, 2024 | Mamba, Natural Language Queries | Code Available | 1 | 5 |
| Enhancing Network Management Using Code Generated by Large Language Models | Aug 11, 2023 | Management, Natural Language Queries | Code Available | 1 | 5 |
| EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone | Jul 11, 2023 | Action Recognition, Moment Queries | Code Available | 1 | 5 |
| CoSQA: 20,000+ Web Queries for Code Search and Question Answering | May 27, 2021 | Code Search, Contrastive Learning | Code Available | 1 | 5 |
| CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning | Oct 14, 2024 | Math, Mathematical Reasoning | Code Available | 1 | 5 |