CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning Jun 7, 2024 Instruction Following Math
RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models May 4, 2023 Information Retrieval Open-Domain Question Answering
RetroMAE v2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models Nov 16, 2022 Dimensionality Reduction Information Retrieval
Revealing Single Frame Bias for Video-and-Language Learning Jun 7, 2022 Action Recognition Fine-grained Action Recognition
Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs Oct 14, 2024 Computational Efficiency Question Answering
FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation Jun 10, 2025 Image-text Retrieval Question Answering
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design Nov 23, 2023 Decision Making Language Modelling
A Survey on Benchmarks of Multimodal Large Language Models Aug 16, 2024 Question Answering Survey
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models Oct 13, 2023 Hallucination Image Captioning
Scientific QA System with Verifiable Answers Jul 16, 2024 Articles Information Retrieval
GIT: A Generative Image-to-text Transformer for Vision and Language May 27, 2022 Decoder Image Captioning
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions Jul 6, 2024 Question Answering RAG
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents Jul 5, 2024 Decision Making Multi-hop Question Answering
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models Feb 21, 2024 Question Answering
Atlas: Few-shot Learning with Retrieval Augmented Language Models Aug 5, 2022 Fact Checking Few-Shot Learning
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations Sep 26, 2019 Common Sense Reasoning GPU
A Replication Study of Dense Passage Retriever Apr 12, 2021 Open-Domain Question Answering Question Answering
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion Feb 8, 2024 Computational Efficiency Multimodal Reasoning
FakeBench: Probing Explainable Fake Image Detection via Large Multimodal Models Apr 20, 2024 Binary Classification Fake Image Detection
SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation Dec 17, 2024 Fact Verification Knowledge Graphs
Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning Mar 6, 2024 Multimodal Reasoning Question Answering
Explore the Limits of Omni-modal Pretraining at Scale Jun 13, 2024 Language Modeling Language Modelling
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Oct 23, 2019 Answer Generation Common Sense Reasoning
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers Jul 12, 2024 Articles Question Answering
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Oct 21, 2024 Open-Domain Question Answering Question Answering
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval Mar 1, 2025 GPU Question Answering
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework Jun 20, 2024 Hallucination Question Answering
StructGPT: A General Framework for Large Language Model to Reason over Structured Data May 16, 2023 Language Modeling Language Modelling
Synthetic QA Corpora Generation with Roundtrip Consistency Jun 12, 2019 Question Answering Question Generation
ANAH: Analytical Annotation of Hallucinations in Large Language Models May 30, 2024 Generative Question Answering Hallucination
TableQuery: Querying tabular data with natural language Jan 27, 2022 Deep Learning Natural Language Queries
TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning Jun 12, 2025 Answer Generation Chunking
Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner May 16, 2025 Cross-Modal Retrieval Diagnostic
DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature May 8, 2024 Question Answering
DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding May 23, 2025 Language Modeling Language Modelling
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis Sep 10, 2024 Contrastive Learning Cross-Modal Retrieval
FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models Apr 24, 2025 Answer Selection Information Retrieval
Task Me Anything Jun 17, 2024 2k Attribute
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing Nov 18, 2021 Language Modeling Language Modelling
Debiasing Multimodal Large Language Models Mar 8, 2024 Fairness Question Answering
E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding Sep 26, 2024 Question Answering Video Understanding
ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis Mar 11, 2024 Question Answering
A Pilot Study for Chinese SQL Semantic Parsing Sep 29, 2019 Cross-Lingual Word Embeddings Question Answering
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM Mar 6, 2025 Anomaly Detection Language Modeling
All in One: Exploring Unified Video-Language Pre-training Mar 14, 2022 All Language Modelling
End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering Nov 8, 2024 Language Modeling Language Modelling
End-To-End Memory Networks Mar 31, 2015 Language Modeling Language Modelling
The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA Feb 28, 2024 Natural Language Understanding Question Answering
Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement May 24, 2024 Hallucination Image Comprehension
Evaluating LLM Reasoning in the Operations Research Domain with ORQA Dec 22, 2024 Question Answering