AuditWen:An Open-Source Large Language Model for Audit Oct 9, 2024 Answer Generation Language Modeling
Code Code Available 1ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models Oct 7, 2024 Question Answering Visual Question Answering
Code Code Available 1ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering Oct 7, 2024 Question Answering Retrieval
Code Code Available 1MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration Oct 6, 2024 Medical Visual Question Answering Question Answering
Code Code Available 1MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions Oct 3, 2024 Code Generation Dialogue Generation
Code Code Available 1FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model Oct 3, 2024 Emotion Recognition Language Modeling
Code Code Available 1Question-guided Knowledge Graph Re-scoring and Injection for Knowledge Graph Question Answering Oct 2, 2024 Graph Question Answering Question Answering
Code Code Available 1A Hitchhikers Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning Oct 1, 2024 Common Sense Reasoning DeepFake Detection
Code Code Available 1Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting Oct 1, 2024 Continual Learning Language Modeling
Code Code Available 1VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs Sep 30, 2024 EgoSchema Language Modelling
Code Code Available 1CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering Sep 29, 2024 Graph Question Answering Question Answering
Code Code Available 1T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition Sep 29, 2024 In-Context Learning Question Answering
Code Code Available 1Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE Sep 26, 2024 image-classification Image Classification
Code Code Available 1Exploring Hint Generation Approaches in Open-Domain Question Answering Sep 24, 2024 Hint Generation Open-Domain Question Answering
Code Code Available 1Boosting Healthcare LLMs Through Retrieved Context Sep 23, 2024 Benchmarking Multiple-choice
Code Code Available 1MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models Sep 23, 2024 Medical Visual Question Answering Question Answering
Code Code Available 1Scene-Text Grounding for Text-Based Video Question Answering Sep 22, 2024 2k Contrastive Learning
Code Code Available 1ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources Sep 20, 2024 Language Modeling Language Modelling
Code Code Available 1Language Models Learn to Mislead Humans via RLHF Sep 19, 2024 Question Answering
Code Code Available 1Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering Sep 19, 2024 Hallucination Hallucination Evaluation
Code Code Available 1Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources Sep 18, 2024 GPU Language Modeling
Code Code Available 1Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs Sep 17, 2024 Question Answering Token Reduction
Code Code Available 1Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent Sep 17, 2024 GSM8K Question Answering
Code Code Available 1L3Cube-IndicQuest: A Benchmark Question Answering Dataset for Evaluating Knowledge of LLMs in Indic Context Sep 13, 2024 Question Answering
Code Code Available 1AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge Sep 11, 2024 Language Modelling Large Language Model
Code Code Available 1LIME: Less Is More for MLLM Evaluation Sep 10, 2024 Image Captioning Question Answering
Code Code Available 1GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering Sep 10, 2024 Question Answering RAG
Code Code Available 1RIRAG: Regulatory Information Retrieval and Answer Generation Sep 9, 2024 Answer Generation Information Retrieval
Code Code Available 1M3-Jepa: Multimodal Alignment via Multi-directional MoE based on the JEPA framework Sep 9, 2024 Computational Efficiency Cross-Modal Retrieval
Code Code Available 1Debate on Graph: a Flexible and Reliable Reasoning Framework for Large Language Models Sep 5, 2024 Answer Generation Graph Question Answering
Code Code Available 1What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices Sep 3, 2024 Question Answering Question Generation
Code Code Available 1CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation Sep 3, 2024 Dataset Generation Question Answering
Code Code Available 1VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning Sep 3, 2024 Chart Question Answering Data Visualization
Code Code Available 1Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos Aug 26, 2024 Form Language Modelling
Code Code Available 1RoundTable: Leveraging Dynamic Schema and Contextual Autocomplete for Enhanced Query Precision in Tabular Question Answering Aug 22, 2024 Natural Language Queries Question Answering
Code Code Available 1Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards Aug 21, 2024 Chunking Computational Efficiency
Code Code Available 1Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering Aug 20, 2024 Multi-hop Question Answering Question Answering
Code Code Available 1V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard? Aug 20, 2024 Few-Shot Learning In-Context Learning
Code Code Available 1Visual Agents as Fast and Slow Thinkers Aug 16, 2024 Question Answering Reasoning Segmentation
Code Code Available 1W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering Aug 15, 2024 Open-Domain Question Answering Question Answering
Code Code Available 1FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection Aug 12, 2024 Answer Generation Decoder
Code Code Available 1Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery Aug 9, 2024 Contrastive Learning Medical Visual Question Answering
Code Code Available 1Citekit: A Modular Toolkit for Large Language Model Citation Generation Aug 6, 2024 Language Modeling Language Modelling
Code Code Available 1DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models Aug 4, 2024 Diagnostic Medical Question Answering
Code Code Available 1DebateQA: Evaluating Question Answering on Debatable Knowledge Aug 2, 2024 Diversity Question Answering
Code Code Available 1Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs Jul 31, 2024 Knowledge Graphs Question Answering
Code Code Available 1Learning Video Context as Interleaved Multimodal Sequences Jul 31, 2024 Language Modeling Language Modelling
Code Code Available 1Boosting Audio Visual Question Answering via Key Semantic-Aware Cues Jul 30, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 1DyGKT: Dynamic Graph Learning for Knowledge Tracing Jul 30, 2024 Graph Learning Knowledge Tracing
Code Code Available 1Enhancing LLM's Cognition via Structurization Jul 23, 2024 Hallucination Hallucination Evaluation
Code Code Available 1