LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image Aug 14, 2024 Autonomous Driving Logical Reasoning
— Unverified 0Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces Aug 13, 2024 Attribute Language Modeling
— Unverified 0MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty Aug 13, 2024 Mathematical Reasoning Question Answering
Code Code Available 0CROME: Cross-Modal Adapters for Efficient Multimodal LLM Aug 13, 2024 Instruction Following Language Modeling
— Unverified 0Creating Arabic LLM Prompts at Scale Aug 12, 2024 Headline Generation Instruction Following
— Unverified 0FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection Aug 12, 2024 Answer Generation Decoder
Code Code Available 1Quantum Algorithms for Compositional Text Processing Aug 12, 2024 Question Answering text similarity
— Unverified 0Reference-free Hallucination Detection for Large Vision-Language Models Aug 11, 2024 Hallucination Question Answering
— Unverified 0Chain of Condition: Construct, Verify and Solve Conditions for Conditional Question Answering Aug 10, 2024 Question Answering
— Unverified 0SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning Aug 10, 2024 Hallucination Optical Character Recognition
Code Code Available 11Context-Driven Index Trimming: A Data Quality Perspective to Enhancing Precision of RALMs Aug 10, 2024 Question Answering Retrieval
Code Code Available 0MSG-Chart: Multimodal Scene Graph for ChartQA Aug 9, 2024 Chart Question Answering Inductive Bias
Code Code Available 0Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery Aug 9, 2024 Contrastive Learning Medical Visual Question Answering
Code Code Available 1Towards a Generative Approach for Emotion Detection and Reasoning Aug 9, 2024 Emotion Recognition Generative Question Answering
— Unverified 0Revisiting Multi-Modal LLM Evaluation Aug 9, 2024 Chart Understanding Optical Character Recognition
— Unverified 0Enhancing Robustness of Retrieval-Augmented Language Models with In-Context Learning Aug 8, 2024 In-Context Learning Machine Reading Comprehension
— Unverified 0Enhancing Healthcare through Large Language Models: A Study on Medical Question Answering Aug 8, 2024 Medical Question Answering Question Answering
— Unverified 0Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models Aug 8, 2024 Contrastive Learning Fine-Grained Image Recognition
— Unverified 0EfficientRAG: Efficient Retriever for Multi-Hop Question Answering Aug 8, 2024 Multi-hop Question Answering Question Answering
Code Code Available 2NatLan: Native Language Prompting Facilitates Knowledge Elicitation Through Language Trigger Provision and Domain Trigger Retention Aug 7, 2024 Question Answering
Code Code Available 0Target Prompting for Information Extraction with Vision Language Model Aug 7, 2024 Language Modeling Language Modelling
— Unverified 0Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation Aug 7, 2024 GPU Question Answering
— Unverified 0Citekit: A Modular Toolkit for Large Language Model Citation Generation Aug 6, 2024 Language Modeling Language Modelling
Code Code Available 1GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI Aug 6, 2024 Question Answering Visual Question Answering
Code Code Available 2500xCompressor: Generalized Prompt Compression for Large Language Models Aug 6, 2024 Language Modelling Large Language Model
Code Code Available 2Targeted Visual Prompting for Medical Visual Question Answering Aug 6, 2024 Medical Visual Question Answering Question Answering
Code Code Available 0Leveraging Inter-Chunk Interactions for Enhanced Retrieval in Large Language Model-Based Question Answering Aug 6, 2024 Answer Generation Language Modeling
— Unverified 0XMainframe: A Large Language Model for Mainframe Modernization Aug 5, 2024 Code Summarization Language Modeling
Code Code Available 2Entity Retrieval for Answering Entity-Centric Questions Aug 5, 2024 Entity Retrieval Question Answering
— Unverified 0Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining Aug 5, 2024 Decoder Depth Estimation
Code Code Available 7Developing PUGG for Polish: A Modern Approach to KBQA, MRC, and IR Dataset Construction Aug 5, 2024 Information Retrieval Knowledge Base Question Answering
Code Code Available 0REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models Aug 5, 2024 Question Answering Spatial Reasoning
— Unverified 0Knowledge AI: Fine-tuning NLP Models for Facilitating Scientific Knowledge Extraction and Understanding Aug 4, 2024 named-entity-recognition Named Entity Recognition
— Unverified 0DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models Aug 4, 2024 Diagnostic Medical Question Answering
Code Code Available 1MMPKUBase: A Comprehensive and High-quality Chinese Multi-modal Knowledge Graph Aug 3, 2024 Attribute Contrastive Learning
— Unverified 0Compositional Physical Reasoning of Objects and Events from Videos Aug 2, 2024 counterfactual Question Answering
— Unverified 0DebateQA: Evaluating Question Answering on Debatable Knowledge Aug 2, 2024 Diversity Question Answering
Code Code Available 1Adaptive Contrastive Decoding in Retrieval-Augmented Generation for Handling Noisy Contexts Aug 2, 2024 Open-Domain Question Answering Question Answering
— Unverified 0Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs Aug 2, 2024 Machine Translation Model Compression
— Unverified 0RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework Aug 2, 2024 Benchmarking Dataset Generation
Code Code Available 3BioRAG: A RAG-LLM Framework for Biological Question Reasoning Aug 2, 2024 Information Retrieval Question Answering
— Unverified 0Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions Aug 1, 2024 Medical Question Answering MedQA
Code Code Available 4Towards Flexible Evaluation for Generative Visual Question Answering Aug 1, 2024 Decoder Generative Visual Question Answering
Code Code Available 0SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving Jul 31, 2024 Autonomous Driving Language Modeling
— Unverified 0Learning Video Context as Interleaved Multimodal Sequences Jul 31, 2024 Language Modeling Language Modelling
Code Code Available 1Cost-Effective Hallucination Detection for LLMs Jul 31, 2024 Decision Making Fact Checking
— Unverified 0Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering Jul 31, 2024 Diagnostic Hallucination
— Unverified 0The Llama 3 Herd of Models Jul 31, 2024 answerability prediction Language Modeling
Code Code Available 4Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs Jul 31, 2024 Knowledge Graphs Question Answering
Code Code Available 1Decomposed Prompting to Answer Questions on a Course Discussion Board Jul 30, 2024 Language Modeling Language Modelling
Code Code Available 0