Beyond Text: Frozen Large Language Models in Visual Signal Comprehension Mar 12, 2024 Deblurring Decoder
Code Code Available 2Beyond Memorization: The Challenge of Random Memory Access in Language Models Mar 12, 2024 Memorization Open-Domain Question Answering
Code Code Available 1Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs Mar 12, 2024 Knowledge Graphs Multiple-choice
Code Code Available 1InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models Mar 11, 2024 Code Generation HumanEval
Code Code Available 1SPA: Towards A Computational Friendly Cloud-Base and On-Devices Collaboration Seq2seq Personalized Generation with Casual Inference Mar 11, 2024 Question Answering Single Particle Analysis
Code Code Available 0ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis Mar 11, 2024 Question Answering
Code Code Available 2ALaRM: Align Language Models via Hierarchical Rewards Modeling Mar 11, 2024 Long Form Question Answering Machine Translation
Code Code Available 1Answering Diverse Questions via Text Attached with Key Audio-Visual Clues Mar 11, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 0From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification Mar 10, 2024 Abstractive Text Summarization Entity Typing
— Unverified 0KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques Mar 9, 2024 Knowledge Graphs Long Form Question Answering
Code Code Available 2Calibrating Large Language Models Using Their Generations Only Mar 9, 2024 Question Answering Text Generation
Code Code Available 1MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs Mar 9, 2024 Conversational Question Answering Dialogue Generation
— Unverified 0Debiasing Multimodal Large Language Models Mar 8, 2024 Fairness Question Answering
Code Code Available 2Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Mar 8, 2024 1 Image, 2*2 Stitching Code Generation
Code Code Available 3Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering Mar 8, 2024 Answer Generation Open-Domain Question Answering
Code Code Available 1ChatASU: Evoking LLM's Reflexion to Truly Understand Aspect Sentiment in Dialogues Mar 8, 2024 Hallucination Question Answering
— Unverified 0Can't Remember Details in Long Documents? You Need Some R&R Mar 8, 2024 Question Answering
Code Code Available 1Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought Mar 8, 2024 Language Modeling Language Modelling
Code Code Available 1Few shot chain-of-thought driven reasoning to prompt LLMs for open ended medical question answering Mar 7, 2024 Information Retrieval Language Modelling
Code Code Available 0CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios Mar 7, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 2Effectiveness Assessment of Recent Large Vision-Language Models Mar 7, 2024 Anomaly Detection Attribute
— Unverified 0Advancing Chinese biomedical text mining with community challenges Mar 7, 2024 Attribute Attribute Extraction
— Unverified 0TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document Mar 7, 2024 document understanding Key Information Extraction
Code Code Available 5HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild Mar 7, 2024 Hallucination Question Answering
Code Code Available 0QAQ: Quality Adaptive Quantization for LLM KV Cache Mar 7, 2024 Quantization Question Answering
Code Code Available 2SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM Mar 7, 2024 Question Answering Retrieval
— Unverified 0Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning Mar 6, 2024 Multimodal Reasoning Question Answering
Code Code Available 2Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem Mar 6, 2024 Benchmarking Hallucination
Code Code Available 0Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ Mar 6, 2024 Open-Ended Question Answering Question Answering
Code Code Available 0CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments Mar 5, 2024 Language Modelling Large Language Model
— Unverified 0Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use Mar 5, 2024 image-classification Image Classification
— Unverified 0Reliable, Adaptable, and Attributable Language Models with Retrieval Mar 5, 2024 Question Answering Retrieval
— Unverified 0A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching Mar 5, 2024 Chatbot Community Question Answering
— Unverified 0Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering Mar 5, 2024 Form Knowledge Graphs
Code Code Available 0MOKA: Open-World Robotic Manipulation through Mark-Based Visual Prompting Mar 5, 2024 In-Context Learning Object Rearrangement
— Unverified 0Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation Mar 5, 2024 Data Augmentation Medical Visual Question Answering
— Unverified 0An Improved Traditional Chinese Evaluation Suite for Foundation Model Mar 4, 2024 Multiple-choice Question Answering
— Unverified 0Brilla AI: AI Contestant for the National Science and Maths Quiz Mar 4, 2024 Math Question Answering
Code Code Available 1Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review Mar 4, 2024 Medical Report Generation Question Answering
Code Code Available 3The Claude 3 Model Family: Opus, Sonnet, Haiku Mar 4, 2024 1 Image, 2*2 Stitching Arithmetic Reasoning
— Unverified 0To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering Mar 4, 2024 MedQA MMLU
Code Code Available 1EEE-QA: Exploring Effective and Efficient Question-Answer Representations Mar 4, 2024 Knowledge Graphs Question Answering
Code Code Available 0SyllabusQA: A Course Logistics Question Answering Dataset Mar 3, 2024 Language Modeling Language Modelling
Code Code Available 0KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations Mar 3, 2024 MedQA MMLU
— Unverified 0Answerability in Retrieval-Augmented Open-Domain Question Answering Mar 3, 2024 Open-Domain Question Answering Question Answering
— Unverified 0Automatic Question-Answer Generation for Long-Tail Knowledge Mar 3, 2024 Answer Generation Knowledge Graphs
— Unverified 0CR-LT-KGQA: A Knowledge Graph Question Answering Dataset Requiring Commonsense Reasoning and Long-Tail Knowledge Mar 3, 2024 Claim Verification Graph Question Answering
Code Code Available 1Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering Mar 3, 2024 Claim Verification Graph Question Answering
— Unverified 0Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge Mar 3, 2024 Data Augmentation Question Answering
Code Code Available 0Improving Cross-lingual Representation for Semantic Retrieval with Code-switching Mar 3, 2024 Question Answering Retrieval
— Unverified 0