TextGrad: Automatic "Differentiation" via Text Jun 11, 2024 Question Answering Specificity
Code Code Available 7Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Jun 11, 2024 Benchmarking Contrastive Learning
Code Code Available 0MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs Jun 11, 2024 Question Answering
Code Code Available 0DARA: Decomposition-Alignment-Reasoning Autonomous Language Agent for Question Answering over Knowledge Graphs Jun 11, 2024 In-Context Learning Knowledge Graphs
Code Code Available 0Situational Awareness Matters in 3D Vision Language Reasoning Jun 11, 2024 Question Answering
Code Code Available 1Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway Jun 11, 2024 Language Modeling Language Modelling
Code Code Available 0Paraphrasing in Affirmative Terms Improves Negation Understanding Jun 11, 2024 Natural Language Inference Natural Language Understanding
— Unverified 0VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Jun 11, 2024 Multiple-choice Question Answering
Code Code Available 5RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent Jun 11, 2024 AI Agent Descriptive
Code Code Available 2DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering Jun 11, 2024 Question Answering RAG
— Unverified 0BrainChat: Decoding Semantic Information from fMRI using Vision-language Pretrained Models Jun 10, 2024 Decoder Question Answering
— Unverified 0SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature Jun 10, 2024 Claim Verification Instruction Following
Code Code Available 1Evaluating the Retrieval Component in LLM-Based Question Answering Systems Jun 10, 2024 Information Retrieval Question Answering
— Unverified 0Harnessing AI for efficient analysis of complex policy documents: a case study of Executive Order 14110 Jun 10, 2024 Question Answering
— Unverified 0HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs Jun 10, 2024 Knowledge Graphs Multi-hop Question Answering
— Unverified 0Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024 Jun 10, 2024 Language Modelling object-detection
— Unverified 0Transforming Wearable Data into Health Insights using Large Language Model Agents Jun 10, 2024 Code Generation Information Retrieval
— Unverified 0VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text Jun 10, 2024 Language Modeling Language Modelling
Code Code Available 1Recurrent Context Compression: Efficiently Expanding the Context Window of LLM Jun 10, 2024 Long-Context Understanding Question Answering
Code Code Available 2Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for Dialogue Jun 10, 2024 In-Context Learning Question Answering
Code Code Available 0CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark Jun 10, 2024 Diversity Question Answering
— Unverified 0Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning Jun 10, 2024 Multi-hop Question Answering Question Answering
Code Code Available 3MedExQA: Medical Question Answering Benchmark with Multiple Explanations Jun 10, 2024 Medical Question Answering Question Answering
Code Code Available 0MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking Model Jun 9, 2024 Information Retrieval Learning-To-Rank
— Unverified 0MedREQAL: Examining Medical Knowledge Recall of Large Language Models via Question Answering Jun 9, 2024 Question Answering
— Unverified 0Zero-Shot End-To-End Spoken Question Answering In Medical Domain Jun 9, 2024 Answer Selection Question Answering
— Unverified 0Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs for Open-Ended Responses Jun 9, 2024 Question Answering Semantic Similarity
— Unverified 0RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation Jun 9, 2024 Document Ranking Natural Questions
Code Code Available 0F-LMM: Grounding Frozen Large Multimodal Models Jun 9, 2024 General Knowledge Instruction Following
Code Code Available 2Do LLMs Recognize me, When I is not me: Assessment of LLMs Understanding of Turkish Indexical Pronouns in Indexical Shift Contexts Jun 8, 2024 Machine Translation Multiple-choice
— Unverified 0Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation Jun 8, 2024 Abstractive Text Summarization Dialogue Generation
— Unverified 0CERET: Cost-Effective Extrinsic Refinement for Text Generation Jun 8, 2024 Abstractive Text Summarization Question Answering
Code Code Available 0Towards a Benchmark for Causal Business Process Reasoning with LLMs Jun 8, 2024 Question Answering
Code Code Available 0CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation Jun 8, 2024 Open-Domain Question Answering Question Answering
— Unverified 0Venn Diagram Prompting : Accelerating Comprehension with Scaffolding Effect Jun 8, 2024 Question Answering
— Unverified 0Composition Vision-Language Understanding via Segment and Depth Anything Model Jun 7, 2024 Question Answering Visual Question Answering (VQA)
Code Code Available 0LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering Jun 7, 2024 Graph Question Answering Language Modeling
Code Code Available 1TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large Language Models Jun 7, 2024 Medical Question Answering Question Answering
— Unverified 0On Subjective Uncertainty Quantification and Calibration in Natural Language Generation Jun 7, 2024 In-Context Learning Machine Translation
Code Code Available 0CRiskEval: A Chinese Multi-Level Risk Evaluation Benchmark Dataset for Large Language Models Jun 7, 2024 Multiple-choice Philosophy
Code Code Available 0CRAG -- Comprehensive RAG Benchmark Jun 7, 2024 Hallucination Language Modelling
Code Code Available 3CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning Jun 7, 2024 Instruction Following Math
Code Code Available 2ComplexTempQA: A Large-Scale Dataset for Complex Temporal Question Answering Jun 7, 2024 Information Retrieval Question Answering
Code Code Available 1MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources Jun 7, 2024 Language Modeling Language Modelling
— Unverified 0Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Jun 6, 2024 Multiple-choice Question Answering
— Unverified 0Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation Jun 6, 2024 Conversational Question Answering Question Answering
— Unverified 0M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering Jun 6, 2024 abstractive question answering Clinical Knowledge
Code Code Available 0Understanding Information Storage and Transfer in Multi-modal Large Language Models Jun 6, 2024 Factual Visual Question Answering Model Editing
— Unverified 0FairytaleQA Translated: Enabling Educational Question and Answer Generation in Less-Resourced Languages Jun 6, 2024 Answer Generation Question Answering
Code Code Available 0Semantically Diverse Language Generation for Uncertainty Estimation in Language Models Jun 6, 2024 Question Answering Text Generation
Code Code Available 1