Uncertainty-aware Language Modeling for Selective Question Answering Nov 26, 2023 Language Modeling Language Modelling
— Unverified 0Local Convergence of Approximate Newton Method for Two Layer Nonlinear Regression Nov 26, 2023 Question Answering regression
— Unverified 0See and Think: Embodied Agent in Virtual Environment Nov 26, 2023 Minecraft Question Answering
— Unverified 0GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation Nov 25, 2023 Instruction Following Language Modeling
— Unverified 0Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains Nov 25, 2023 Question Answering
— Unverified 0AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering Nov 25, 2023 Question Answering Video Question Answering
Code Code Available 1GeoChat: Grounded Large Vision-Language Model for Remote Sensing Nov 24, 2023 Instruction Following Language Modeling
Code Code Available 2Question Answering in Natural Language: the Special Case of Temporal Expressions Nov 23, 2023 Question Answering
— Unverified 0Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models Nov 23, 2023 Language Modelling Large Language Model
— Unverified 0FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design Nov 23, 2023 Decision Making Language Modelling
Code Code Available 2PG-Video-LLaVA: Pixel Grounding Large Video-Language Models Nov 22, 2023 Benchmarking Phrase Grounding
Code Code Available 2Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering Nov 22, 2023 Multi-hop Question Answering Question Answering
— Unverified 0AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations Nov 22, 2023 Common Sense Reasoning GSM8K
Code Code Available 0Vamos: Versatile Action Models for Video Understanding Nov 22, 2023 EgoSchema Hard Attention
Code Code Available 0CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews Nov 21, 2023 Question Answering Retrieval
Code Code Available 1AcademicGPT: Empowering Academic Research Nov 21, 2023 Abstract generation General Knowledge
— Unverified 0Do Smaller Language Models Answer Contextualised Questions Through Memorisation Or Generalisation? Nov 21, 2023 Question Answering Semantic Similarity
— Unverified 0Extracting Definienda in Mathematical Scholarly Articles with Transformers Nov 21, 2023 Articles Language Modeling
Code Code Available 1ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science Nov 21, 2023 Document Classification Graph Neural Network
— Unverified 0nach0: Multimodal Natural and Chemical Languages Foundation Model Nov 21, 2023 Decoder model
Code Code Available 1Unifying Corroborative and Contributive Attributions in Large Language Models Nov 20, 2023 Language Modeling Language Modelling
— Unverified 0Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions Nov 20, 2023 Question Answering Visual Question Answering
Code Code Available 0Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks Nov 20, 2023 Language Modeling Language Modelling
Code Code Available 1Towards Robust Text Retrieval with Progressive Learning Nov 20, 2023 Machine Reading Comprehension Question Answering
Code Code Available 0FinanceBench: A New Benchmark for Financial Question Answering Nov 20, 2023 How to refund a wrong transaction in PhonePe Question Answering
Code Code Available 3Zero-Shot Question Answering over Financial Documents using Large Language Models Nov 19, 2023 Language Modeling Language Modelling
— Unverified 0LLM aided semi-supervision for Extractive Dialog Summarization Nov 19, 2023 Extractive Summarization Question Answering
— Unverified 0Journey of Hallucination-minimized Generative AI Solutions for Financial Decision Makers Nov 18, 2023 Answer Generation Decision Making
— Unverified 0An Embodied Generalist Agent in 3D World Nov 18, 2023 3D dense captioning 3D Question Answering (3D-QA)
Code Code Available 2Orca 2: Teaching Small Language Models How to Reason Nov 18, 2023 Arithmetic Reasoning Common Sense Reasoning
— Unverified 0DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines Nov 17, 2023 Language Modelling Large Language Model
Code Code Available 1PEFT-MedAware: Large Language Model for Medical Awareness Nov 17, 2023 Computational Efficiency Language Modeling
— Unverified 0Clarify When Necessary: Resolving Ambiguity Through Interaction with LMs Nov 16, 2023 Machine Translation Natural Language Inference
— Unverified 0Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge Nov 16, 2023 Question Answering valid
Code Code Available 1You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments Nov 16, 2023 Natural Language Understanding Negation
Code Code Available 0Graph Elicitation for Guiding Multi-Step Reasoning in Large Language Models Nov 16, 2023 Multi-hop Question Answering Question Answering
— Unverified 0Online Continual Knowledge Learning for Language Models Nov 16, 2023 Continual Learning Fact Checking
— Unverified 0SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU Nov 16, 2023 Intent Detection Natural Language Understanding
Code Code Available 0What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception Nov 16, 2023 In-Context Learning Question Answering
Code Code Available 0Downstream Trade-offs of a Family of Text Watermarks Nov 16, 2023 Form Language Modelling
Code Code Available 0Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning Nov 16, 2023 Data Augmentation Question Answering
Code Code Available 0StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning Nov 16, 2023 Question Answering World Knowledge
Code Code Available 0On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering Nov 16, 2023 Question Answering Retrieval
— Unverified 0Investigating Data Contamination in Modern Benchmarks for Large Language Models Nov 16, 2023 Common Sense Reasoning MMLU
— Unverified 0Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering Nov 16, 2023 Question Answering
— Unverified 0Crafting In-context Examples according to LMs' Parametric Knowledge Nov 16, 2023 Hallucination In-Context Learning
Code Code Available 0Video-LLaVA: Learning United Visual Representation by Alignment Before Projection Nov 16, 2023 Language Modeling Language Modelling
Code Code Available 4Leveraging LLMs in Scholarly Knowledge Graph Question Answering Nov 16, 2023 Graph Question Answering Language Modeling
Code Code Available 0VideoCon: Robust Video-Language Alignment via Contrast Captions Nov 15, 2023 Language Modeling Language Modelling
Code Code Available 1Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach Nov 15, 2023 Form Long Form Question Answering
— Unverified 0