Modulating Language Model Experiences through Frictions Jun 24, 2024 Friction Information Retrieval
— Unverified 0Claude 3.5 Sonnet Model Card Addendum Jun 24, 2024 Code Generation MMR total
— Unverified 0Training-Free Exponential Context Extension via Cascading KV Cache Jun 24, 2024 Book summarization Computational Efficiency
Code Code Available 0MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs Jun 24, 2024 Question Answering Visual Question Answering
— Unverified 0DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs Jun 24, 2024 Question Answering Retrieval
Code Code Available 1Attention Instruction: Amplifying Attention in the Middle via Prompting Jun 24, 2024 Position Question Answering
Code Code Available 0GPT-4V Explorations: Mining Autonomous Driving Jun 24, 2024 Autonomous Driving Decision Making
— Unverified 0UniPSDA: Unsupervised Pseudo Semantic Data Augmentation for Zero-Shot Cross-Lingual Natural Language Understanding Jun 24, 2024 Data Augmentation Natural Language Understanding
Code Code Available 0Directed Domain Fine-Tuning: Tailoring Separate Modalities for Specific Training Tasks Jun 24, 2024 Question Answering Text Generation
— Unverified 0Is your benchmark truly adversarial? AdvScore: Evaluating Human-Grounded Adversarialness Jun 24, 2024 Language Modeling Language Modelling
— Unverified 0LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing Jun 24, 2024 Question Answering
Code Code Available 1Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model Jun 24, 2024 Answer Generation Information Retrieval
— Unverified 0SEAM: A Stochastic Benchmark for Multi-Document Tasks Jun 23, 2024 coreference-resolution Coreference Resolution
— Unverified 0HCQA @ Ego4D EgoSchema Challenge 2024 Jun 22, 2024 Caption Generation
Code Code Available 1MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception Jun 22, 2024 Common Sense Reasoning Language Modelling
— Unverified 0TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning Jun 21, 2024 Fairness Geographic Question Answering
Code Code Available 270B-parameter large language models in Japanese medical question-answering Jun 21, 2024 Continual Pretraining Domain Adaptation
— Unverified 0Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering Jun 21, 2024 Multi-hop Question Answering Question Answering
— Unverified 0UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis Jun 21, 2024 Question Answering RAG
Code Code Available 1Towards Retrieval Augmented Generation over Large Video Libraries Jun 21, 2024 Answer Generation Question Answering
— Unverified 0Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis Jun 21, 2024 Attribute Medical Visual Question Answering
— Unverified 0Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding Jun 21, 2024 Cross-corpus Decoder
— Unverified 0Sports Intelligence: Assessing the Sports Understanding Capabilities of Language Models through Question Answering from Text to Video Jun 21, 2024 Benchmarking Few-Shot Learning
— Unverified 0PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference Jun 20, 2024 Question Answering Safety Alignment
— Unverified 0Understanding Finetuning for Factual Knowledge Extraction Jun 20, 2024 MMLU Question Answering
— Unverified 0A Learn-Then-Reason Model Towards Generalization in Knowledge Base Question Answering Jun 20, 2024 Knowledge Base Question Answering Language Modelling
— Unverified 0TTQA-RS- A break-down prompting approach for Multi-hop Table-Text Question Answering with Reasoning and Summarization Jun 20, 2024 Information Retrieval Question Answering
— Unverified 0SuperGLEBer: German Language Understanding Evaluation Benchmark Jun 20, 2024 Document Classification Natural Language Understanding
Code Code Available 1TAGLAS: An atlas of text-attributed graph datasets in the era of large graph and language models Jun 20, 2024 Graph Question Answering Node Classification
Code Code Available 2Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework Jun 20, 2024 Hallucination Question Answering
Code Code Available 2VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning Jun 20, 2024 Image Comprehension Question Answering
Code Code Available 0LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors Jun 20, 2024 16k Instruction Following
Code Code Available 1Investigating Mysteries of CoT-Augmented Distillation Jun 20, 2024 Question Answering
— Unverified 0Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? Jun 20, 2024 Caption Generation Hallucination
— Unverified 0Ranking LLMs by compression Jun 20, 2024 coreference-resolution Coreference Resolution
— Unverified 0SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages Jun 20, 2024 Language Modelling Large Language Model
— Unverified 0The Fire Thief Is Also the Keeper: Balancing Usability and Privacy in Prompts Jun 20, 2024 Code Generation Question Answering
— Unverified 0QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs Jun 20, 2024 Open-Domain Question Answering Question Answering
Code Code Available 0Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs Jun 20, 2024 Knowledge Distillation Knowledge Graphs
Code Code Available 1Temporal Knowledge Graph Question Answering: A Survey Jun 20, 2024 Graph Question Answering Knowledge Base Question Answering
— Unverified 0Timo: Towards Better Temporal Reasoning for Language Models Jun 20, 2024 Question Answering
Code Code Available 1Robust Few-shot Transfer Learning for Knowledge Base Question Answering with Unanswerable Questions Jun 20, 2024 Knowledge Base Question Answering Question Answering
— Unverified 0Detecting hallucinations in large language models using semantic entropy Jun 19, 2024 Large Language Model Question Answering
Code Code Available 3LIVE: Learnable In-Context Vector for Visual Question Answering Jun 19, 2024 In-Context Learning Question Answering
Code Code Available 1QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism Jun 19, 2024 Multiple-choice Question Answering
— Unverified 0Transferable speech-to-text large language model alignment module Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0MoreHopQA: More Than Multi-hop Reasoning Jun 19, 2024 Question Answering
Code Code Available 1AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding Jun 19, 2024 Question Answering Spatial Reasoning
Code Code Available 1Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators Jun 19, 2024 Fact Verification Question Answering
Code Code Available 1Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation Jun 19, 2024 Question Answering RAG
Code Code Available 1