Generic Attention-model Explainability by Weighted Relevance Accumulation Aug 20, 2023 Image Captioning Question Answering
— Unverified 0ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights Aug 20, 2023 3D Classification Question Answering
Code Code Available 1Imaginations of WALL-E : Reconstructing Experiences with an Imagination-Inspired Module for Advanced AI Systems Aug 20, 2023 Emotion Recognition Language Modelling
— Unverified 0GameEval: Evaluating LLMs on Conversational Games Aug 19, 2023 Question Answering
Code Code Available 1Breaking Language Barriers: A Question Answering Dataset for Hindi and Marathi Aug 19, 2023 Question Answering
— Unverified 0BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions Aug 19, 2023 MME Optical Character Recognition (OCR)
Code Code Available 2Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models Aug 18, 2023 Image-text matching Object Localization
— Unverified 0BioMedGPT: Open Multimodal Generative Pre-trained Transformer for BioMedicine Aug 18, 2023 Few-Shot Learning Language Modeling
— Unverified 0Differentiable Retrieval Augmentation via Generative Language Modeling for E-commerce Query Intent Classification Aug 18, 2023 intent-classification Intent Classification
— Unverified 0Accelerated materials language processing enabled by GPT Aug 18, 2023 Document Classification Extractive Question-Answering
— Unverified 0PUMGPT: A Large Vision-Language Model for Product Understanding Aug 18, 2023 Attribute Attribute Extraction
— Unverified 0Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models Aug 18, 2023 Multiple-choice Question Answering
Code Code Available 1EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding Aug 17, 2023 Diagnostic EgoSchema
Code Code Available 1MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models Aug 17, 2023 Decision Making Hallucination
Code Code Available 2End-to-End Beam Retrieval for Multi-Hop Question Answering Aug 17, 2023 Language Modelling Large Language Model
Code Code Available 1Linguistically-Informed Neural Architectures for Lexical, Syntactic and Semantic Tasks in Sanskrit Aug 17, 2023 Dependency Parsing Machine Translation
— Unverified 0MaScQA: A Question Answering Dataset for Investigating Materials Science Knowledge of Large Language Models Aug 17, 2023 Question Answering
— Unverified 0Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks Aug 17, 2023 Question Answering Text Generation
Code Code Available 1Semantic Consistency for Assuring Reliability of Large Language Models Aug 17, 2023 Question Answering Text Generation
— Unverified 0Learning the meanings of function words from grounded language using a visual question answering model Aug 16, 2023 Logical Reasoning Question Answering
Code Code Available 0Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer Aug 16, 2023 Decoder Question Answering
Code Code Available 1TeCH: Text-guided Reconstruction of Lifelike Clothed Humans Aug 16, 2023 Descriptive Question Answering
Code Code Available 2Answering Ambiguous Questions with a Database of Questions, Answers, and Revisions Aug 16, 2023 Passage Retrieval Question Answering
— Unverified 0Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection Aug 16, 2023 Image Captioning Language Modeling
Code Code Available 1AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Aug 16, 2023 3D Human Pose Estimation College Computer Science
Code Code Available 1Question Answering over Linked Data with GPT-3 Aug 15, 2023 Knowledge Base Question Answering Question Answering
Code Code Available 0DiagGPT: An LLM-based and Multi-agent Dialogue System with Automatic Topic Management for Flexible Task-Oriented Dialogue Aug 15, 2023 Chatbot Diagnostic
— Unverified 0Automated Testing and Improvement of Named Entity Recognition Systems Aug 14, 2023 named-entity-recognition Named Entity Recognition
— Unverified 0Large Language Models for Information Retrieval: A Survey Aug 14, 2023 Information Retrieval Question Answering
Code Code Available 2An Ensemble Approach to Question Classification: Integrating Electra Transformer, GloVe, and LSTM Aug 13, 2023 Classification Machine Translation
— Unverified 0Performance Prediction for Multi-hop Questions Aug 12, 2023 Multi-hop Question Answering Prediction
— Unverified 0Foundation Model is Efficient Multimodal Multitask Model Selector Aug 11, 2023 model Model Selection
Code Code Available 1Detecting and Preventing Hallucinations in Large Vision Language Models Aug 11, 2023 16k Hallucination
Code Code Available 1KETM:A Knowledge-Enhanced Text Matching method Aug 11, 2023 Common Sense Reasoning Question Answering
Code Code Available 1LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts Aug 11, 2023 Language Modelling Question Answering
Code Code Available 0Progressive Spatio-temporal Perception for Audio-Visual Question Answering Aug 10, 2023 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 1Building Interpretable and Reliable Open Information Retriever for New Domains Overnight Aug 9, 2023 Information Retrieval Open-Domain Question Answering
— Unverified 0Answering Unseen Questions With Smaller Language Models Using Rationale Generation and Dense Retrieval Aug 9, 2023 ARC Language Modelling
— Unverified 0ADMUS: A Progressive Question Answering Framework Adaptable to Multiple Knowledge Sources Aug 9, 2023 Knowledge Base Question Answering Question Answering
— Unverified 0Sci-CoT: Leveraging Large Language Models for Enhanced Knowledge Distillation in Small Models for Scientific QA Aug 9, 2023 ARC Knowledge Distillation
— Unverified 0Top K Relevant Passage Retrieval for Biomedical Question Answering Aug 8, 2023 Articles Passage Retrieval
Code Code Available 0On Monotonic Aggregation for Open-domain QA Aug 8, 2023 Language Modeling Language Modelling
Code Code Available 03D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment Aug 8, 2023 3D Question Answering (3D-QA) Dense Captioning
Code Code Available 2OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation Aug 8, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Towards an AI to Win Ghana's National Science and Maths Quiz Aug 8, 2023 Math Question Answering
Code Code Available 1TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models Aug 7, 2023 backdoor defense object-detection
Code Code Available 0Trusting Language Models in Education Aug 7, 2023 Question Answering
— Unverified 0Prompt Guided Copy Mechanism for Conversational Question Answering Aug 7, 2023 Conversational Question Answering Question Answering
— Unverified 0SciGraphQA: A Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs Aug 7, 2023 Question Answering Visual Question Answering
Code Code Available 1PaniniQA: Enhancing Patient Education Through Interactive Question Answering Aug 7, 2023 Question Answering
Code Code Available 0