@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology | Sep 21, 2024 | Benchmarking Depth Estimation | — | Unverified | 0
QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling | Sep 21, 2024 | Multiple-choice Prompt Engineering | Code | Code Available | 0
Co-occurrence is not Factual Association in Language Models | Sep 21, 2024 | Multi-hop Question Answering Question Answering | Code | Code Available | 0
SMART-RAG: Selection using Determinantal Matrices for Augmented Retrieval | Sep 21, 2024 | Diversity Point Processes | — | Unverified | 0
Drift to Remember | Sep 21, 2024 | GPU image-classification | — | Unverified | 0
ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation | Sep 20, 2024 | Descriptive Question Answering | Code | Code Available | 3
ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources | Sep 20, 2024 | Language Modeling Language Modelling | Code | Code Available | 1
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning | Sep 20, 2024 | Language Modeling Language Modelling | Code | Code Available | 0
First Place Solution to the Multiple-choice Video QA Track of The Second Perception Test Challenge | Sep 20, 2024 | Multiple-choice Question Answering | — | Unverified | 0
A Multimodal Dense Retrieval Approach for Speech-Based Open-Domain Question Answering | Sep 20, 2024 | Automatic Speech Recognition Automatic Speech Recognition (ASR) | — | Unverified | 0
Enhancing Large Language Models with Domain-specific Retrieval Augment Generation: A Case Study on Long-form Consumer Health Question Answering in Ophthalmology | Sep 20, 2024 | Evidence Selection Form | — | Unverified | 0
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting | Sep 20, 2024 | Code Generation Memorization | — | Unverified | 0
AQA: Adaptive Question Answering in a Society of LLMs via Contextual Multi-Armed Bandit | Sep 20, 2024 | Question Answering | Code | Code Available | 0
TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning | Sep 19, 2024 | Code Summarization Computational Efficiency | — | Unverified | 0
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering | Sep 19, 2024 | Hallucination Hallucination Evaluation | Code | Code Available | 1
CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarks | Sep 19, 2024 | Instruction Following Open-Ended Question Answering | — | Unverified | 0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models | Sep 19, 2024 | Ethics Multiple-choice | Code | Code Available | 0
Vision Language Models Can Parse Floor Plan Maps | Sep 19, 2024 | Image Captioning Question Answering | — | Unverified | 0
Language Models Learn to Mislead Humans via RLHF | Sep 19, 2024 | Question Answering | Code | Code Available | 1
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning | Sep 19, 2024 | Language Modeling Language Modelling | Code | Code Available | 2
MQA-KEAL: Multi-hop Question Answering under Knowledge Editing for Arabic Language | Sep 18, 2024 | knowledge editing Multi-hop Question Answering | — | Unverified | 0
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPU Language Modeling | Code | Code Available | 1
Finetuning Language Models to Emit Linguistic Expressions of Uncertainty | Sep 18, 2024 | Decision Making Question Answering | — | Unverified | 0
TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning | Sep 18, 2024 | Fact Verification Question Answering | Code | Code Available | 2
Uncertainty-Guided Self-Questioning and Answering for Video-Language Alignment | Sep 17, 2024 | Question Answering Video Question Answering | — | Unverified | 0
Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent | Sep 17, 2024 | GSM8K Question Answering | Code | Code Available | 1
Mamba Fusion: Learning Actions Through Questioning | Sep 17, 2024 | Action Anticipation Action Recognition | Code | Code Available | 0
ProSLM : A Prolog Synergized Language Model for explainable Domain Specific Knowledge Based Question Answering | Sep 17, 2024 | Formal Logic Language Modeling | — | Unverified | 0
Contextual Breach: Assessing the Robustness of Transformer-based QA Models | Sep 17, 2024 | Question Answering | — | Unverified | 0
Sparks of Artificial General Intelligence(AGI) in Semiconductor Material Science: Early Explorations into the Next Frontier of Generative AI-Assisted Electron Micrograph Analysis | Sep 17, 2024 | In-Context Learning Question Answering | — | Unverified | 0
CAST: Cross-modal Alignment Similarity Test for Vision Language Models | Sep 17, 2024 | cross-modal alignment Question Answering | Code | Code Available | 0
OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities | Sep 17, 2024 | cross-modal alignment Question Answering | — | Unverified | 0
Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs | Sep 17, 2024 | Question Answering Token Reduction | Code | Code Available | 1
HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making | Sep 16, 2024 | Answer Generation Decision Making | Code | Code Available | 0
StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models | Sep 16, 2024 | knowledge editing Question Answering | — | Unverified | 0
Explore the Hallucination on Low-level Perception for MLLMs | Sep 15, 2024 | Hallucination Question Answering | — | Unverified | 0
A Benchmark Dataset with Larger Context for Non-Factoid Question Answering over Islamic Text | Sep 15, 2024 | Question Answering | — | Unverified | 0
NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training | Sep 15, 2024 | Contrastive Learning cross-modal alignment | — | Unverified | 0
Active Learning to Guide Labeling Efforts for Question Difficulty Estimation | Sep 14, 2024 | Active Learning Question Answering | Code | Code Available | 0
One missing piece in Vision and Language: A Survey on Comics Understanding | Sep 14, 2024 | document understanding image-classification | Code | Code Available | 2
QTG-VQA: Question-Type-Guided Architectural for VideoQA Systems | Sep 14, 2024 | Question Answering Video Question Answering | — | Unverified | 0
Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types | Sep 14, 2024 | Language Modeling Language Modelling | Code | Code Available | 0
KodeXv0.1: A Family of State-of-the-Art Financial Large Language Models | Sep 13, 2024 | Question Answering RAG | — | Unverified | 0
Contextual Evaluation of Large Language Models for Classifying Tropical and Infectious Diseases | Sep 13, 2024 | Medical Question Answering Navigate | — | Unverified | 0
L3Cube-IndicQuest: A Benchmark Question Answering Dataset for Evaluating Knowledge of LLMs in Indic Context | Sep 13, 2024 | Question Answering | Code | Code Available | 1
Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling | Sep 13, 2024 | Decision Making Language Modeling | — | Unverified | 0
Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding | Sep 13, 2024 | Contrastive Learning Language Modeling | — | Unverified | 0
Contri(e)ve: Context + Retrieve for Scholarly Question Answering | Sep 13, 2024 | Information Retrieval Knowledge Graphs | — | Unverified | 0
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG | Sep 12, 2024 | Benchmarking Question Answering | — | Unverified | 0
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources | Sep 12, 2024 | Multi-hop Question Answering Question Answering | — | Unverified | 0