MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation Dec 28, 2023 GSM8K Language Model Evaluation
Code Code Available 1AQUALLM: Audio Question Answering Data Generation Using Large Language Models Dec 28, 2023 Audio Question Answering Question Answering
Code Code Available 0MIVC: Multiple Instance Visual Component for Visual-Language Models Dec 28, 2023 Question Answering Visual Question Answering
— Unverified 0GUITAR: Gradient Pruning toward Fast Neural Ranking Dec 28, 2023 Question Answering Representation Learning
— Unverified 0OmniDialog: An Omnipotent Pre-training Model for Task-Oriented Dialogue System Dec 28, 2023 Dialogue Generation Dialogue Management
— Unverified 0Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges Dec 27, 2023 Question Answering
— Unverified 0Gemini Pro Defeated by GPT-4V: Evidence from Education Dec 27, 2023 image-classification Image Classification
— Unverified 0Conversational Question Answering with Reformulations over Knowledge Graph Dec 27, 2023 Conversational Question Answering Knowledge Graphs
— Unverified 0S2M: Converting Single-Turn to Multi-Turn Datasets for Conversational Question Answering Dec 27, 2023 Conversational Question Answering Data Augmentation
— Unverified 0From text to multimodal: a survey of adversarial example generation in question answering systems Dec 26, 2023 Question Answering Question Generation
— Unverified 0Detection-based Intermediate Supervision for Visual Question Answering Dec 26, 2023 cross-modal alignment Logical Reasoning
— Unverified 0KnowledgeNavigator: Leveraging Large Language Models for Enhanced Reasoning over Knowledge Graph Dec 26, 2023 Hallucination Language Modeling
— Unverified 0Supervised Knowledge Makes Large Language Models Better In-context Learners Dec 26, 2023 In-Context Learning Natural Language Understanding
Code Code Available 0SecQA: A Concise Question-Answering Dataset for Evaluating Large Language Models in Computer Security Dec 26, 2023 Computer Security Multiple-choice
Code Code Available 0PersianLLaMA: Towards Building First Persian Large Language Model Dec 25, 2023 Language Modeling Language Modelling
— Unverified 0On the Promises and Challenges of Multimodal Foundation Models for Geographical, Environmental, Agricultural, and Urban Planning Applications Dec 23, 2023 geo-localization image-classification
— Unverified 0Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue Dec 23, 2023 Attribute Language Modeling
— Unverified 0Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought Dec 23, 2023 Question Answering
Code Code Available 1PokeMQA: Programmable knowledge editing for Multi-hop Question Answering Dec 23, 2023 Answer Generation knowledge editing
Code Code Available 1Towards a Unified Multimodal Reasoning Framework Dec 22, 2023 Multimodal Reasoning Multiple-choice
Code Code Available 0Numerical Reasoning for Financial Reports Dec 22, 2023 Decision Making Question Answering
Code Code Available 0Computational Semantics and Evaluation Benchmark for Interrogative Sentences via Combinatory Categorial Grammar Dec 22, 2023 Question Answering
— Unverified 0Typhoon: Thai Large Language Models Dec 21, 2023 Question Answering World Knowledge
— Unverified 0Text2Analysis: A Benchmark of Table Question Answering with Advanced Data Analysis and Unclear Queries Dec 21, 2023 Question Answering
— Unverified 0DriveLM: Driving with Graph Visual Question Answering Dec 21, 2023 Autonomous Driving Question Answering
Code Code Available 3Shai: A large language model for asset management Dec 21, 2023 Asset Management Language Modeling
— Unverified 0Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs Dec 21, 2023 Document Classification Knowledge Graphs
— Unverified 0LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding Dec 21, 2023 Instruction Following Language Modeling
— Unverified 0Reducing Hallucinations: Enhancing VQA for Flood Disaster Damage Assessment with Visual Contexts Dec 21, 2023 Hallucination Question Answering
— Unverified 0VCoder: Versatile Vision Encoders for Multimodal Large Language Models Dec 21, 2023 Image Captioning Image Generation
Code Code Available 2LingoQA: Visual Question Answering for Autonomous Driving Dec 21, 2023 Autonomous Driving Decision Making
Code Code Available 2Object Attribute Matters in Visual Question Answering Dec 20, 2023 Attribute Graph Neural Network
Code Code Available 0DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines Dec 20, 2023 Language Modeling Language Modelling
— Unverified 0Interactive Visual Task Learning for Robots Dec 20, 2023 Continual Learning Novel Concepts
— Unverified 0Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering Dec 20, 2023 Question Answering Visual Question Answering
— Unverified 0Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy Dec 20, 2023 Language Modeling Language Modelling
Code Code Available 2Perception Test 2023: A Summary of the First Challenge And Outcome Dec 20, 2023 Benchmarking Grounded Video Question Answering
— Unverified 0Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering Dec 20, 2023 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 0Cross-Modal Reasoning with Event Correlation for Video Question Answering Dec 20, 2023 Question Answering Video Question Answering
— Unverified 0Generative Multimodal Models are In-Context Learners Dec 20, 2023 In-Context Learning Personalized Image Generation
Code Code Available 3Contextual Code Switching for Machine Translation using Language Models Dec 20, 2023 Machine Translation Question Answering
— Unverified 0On Early Detection of Hallucinations in Factual Question Answering Dec 19, 2023 Hallucination Open-Ended Question Answering
Code Code Available 1MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA Dec 19, 2023 Document Classification Hallucination
Code Code Available 0PEPT: Expert Finding Meets Personalized Pre-training Dec 19, 2023 Community Question Answering Language Modelling
— Unverified 0Relation-Aware Question Answering for Heterogeneous Knowledge Graphs Dec 19, 2023 Knowledge Base Question Answering Knowledge Graphs
Code Code Available 0VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering Dec 19, 2023 Image Retrieval Question Answering
Code Code Available 0EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering Dec 19, 2023 Object Object Counting
Code Code Available 1CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update Dec 18, 2023 Continual Learning Question Answering
— Unverified 0OsmLocator: locating overlapping scatter marks with a non-training generative perspective Dec 18, 2023 Clustering Combinatorial Optimization
Code Code Available 0HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles Dec 18, 2023 Question Answering Visual Question Answering
Code Code Available 1