VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks Jun 10, 2025 Multiple-choice Open-Ended Question Answering
— Unverified 0WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis Jun 10, 2025 Language Modeling Language Modelling
— Unverified 0anyECG-chat: A Generalist ECG-MLLM for Flexible ECG Input and Multi-Task Understanding Jun 1, 2025 Open-Ended Question Answering Question Answering
— Unverified 0CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data Synthesis May 26, 2025 Diversity Open-Ended Question Answering
Code Code Available 0O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering May 22, 2025 Answer Generation Open-Ended Question Answering
Code Code Available 1TinyRS-R1: Compact Multimodal Language Model for Remote Sensing May 17, 2025 Language Modeling Language Modelling
— Unverified 0Ranked Voting based Self-Consistency of Large Language Models May 16, 2025 Multiple-choice Open-Ended Question Answering
Code Code Available 1VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making May 6, 2025 Decision Making General Knowledge
— Unverified 0Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild Apr 17, 2025 Decision Making Information Retrieval
— Unverified 0AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models Mar 20, 2025 Autonomous Driving Multiple-choice
— Unverified 0Time-MQA: Time Series Multi-Task Question Answering with Context Enhancement Feb 26, 2025 Anomaly Detection Natural Language Queries
— Unverified 0FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users Feb 26, 2025 In-Context Learning Meta-Learning
Code Code Available 1PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models Feb 19, 2025 Open-Ended Question Answering Privacy Preserving
Code Code Available 0Neptune: The Long Orbit to Benchmarking Long Video Understanding Dec 12, 2024 Benchmarking Multimodal Reasoning
Code Code Available 2Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study over Open-ended Question Answering Oct 10, 2024 Hallucination Knowledge Graphs
— Unverified 0TVBench: Redesigning Video-Language Evaluation Oct 10, 2024 Multiple-choice Open-Ended Question Answering
— Unverified 0Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning Oct 9, 2024 Hallucination Multiple-choice
Code Code Available 0Video Instruction Tuning With Synthetic Data Oct 3, 2024 3D Question Answering (3D-QA)
— Unverified 0CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarks Sep 19, 2024 Instruction Following Open-Ended Question Answering
— Unverified 0Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions Aug 19, 2024 Open-Ended Question Answering Question Answering
Code Code Available 0Reference-Guided Verdict: LLMs-as-Judges in Automatic Evaluation of Free-Form Text Aug 17, 2024 Diversity Form
— Unverified 0TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models Jul 12, 2024 Code Generation Math
— Unverified 0LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors Jun 20, 2024 16k Instruction Following
Code Code Available 1Extrinsic Evaluation of Cultural Competence in Large Language Models Jun 17, 2024 Open-Ended Question Answering Question Answering
Code Code Available 0SCAR: Efficient Instruction-Tuning for Large Language Models via Style Consistency-Aware Response Ranking Jun 16, 2024 Open-Ended Question Answering Question Answering
Code Code Available 1Long Story Short: Story-level Video Understanding from 20K Short Films Jun 14, 2024 Multiple Choice Question Answering (MCQA) Open-Ended Question Answering
— Unverified 0Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering May 23, 2024 Open-Ended Question Answering Question Answering
— Unverified 0Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation May 22, 2024 Informativeness Language Modeling
Code Code Available 2SciQAG: A Framework for Auto-Generated Science Question Answering Dataset with Fine-grained Evaluation May 16, 2024 Open-Ended Question Answering Question Answering
Code Code Available 1Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ Mar 6, 2024 Open-Ended Question Answering Question Answering
Code Code Available 0API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access Mar 2, 2024 Conformal Prediction Open-Ended Question Answering
— Unverified 0Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering Feb 26, 2024 Evidence Selection Open-Ended Question Answering
Code Code Available 4BiMediX: Bilingual Medical Mixture of Experts LLM Feb 20, 2024 Mixture-of-Experts Multiple-choice
Code Code Available 1Enhancing Large Language Models with Pseudo- and Multisource- Knowledge Graphs for Open-ended Question Answering Feb 15, 2024 Graph Generation Knowledge Graphs
— Unverified 0Shai: A large language model for asset management Dec 21, 2023 Asset Management Language Modeling
— Unverified 0On Early Detection of Hallucinations in Factual Question Answering Dec 19, 2023 Hallucination Open-Ended Question Answering
Code Code Available 1Universal Self-Consistency for Large Language Model Generation Nov 29, 2023 Code Generation Language Modeling
— Unverified 0Downstream Trade-offs of a Family of Text Watermarks Nov 16, 2023 Form Language Modelling
Code Code Available 0Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca Sep 16, 2023 Instruction Following Large Language Model
Code Code Available 0Prompting Large Language Models with Speech Recognition Abilities Jul 21, 2023 Abstractive Text Summarization Automatic Speech Recognition
— Unverified 0PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations Jul 6, 2023 Language Modeling Language Modelling
Code Code Available 1On the Model-Misspecification in Reinforcement Learning Jun 19, 2023 model Open-Ended Question Answering
— Unverified 02D-Shapley: A Framework for Fragmented Data Valuation Jun 18, 2023 counterfactual Data Valuation
Code Code Available 0Adversaries with Limited Information in the Friedkin--Johnsen Model Jun 17, 2023 Open-Ended Question Answering Sociology
Code Code Available 0POP: Prompt Of Prompts for Continual Learning Jun 14, 2023 Continual Learning Open-Ended Question Answering
— Unverified 0Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models Jun 13, 2023 Catalytic activity prediction Chemical-Disease Interaction Extraction
Code Code Available 2Provable Accelerated Convergence of Nesterov's Momentum for Deep ReLU Neural Networks Jun 13, 2023 Open-Ended Question Answering
— Unverified 0Non-autoregressive Conditional Diffusion Models for Time Series Prediction Jun 8, 2023 Denoising Open-Ended Question Answering
— Unverified 0Benchmarking Foundation Models with Language-Model-as-an-Examiner Jun 7, 2023 Benchmarking Language Modeling
— Unverified 0Differences in boundary behavior in the 3D vertex and Voronoi models Jun 6, 2023 Open-Ended Question Answering
— Unverified 0