| Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Jul 22, 2024 | DisentanglementQuestion Answering | CodeCode Available | 0 |
| Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data | Jul 20, 2024 | Language ModellingMachine Translation | —Unverified | 0 |
| LoFTI: Localization and Factuality Transfer to Indian Locales | Jul 16, 2024 | World Knowledge | CodeCode Available | 0 |
| VISA: Reasoning Video Object Segmentation via Large Language Models | Jul 16, 2024 | DecoderObject | CodeCode Available | 3 |
| Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Jul 10, 2024 | counterfactualFact Checking | CodeCode Available | 1 |
| VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Jul 9, 2024 | Autonomous DrivingImage to 3D | —Unverified | 0 |
| Language Representations Can be What Recommenders Need: Findings and Potentials | Jul 7, 2024 | Collaborative FilteringContrastive Learning | CodeCode Available | 2 |
| BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization | Jun 30, 2024 | Continual LearningGeneral Knowledge | —Unverified | 0 |
| LLaRA: Supercharging Robot Learning Data for Vision-Language Policy | Jun 28, 2024 | Vision-Language-ActionWorld Knowledge | CodeCode Available | 3 |
| Scaling Synthetic Data Creation with 1,000,000,000 Personas | Jun 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 11 |
| Mental Modeling of Reinforcement Learning Agents by Language Models | Jun 26, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| LABOR-LLM: Language-Based Occupational Representations with Large Language Models | Jun 25, 2024 | In-Context LearningJob Prediction | —Unverified | 0 |
| Mitigating Hallucination in Fictional Character Role-Play | Jun 25, 2024 | HallucinationWorld Knowledge | CodeCode Available | 0 |
| Exploring Factual Entailment with NLI: A News Media Study | Jun 24, 2024 | ArticlesFew-Shot Learning | —Unverified | 0 |
| Evaluating the Ability of Large Language Models to Reason about Cardinal Directions | Jun 24, 2024 | World Knowledge | —Unverified | 0 |
| On the Role of Long-tail Knowledge in Retrieval Augmented Large Language Models | Jun 24, 2024 | RAGRetrieval | —Unverified | 0 |
| LangSuitE: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments | Jun 24, 2024 | World Knowledge | CodeCode Available | 2 |
| OCALM: Object-Centric Assessment with Language Models | Jun 24, 2024 | ObjectReinforcement Learning (RL) | —Unverified | 0 |
| What Teaches Robots to Walk, Teaches Them to Trade too -- Regime Adaptive Execution using Informed Data and LLMs | Jun 20, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Locating and Extracting Relational Concepts in Large Language Models | Jun 19, 2024 | World Knowledge | CodeCode Available | 0 |
| WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia | Jun 19, 2024 | Language ModellingRAG | —Unverified | 0 |
| Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning | Jun 18, 2024 | BenchmarkingWorld Knowledge | CodeCode Available | 0 |
| Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams | Jun 17, 2024 | AllBenchmarking | CodeCode Available | 0 |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Jun 17, 2024 | In-Context Learningvalid | CodeCode Available | 0 |
| RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models | Jun 16, 2024 | Adversarial AttackBenchmarking | CodeCode Available | 2 |