| Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Aug 7, 2024 | Attribute, In-Context Learning | Code Available | 2 |
| CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge | Jul 30, 2024 | In-Context Learning, Knowledge Graphs | Unverified | 0 |
| SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages | Jul 29, 2024 | Diversity, Instruction Following | Code Available | 2 |
| Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models | Jul 28, 2024 | World Knowledge | Unverified | 0 |
| DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models | Jul 24, 2024 | Retrieval-augmented Generation, World Knowledge | Code Available | 0 |
| Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Jul 22, 2024 | Disentanglement, Question Answering | Code Available | 0 |
| Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data | Jul 20, 2024 | Language Modelling, Machine Translation | Unverified | 0 |
| LoFTI: Localization and Factuality Transfer to Indian Locales | Jul 16, 2024 | World Knowledge | Code Available | 0 |
| VISA: Reasoning Video Object Segmentation via Large Language Models | Jul 16, 2024 | Decoder, Object | Code Available | 3 |
| Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Jul 10, 2024 | counterfactual, Fact Checking | Code Available | 1 |