| WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning | May 6, 2024 | Multiple-choiceVideo Understanding | —Unverified | 0 |
| LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model | May 3, 2024 | Image CaptioningInstruction Following | CodeCode Available | 0 |
| Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning | May 2, 2024 | Knowledge GraphsLogical Reasoning | —Unverified | 0 |
| Learning Multiple Object States from Actions via Large Language Models | May 2, 2024 | Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION | —Unverified | 0 |
| DOCCI: Descriptions of Connected and Contrasting Images | Apr 30, 2024 | Image GenerationImage to text | —Unverified | 0 |
| Reinforcement Learning Problem Solving with Large Language Models | Apr 29, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Re-Thinking Inverse Graphics With Large Language Models | Apr 23, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Apr 22, 2024 | BenchmarkingWorld Knowledge | CodeCode Available | 1 |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Apr 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning | Apr 19, 2024 | Benchmarkingcounterfactual | —Unverified | 0 |