| HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation | Jul 17, 2025 | Reasoning Segmentation, World Knowledge | Unverified | 0 |
| Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes | Jul 17, 2025 | Common Sense Reasoning, World Knowledge | Unverified | 0 |
| KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection | Jul 13, 2025 | Fake News Detection, Misinformation | Unverified | 0 |
| Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models | Jul 8, 2025 | Future prediction, Large Language Model | Unverified | 0 |
| DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Jul 6, 2025 | Image Generation, Multimodal Reasoning | Code Available | 3 |
| A Semi-supervised Scalable Unified Framework for E-commerce Query Classification | Jun 26, 2025 | Classification, World Knowledge | Unverified | 0 |
| From 2D to 3D Cognition: A Brief Survey of General World Models | Jun 25, 2025 | Autonomous Driving, Scene Generation | Unverified | 0 |
| MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations | Jun 25, 2025 | World Knowledge | Code Available | 0 |
| Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment | Jun 24, 2025 | Informativeness, reinforcement-learning | Code Available | 0 |
| ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge | Jun 17, 2025 | Benchmarking, Retrieval | Code Available | 0 |