| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | May 19, 2025 | Graph GenerationKnowledge Distillation | —Unverified | 0 |
| AoP-SAM: Automation of Prompts for Efficient Segmentation | May 17, 2025 | Image SegmentationPrompt Engineering | —Unverified | 0 |
| RVTBench: A Benchmark for Visual Reasoning Tasks | May 17, 2025 | Reasoning SegmentationVisual Question Answering (VQA) | CodeCode Available | 0 |
| GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction | May 16, 2025 | General KnowledgeZero-shot Generalization | CodeCode Available | 0 |
| Depth Anything with Any Prior | May 15, 2025 | Depth CompletionDepth Estimation | —Unverified | 0 |
| NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning | May 15, 2025 | Novel View SynthesisRobot Manipulation | —Unverified | 0 |
| Denoising and Alignment: Rethinking Domain Generalization for Multimodal Face Anti-Spoofing | May 14, 2025 | cross-modal alignmentDenoising | —Unverified | 0 |
| Visual Image Reconstruction from Brain Activity via Latent Representation | May 13, 2025 | Early ClassificationImage Reconstruction | —Unverified | 0 |
| Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence | May 11, 2025 | Computational EfficiencyFederated Learning | —Unverified | 0 |
| Learning Graph Representation of Agent Diffusers | May 10, 2025 | Graph Neural NetworkImage Generation | CodeCode Available | 0 |