| Position: Interactive Generative Video as Next-Generation Game Engine | Mar 21, 2025 | PositionVideo Generation | —Unverified | 0 |
| Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images | Mar 21, 2025 | Image SegmentationMamba | CodeCode Available | 2 |
| FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing | Mar 20, 2025 | Image GenerationPosition | —Unverified | 0 |
| Multi-Modal Gesture Recognition from Video and Surgical Tool Pose Information via Motion Invariants | Mar 19, 2025 | Activity RecognitionGesture Recognition | —Unverified | 0 |
| Movable-Element RIS-Aided Wireless Communications: An Element-Wise Position Optimization Approach | Mar 19, 2025 | Position | —Unverified | 0 |
| Visual Position Prompt for MLLM based Visual Grounding | Mar 19, 2025 | PositionVisual Grounding | CodeCode Available | 1 |
| DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling | Mar 19, 2025 | Autonomous DrivingPosition | —Unverified | 0 |
| Inductive Position Sensors based on Coupling of Coils on Printed Circuit Boards for Demanding Automotive Applications | Mar 18, 2025 | Position | —Unverified | 0 |
| Identifying and Mitigating Position Bias of Multi-image Vision-Language Models | Mar 18, 2025 | PositionQuestion Answering | —Unverified | 0 |
| Computation Mechanism Behind LLM Position Generalization | Mar 17, 2025 | DisentanglementPosition | —Unverified | 0 |