| Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling | Jun 13, 2025 | Decoder, Image Segmentation | Code Available | 0 |
| DEAL: Disentangling Transformer Head Activations for LLM Steering | Jun 10, 2025 | Binary Classification, Zero-shot Generalization | Unverified | 0 |
| Deep Equivariant Multi-Agent Control Barrier Functions | Jun 9, 2025 | Robot Navigation, Zero-shot Generalization | Unverified | 0 |
| CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray | Jun 9, 2025 | Classification, Diagnostic | Unverified | 0 |
| ZeroVO: Visual Odometry with Minimal Assumptions | Jun 9, 2025 | Autonomous Driving, Camera Calibration | Unverified | 0 |
| Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application | Jun 6, 2025 | Denoising, Semantic Communication | Unverified | 0 |
| Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation | Jun 5, 2025 | Zero-shot Generalization | Unverified | 0 |
| Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | Jun 5, 2025 | 3DGS, Dataset Generation | Unverified | 0 |
| Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | Jun 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers | May 26, 2025 | cross-modal alignment, Position | Unverified | 0 |