| Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Mar 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Rapid patient-specific neural networks for intraoperative X-ray to volume registration | Mar 20, 2025 | | CodeCode Available | 2 |
| SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer | Mar 20, 2025 | DecoderMamba | CodeCode Available | 2 |
| Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation | Mar 20, 2025 | | CodeCode Available | 2 |
| Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens | Mar 20, 2025 | 3D Generation | CodeCode Available | 2 |
| DnLUT: Ultra-Efficient Color Image Denoising via Channel-Aware Lookup Tables | Mar 20, 2025 | Color Image DenoisingDenoising | CodeCode Available | 2 |
| The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation | Mar 19, 2025 | Change DetectionEarth Observation | CodeCode Available | 2 |
| VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning | Mar 19, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 2 |
| LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning | Mar 19, 2025 | Instruction FollowingMultimodal Reasoning | CodeCode Available | 2 |
| Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-Text Matching | Mar 19, 2025 | Image-text matchingText Matching | CodeCode Available | 2 |
| High-Order Control Barrier Functions: Insights and a Truncated Taylor-Based Formulation | Mar 19, 2025 | Collision Avoidance | CodeCode Available | 2 |
| DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis | Mar 19, 2025 | | CodeCode Available | 2 |
| Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology | Mar 19, 2025 | Cross-Modal RetrievalDiagnostic | CodeCode Available | 2 |
| PET-MAD, a universal interatomic potential for advanced materials modeling | Mar 18, 2025 | Diversity | CodeCode Available | 2 |
| Advances in 4D Generation: A Survey | Mar 18, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 |
| SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas | Mar 18, 2025 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Mar 18, 2025 | Instance SegmentationObject | CodeCode Available | 2 |
| LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers | Mar 18, 2025 | Automated Feature EngineeringFeature Engineering | CodeCode Available | 2 |
| LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection | Mar 18, 2025 | Computational Efficiencyobject-detection | CodeCode Available | 2 |
| DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal | Mar 18, 2025 | | CodeCode Available | 2 |
| Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models | Mar 18, 2025 | AnatomyAttribute | CodeCode Available | 2 |
| Reinforcement learning-based motion imitation for physiologically plausible musculoskeletal motor control | Mar 18, 2025 | Humanoid ControlMotion Synthesis | CodeCode Available | 2 |
| SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing | Mar 18, 2025 | DenoisingMotion Generation | CodeCode Available | 2 |
| Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | Mar 18, 2025 | GPULanguage Modeling | CodeCode Available | 2 |
| Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning | Mar 18, 2025 | Autonomous DrivingMotion Planning | CodeCode Available | 2 |