| U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV | Jul 15, 2025 | Computational EfficiencyImage Segmentation | CodeCode Available | 1 |
| Text-Visual Semantic Constrained AI-Generated Image Quality Assessment | Jul 14, 2025 | Image DescriptionImage Quality Assessment | CodeCode Available | 1 |
| REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once | Jul 14, 2025 | | CodeCode Available | 1 |
| IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution | Jul 14, 2025 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 1 |
| Graph World Model | Jul 14, 2025 | Graph Learningmodel | CodeCode Available | 1 |
| 4D-Animal: Freely Reconstructing Animatable 3D Animals from Videos | Jul 14, 2025 | | CodeCode Available | 1 |
| WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling | Jul 14, 2025 | Music Generation | CodeCode Available | 1 |
| Warehouse Spatial Question Answering with LLM Agent | Jul 14, 2025 | Question AnsweringSpatial Reasoning | CodeCode Available | 1 |
| Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination | Jul 14, 2025 | MathMathematical Reasoning | CodeCode Available | 1 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins | Jul 11, 2025 | PredictionProtein Structure Prediction | CodeCode Available | 1 |
| BrainLesion Suite: A Flexible and User-Friendly Framework for Modular Brain Lesion Image Analysis | Jul 11, 2025 | Skull Stripping | CodeCode Available | 1 |
| RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features | Jul 11, 2025 | Contrastive LearningImage Retrieval | CodeCode Available | 1 |
| A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning | Jul 11, 2025 | MathMathematical Reasoning | CodeCode Available | 1 |
| Exploring Design of Multi-Agent LLM Dialogues for Research Ideation | Jul 11, 2025 | Diversity | CodeCode Available | 1 |
| Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion | Jul 11, 2025 | 3D Semantic Scene Completion | CodeCode Available | 1 |
| Dual Dimensions Geometric Representation Learning Based Document Dewarping | Jul 11, 2025 | Representation Learning | CodeCode Available | 1 |
| Compress Any Segment Anything Model (SAM) | Jul 11, 2025 | modelQuantization | CodeCode Available | 1 |
| Rethinking Query-based Transformer for Continual Image Segmentation | Jul 10, 2025 | Continual LearningImage Segmentation | CodeCode Available | 1 |
| Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections | Jul 10, 2025 | Interactive SegmentationSegmentation | CodeCode Available | 1 |
| HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking | Jul 10, 2025 | Motion EstimationObject Tracking | CodeCode Available | 1 |
| PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency | Jul 10, 2025 | Depth Completion | CodeCode Available | 1 |
| NLGCL: Naturally Existing Neighbor Layers Graph Contrastive Learning for Recommendation | Jul 10, 2025 | Collaborative FilteringContrastive Learning | CodeCode Available | 1 |
| Rethinking Verification for LLM Code Generation: From Generation to Testing | Jul 9, 2025 | Code GenerationHumanEval | CodeCode Available | 1 |
| HVI-CIDNet+: Beyond Extreme Darkness for Low-Light Image Enhancement | Jul 9, 2025 | Image EnhancementLow-Light Image Enhancement | CodeCode Available | 1 |
| RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models | Jul 8, 2025 | cross-modal alignmentImage Segmentation | CodeCode Available | 1 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NeoBabel: A Multilingual Open Tower for Visual Generation | Jul 8, 2025 | Image GenerationText to Image Generation | CodeCode Available | 1 |
| eegFloss: A Python package for refining sleep EEG recordings using machine learning models | Jul 8, 2025 | EEGSleep Staging | CodeCode Available | 1 |
| Prompt-Free Conditional Diffusion for Multi-object Image Augmentation | Jul 8, 2025 | DiversityDomain Generalization | CodeCode Available | 1 |
| Robust One-step Speech Enhancement via Consistency Distillation | Jul 8, 2025 | Speech Enhancement | CodeCode Available | 1 |
| The Delta Learning Hypothesis: Preference Tuning on Weak Data can Yield Strong Gains | Jul 8, 2025 | MathMMLU | CodeCode Available | 1 |
| Kamae: Bridging Spark and Keras for Seamless ML Preprocessing | Jul 8, 2025 | Learning-To-RankRecommendation Systems | CodeCode Available | 1 |
| Differential Mamba | Jul 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding | Jul 8, 2025 | Autonomous DrivingVideo Understanding | CodeCode Available | 1 |
| ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion Models | Jul 8, 2025 | Adversarial AttackDenoising | CodeCode Available | 1 |
| CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization | Jul 8, 2025 | Active LearningAutomated Theorem Proving | CodeCode Available | 1 |
| LangMamba: A Language-driven Mamba Framework for Low-dose CT Denoising with Vision-language Models | Jul 8, 2025 | DenoisingDiagnostic | CodeCode Available | 1 |
| FindRec: Stein-Guided Entropic Flow for Multi-Modal Sequential Recommendation | Jul 7, 2025 | MambaRecommendation Systems | CodeCode Available | 1 |
| LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework | Jul 7, 2025 | | CodeCode Available | 1 |
| The Extended SONICOM HRTF Dataset and Spatial Audio Metrics Toolbox | Jul 7, 2025 | | CodeCode Available | 1 |
| Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Jul 7, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting | Jul 7, 2025 | Depth EstimationVision-Language-Action | CodeCode Available | 1 |
| SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model | Jul 7, 2025 | AnatomyImage Generation | CodeCode Available | 1 |
| Exploring Remote Physiological Signal Measurement under Dynamic Lighting Conditions at Night: Dataset, Experiment, and Analysis | Jul 6, 2025 | Emotion Recognition | CodeCode Available | 1 |
| LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models | Jul 5, 2025 | BenchmarkingGPU | CodeCode Available | 1 |
| SAMed-2: Selective Memory Enhanced Medical Segment Anything Model | Jul 4, 2025 | Continual LearningImage Segmentation | CodeCode Available | 1 |
| CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark | Jul 4, 2025 | Bug fixingCode Generation | CodeCode Available | 1 |
| Be the Change You Want to See: Revisiting Remote Sensing Change Detection Practices | Jul 4, 2025 | Change Detection | CodeCode Available | 1 |
| Be the Change You Want to See: Revisiting Remote Sensing Change Detection Practices | Jul 4, 2025 | Change Detection | CodeCode Available | 1 |