| AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection | Apr 16, 2025 | Anomaly DetectionLarge Language Model | CodeCode Available | 1 |
| The Tenth NTIRE 2025 Image Denoising Challenge Report | Apr 16, 2025 | DenoisingImage Denoising | CodeCode Available | 1 |
| SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature Aggregation | Apr 16, 2025 | Action RecognitionOne-Shot 3D Action Recognition | CodeCode Available | 1 |
| Evaluating the Goal-Directedness of Large Language Models | Apr 16, 2025 | | CodeCode Available | 1 |
| Dense Backpropagation Improves Training for Sparse Mixture-of-Experts | Apr 16, 2025 | Mixture-of-Experts | CodeCode Available | 1 |
| Activated LoRA: Fine-tuned LLMs for Intrinsics | Apr 16, 2025 | | CodeCode Available | 1 |
| Climate-economy projections under shared socioeconomic pathways and net-zero scenarios | Apr 16, 2025 | | CodeCode Available | 1 |
| DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging | Apr 16, 2025 | Image Generationmodel | CodeCode Available | 1 |
| HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks | Apr 16, 2025 | High-Level SynthesisLarge Language Model | CodeCode Available | 1 |
| Progent: Programmable Privilege Control for LLM Agents | Apr 16, 2025 | Blocking | CodeCode Available | 1 |
| InjectLab: A Tactical Framework for Adversarial Threat Modeling Against Large Language Models | Apr 16, 2025 | | CodeCode Available | 1 |
| The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMs | Apr 16, 2025 | Vulnerability Detection | CodeCode Available | 1 |
| Search is All You Need for Few-shot Anomaly Detection | Apr 16, 2025 | AllAnomaly Detection | CodeCode Available | 1 |
| Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach | Apr 16, 2025 | | CodeCode Available | 1 |
| GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision | Apr 16, 2025 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| Robust MPC for Uncertain Linear Systems -- Combining Model Adaptation and Iterative Learning | Apr 15, 2025 | Computational EfficiencyModel Predictive Control | CodeCode Available | 1 |
| MSCRS: Multi-modal Semantic Graph Prompt Learning Framework for Conversational Recommender Systems | Apr 15, 2025 | Prompt LearningRecommendation Systems | CodeCode Available | 1 |
| Adaptive Decision Boundary for Few-Shot Class-Incremental Learning | Apr 15, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| Deep Learning in Concealed Dense Prediction | Apr 15, 2025 | Deep LearningPrediction | CodeCode Available | 1 |
| Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps | Apr 15, 2025 | 3D ReconstructionBathymetry prediction | CodeCode Available | 1 |
| Change State Space Models for Remote Sensing Change Detection | Apr 15, 2025 | Change DetectionComputational Efficiency | CodeCode Available | 1 |
| PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation | Apr 15, 2025 | Foreground SegmentationImage Segmentation | CodeCode Available | 1 |
| Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly Detections | Apr 15, 2025 | Anomaly DetectionAnomaly Localization | CodeCode Available | 1 |
| LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews | Apr 15, 2025 | | CodeCode Available | 1 |
| Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion | Apr 15, 2025 | Lidar Scene Completion | CodeCode Available | 1 |
| Explicit and Implicit Representations in AI-based 3D Reconstruction for Radiology: A Systematic Review | Apr 15, 2025 | 3D ReconstructionSystematic Literature Review | CodeCode Available | 1 |
| A Dual-Space Framework for General Knowledge Distillation of Large Language Models | Apr 15, 2025 | Code GenerationGeneral Knowledge | CodeCode Available | 1 |
| DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Apr 15, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Teaching Large Language Models to Reason through Learning and Forgetting | Apr 15, 2025 | Mathematical Reasoning | CodeCode Available | 1 |
| R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning | Apr 15, 2025 | Adversarial Robustness | CodeCode Available | 1 |
| Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation | Apr 15, 2025 | MathQuantum Machine Learning | CodeCode Available | 1 |
| Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception | Apr 15, 2025 | Data AugmentationDenoising | CodeCode Available | 1 |
| SafeSpeech: Robust and Universal Voice Protection Against Malicious Speech Synthesis | Apr 14, 2025 | Face SwappingSpeech Synthesis | CodeCode Available | 1 |
| MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model | Apr 14, 2025 | ObjectPose Estimation | CodeCode Available | 1 |
| FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation | Apr 14, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users | Apr 14, 2025 | Instruction Following | CodeCode Available | 1 |
| Efficient Process Reward Model Training via Active Learning | Apr 14, 2025 | Active LearningMath | CodeCode Available | 1 |
| EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control | Apr 14, 2025 | Hallucination | CodeCode Available | 1 |
| Towards Low-Latency Event-based Obstacle Avoidance on a FPGA-Drone | Apr 14, 2025 | Collision AvoidanceEvent-based vision | CodeCode Available | 1 |
| Efficient Generative Model Training via Embedded Representation Warmup | Apr 14, 2025 | | CodeCode Available | 1 |
| DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting | Apr 14, 2025 | Image Relighting | CodeCode Available | 1 |
| Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing | Apr 14, 2025 | Image GenerationText to Image Generation | CodeCode Available | 1 |
| DUE: A Deep Learning Framework and Library for Modeling Unknown Equations | Apr 14, 2025 | | CodeCode Available | 1 |
| Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition | Apr 14, 2025 | Computational EfficiencyImage Retrieval | CodeCode Available | 1 |
| TinyverseGP: Towards a Modular Cross-domain Benchmarking Framework for Genetic Programming | Apr 14, 2025 | BenchmarkingProgram Synthesis | CodeCode Available | 1 |
| SoccerNet-v3D: Leveraging Sports Broadcast Replays for 3D Scene Understanding | Apr 14, 2025 | Camera CalibrationObject Localization | CodeCode Available | 1 |
| Multimodal Long Video Modeling Based on Temporal Dynamic Context | Apr 14, 2025 | Video Understanding | CodeCode Available | 1 |
| TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models | Apr 14, 2025 | Diversity | CodeCode Available | 1 |
| Attention GhostUNet++: Enhanced Segmentation of Adipose Tissue and Liver in CT Images | Apr 14, 2025 | Computational EfficiencyLiver Segmentation | CodeCode Available | 1 |
| M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models | Apr 14, 2025 | MambaMath | CodeCode Available | 1 |