| Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis | Apr 18, 2025 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations | Apr 18, 2025 | Hallucination | CodeCode Available | 1 |
| p2smi: A Python Toolkit for Peptide FASTA-to-SMILES Conversion and Molecular Property Analysis | Apr 18, 2025 | | CodeCode Available | 1 |
| WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion | Apr 18, 2025 | Contrastive LearningDenoising | CodeCode Available | 1 |
| Learning to Attribute with Attention | Apr 18, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling | Apr 18, 2025 | Machine TranslationTranslation | CodeCode Available | 1 |
| Meta-Learning and Knowledge Discovery based Physics-Informed Neural Network for Remaining Useful Life Prediction | Apr 18, 2025 | Meta-Learning | CodeCode Available | 1 |
| Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping | Apr 18, 2025 | AllImage Segmentation | CodeCode Available | 1 |
| U-Shape Mamba: State Space Model for faster diffusion | Apr 18, 2025 | DecoderImage Generation | CodeCode Available | 1 |
| A Deep Learning-Based Supervised Transfer Learning Framework for DOA Estimation with Array Imperfections | Apr 18, 2025 | Deep LearningTransfer Learning | CodeCode Available | 1 |
| CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning | Apr 18, 2025 | Common Sense Reasoningimage-classification | CodeCode Available | 1 |
| Filter2Noise: Interpretable Self-Supervised Single-Image Denoising for Low-Dose CT with Attention-Guided Bilateral Filtering | Apr 18, 2025 | DenoisingDiagnostic | CodeCode Available | 1 |
| Bayesian continual learning and forgetting in neural networks | Apr 18, 2025 | Bayesian InferenceContinual Learning | CodeCode Available | 1 |
| Towards Scale-Aware Low-Light Enhancement via Structure-Guided Transformer Design | Apr 18, 2025 | Image EnhancementLow-Light Image Enhancement | CodeCode Available | 1 |
| FocusTrack: A Self-Adaptive Local Sampling Algorithm for Efficient Anti-UAV Tracking | Apr 18, 2025 | Computational Efficiency | CodeCode Available | 1 |
| Compile Scene Graphs with Reinforcement Learning | Apr 18, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework | Apr 18, 2025 | RGBD Semantic SegmentationSemantic Segmentation | CodeCode Available | 1 |
| STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings | Apr 18, 2025 | Articles | CodeCode Available | 1 |
| MIB: A Mechanistic Interpretability Benchmark | Apr 17, 2025 | | CodeCode Available | 1 |
| Hierarchical Feature Learning for Medical Point Clouds via State Space Model | Apr 17, 2025 | Anatomy | CodeCode Available | 1 |
| Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction | Apr 17, 2025 | 3D Semantic Occupancy Prediction | CodeCode Available | 1 |
| TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution | Apr 17, 2025 | DenoisingImage Super-Resolution | CodeCode Available | 1 |
| UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty | Apr 17, 2025 | Autonomous Drivingmotion prediction | CodeCode Available | 1 |
| GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration | Apr 17, 2025 | 3DGSNeRF | CodeCode Available | 1 |
| Hierarchical Vector Quantized Graph Autoencoder with Annealing-Based Code Selection | Apr 17, 2025 | Link PredictionNode Classification | CodeCode Available | 1 |
| Mask Image Watermarking | Apr 17, 2025 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| Post-pre-training for Modality Alignment in Vision-Language Foundation Models | Apr 17, 2025 | | CodeCode Available | 1 |
| NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results | Apr 17, 2025 | FormImage Super-Resolution | CodeCode Available | 1 |
| Retrieval-Augmented Generation with Conflicting Evidence | Apr 17, 2025 | Large Language ModelMisinformation | CodeCode Available | 1 |
| EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance | Apr 17, 2025 | Anatomy | CodeCode Available | 1 |
| Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation | Apr 17, 2025 | Semantic Segmentation | CodeCode Available | 1 |
| Building Russian Benchmark for Evaluation of Information Retrieval Models | Apr 17, 2025 | Information RetrievalRetrieval | CodeCode Available | 1 |
| Personalized Text-to-Image Generation with Auto-Regressive Models | Apr 17, 2025 | Image GenerationPersonalized Image Generation | CodeCode Available | 1 |
| CDF-RAG: Causal Dynamic Feedback for Adaptive Retrieval-Augmented Generation | Apr 17, 2025 | RAGRetrieval | CodeCode Available | 1 |
| Graph Learning at Scale: Characterizing and Optimizing Pre-Propagation GNNs | Apr 17, 2025 | Graph LearningGraph Sampling | CodeCode Available | 1 |
| Collaborative Perception Datasets for Autonomous Driving: A Review | Apr 17, 2025 | Autonomous DrivingDomain Adaptation | CodeCode Available | 1 |
| TimeCapsule: Solving the Jigsaw Puzzle of Long-Term Time Series Forecasting with Compressed Predictive Representations | Apr 17, 2025 | Time SeriesTime Series Forecasting | CodeCode Available | 1 |
| Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs | Apr 17, 2025 | 3D geometry3DGS | CodeCode Available | 1 |
| Towards Lossless Token Pruning in Late-Interaction Retrieval Models | Apr 17, 2025 | Retrieval | CodeCode Available | 1 |
| Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond | Apr 17, 2025 | Anatomy | CodeCode Available | 1 |
| Data-efficient LLM Fine-tuning for Code Generation | Apr 17, 2025 | Code GenerationGPU | CodeCode Available | 1 |
| VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models | Apr 17, 2025 | HallucinationVideo Understanding | CodeCode Available | 1 |
| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem SolvingLarge Language Model | CodeCode Available | 1 |
| SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding | Apr 17, 2025 | Image GenerationLarge Language Model | CodeCode Available | 1 |
| ZeroSumEval: Scaling LLM Evaluation with Inter-Model Competition | Apr 17, 2025 | model | CodeCode Available | 1 |
| NTIRE 2025 Challenge on Event-Based Image Deblurring: Methods and Results | Apr 16, 2025 | DeblurringEvent-based vision | CodeCode Available | 1 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot LearningInteractive Segmentation | CodeCode Available | 1 |
| VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate | Apr 16, 2025 | Video Generation | CodeCode Available | 1 |
| RadMamba: Efficient Human Activity Recognition through Radar-based Micro-Doppler-Oriented Mamba State-Space Model | Apr 16, 2025 | Activity RecognitionHuman Activity Recognition | CodeCode Available | 1 |
| Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT? | Apr 16, 2025 | Mathematical Reasoning | CodeCode Available | 1 |