| BrainLesion Suite: A Flexible and User-Friendly Framework for Modular Brain Lesion Image Analysis | Jul 11, 2025 | Skull Stripping | Code Available | 1 |
| M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning | Jul 11, 2025 | Spatial Reasoning | Unverified | 0 |
| Exploiting Leaderboards for Large-Scale Distribution of Malicious Models | Jul 11, 2025 | Model Discovery, Text Generation | Unverified | 0 |
| Towards Imperceptible JPEG Image Hiding: Multi-range Representations-driven Adversarial Stego Generation | Jul 11, 2025 | Disentanglement, Steganalysis | Unverified | 0 |
| Lightweight Safety Guardrails via Synthetic Data and RL-guided Adversarial Training | Jul 11, 2025 | Generative Adversarial Network, Synthetic Data Generation | Unverified | 0 |
| Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks | Jul 11, 2025 | image-classification, Image Classification | Unverified | 0 |
| Towards Collaborative Fairness in Federated Learning Under Imbalanced Covariate Shift | Jul 11, 2025 | Collaborative Fairness, Fairness | Unverified | 0 |
| SFedKD: Sequential Federated Learning with Discrepancy-Aware Multi-Teacher Knowledge Distillation | Jul 11, 2025 | Federated Learning, Knowledge Distillation | Unverified | 0 |
| Lizard: An Efficient Linearization Framework for Large Language Models | Jul 11, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| From Physics to Foundation Models: A Review of AI-Driven Quantitative Remote Sensing Inversion | Jul 11, 2025 | Domain Generalization, Uncertainty Quantification | Unverified | 0 |
| FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation | Jul 11, 2025 | Audio Generation, Data Augmentation | Unverified | 0 |
| MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling | Jul 11, 2025 | Audio Synthesis, Language Modelling | Unverified | 0 |
| Compress Any Segment Anything Model (SAM) | Jul 11, 2025 | model, Quantization | Code Available | 1 |
| Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion | Jul 11, 2025 | 3D Semantic Scene Completion | Code Available | 1 |
| Dual Dimensions Geometric Representation Learning Based Document Dewarping | Jul 11, 2025 | Representation Learning | Code Available | 1 |
| Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation | Jul 11, 2025 | 4k, Emotion Recognition | Code Available | 0 |
| Unsupervised Methods for Video Quality Improvement: A Survey of Restoration and Enhancement Techniques | Jul 11, 2025 | Video Restoration | Unverified | 0 |
| Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective | Jul 11, 2025 | Video Generation | Code Available | 0 |
| Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT | Jul 11, 2025 | 3D Reconstruction, Autonomous Driving | Unverified | 0 |
| Geo-ORBIT: A Federated Digital Twin Framework for Scene-Adaptive Lane Geometry Detection | Jul 11, 2025 | Computational Efficiency, Federated Learning | Code Available | 0 |
| Repairing Language Model Pipelines by Meta Self-Refining Competing Constraints at Runtime | Jul 11, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan | Jul 11, 2025 | 3D Reconstruction, Optical Flow Estimation | Unverified | 0 |
| RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features | Jul 11, 2025 | Contrastive Learning, Image Retrieval | Code Available | 1 |
| VIP: Visual Information Protection through Adversarial Attacks on Vision-Language Models | Jul 11, 2025 | Adversarial Attack | Code Available | 0 |
| Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation | Jul 11, 2025 | Image Generation, Image Reconstruction | Unverified | 0 |
| The Bayesian Approach to Continual Learning: An Overview | Jul 11, 2025 | Bayesian Inference, class-incremental learning | Unverified | 0 |
| AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs | Jul 11, 2025 | | Unverified | 0 |
| Fairness Is Not Enough: Auditing Competence and Intersectional Bias in AI-powered Resume Screening | Jul 11, 2025 | Fairness | Code Available | 0 |
| Model Parallelism With Subnetwork Data Parallelism | Jul 11, 2025 | Attribute, Federated Learning | Unverified | 0 |
| Scaling Attention to Very Long Sequences in Linear Time with Wavelet-Enhanced Random Spectral Attention (WERSA) | Jul 11, 2025 | GPU | Code Available | 0 |
| When and Where do Data Poisons Attack Textual Inversion? | Jul 11, 2025 | | Code Available | 0 |
| An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation | Jul 11, 2025 | Domain Adaptation | Unverified | 0 |
| ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way | Jul 11, 2025 | Depth Estimation, Hallucination | Unverified | 0 |
| Entangled Threats: A Unified Kill Chain Model for Quantum Machine Learning Security | Jul 11, 2025 | Model extraction, Quantum Machine Learning | Unverified | 0 |
| An Adaptive Volatility-based Learning Rate Scheduler | Jul 11, 2025 | Scheduling | Unverified | 0 |
| Comparative Analysis of Vision Transformers and Traditional Deep Learning Approaches for Automated Pneumonia Detection in Chest X-Rays | Jul 11, 2025 | Computational Efficiency, Pneumonia Detection | Unverified | 0 |
| From Classical Machine Learning to Emerging Foundation Models: Review on Multimodal Data Integration for Cancer Research | Jul 11, 2025 | Data Integration | Code Available | 0 |
| Exploring Design of Multi-Agent LLM Dialogues for Research Ideation | Jul 11, 2025 | Diversity | Code Available | 1 |
| Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework | Jul 11, 2025 | Clustering, Crowd Counting | Code Available | 0 |
| RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting | Jul 11, 2025 | Image Inpainting | Unverified | 0 |
| A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning | Jul 11, 2025 | Math, Mathematical Reasoning | Code Available | 1 |
| KAT-V1: Kwai-AutoThink Technical Report | Jul 11, 2025 | Knowledge Distillation, Large Language Model | Unverified | 0 |
| Prospective Learning in Retrospect | Jul 10, 2025 | | Code Available | 0 |
| Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning | Jul 10, 2025 | | Unverified | 0 |
| Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models | Jul 10, 2025 | | Unverified | 0 |
| Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs | Jul 10, 2025 | | Unverified | 0 |
| Shifting from Ranking to Set Selection for Retrieval Augmented Generation | Jul 10, 2025 | | Code Available | 0 |
| Multi-modal Representations for Fine-grained Multi-label Critical View of Safety Recognition | Jul 10, 2025 | | Code Available | 0 |
| Dual Semantic-Aware Network for Noise Suppressed Ultrasound Video Segmentation | Jul 10, 2025 | | Code Available | 0 |
| RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning | Jul 10, 2025 | | Code Available | 0 |