| SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds | May 30, 2025 | Data AugmentationInstance Segmentation | —Unverified | 0 |
| ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation | May 30, 2025 | DecoderImage Segmentation | CodeCode Available | 0 |
| Werewolf: A Straightforward Game Framework with TTS for Improved User Engagement | May 30, 2025 | text-to-speechText to Speech | —Unverified | 0 |
| KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices | May 30, 2025 | Anomaly Detection | CodeCode Available | 0 |
| PCIE_Interaction Solution for Ego4D Social Interaction Challenge | May 30, 2025 | | CodeCode Available | 0 |
| Aligned but Blind: Alignment Increases Implicit Bias by Reducing Awareness of Race | May 30, 2025 | Machine Unlearning | CodeCode Available | 0 |
| GenSpace: Benchmarking Spatially-Aware Image Generation | May 30, 2025 | BenchmarkingImage Generation | —Unverified | 0 |
| Neural Drift Estimation for Ergodic Diffusions: Non-parametric Analysis and Numerical Exploration | May 30, 2025 | Generalization Bounds | —Unverified | 0 |
| Beyond Pretty Pictures: Combined Single- and Multi-Image Super-resolution for Sentinel-2 Images | May 30, 2025 | Earth ObservationImage Super-Resolution | —Unverified | 0 |
| Bottom-Up Perspectives on AI Governance: Insights from User Reviews of AI Products | May 30, 2025 | Ethics | —Unverified | 0 |
| Decoupled Competitive Framework for Semi-supervised Medical Image Segmentation | May 30, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 0 |
| Enhancing Drug Discovery: Autoencoder-Based Latent Space Augmentation for Improved Molecular Solubility Prediction using LatMixSol | May 30, 2025 | Drug DiscoveryFeature Compression | —Unverified | 0 |
| CL-LoRA: Continual Low-Rank Adaptation for Rehearsal-Free Class-Incremental Learning | May 30, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering | May 30, 2025 | Denoising | CodeCode Available | 2 |
| DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis? | May 30, 2025 | DiagnosticMedical Image Analysis | CodeCode Available | 1 |
| Reinforcing Video Reasoning with Focused Thinking | May 30, 2025 | Data AugmentationVisual Reasoning | CodeCode Available | 1 |
| DisTime: Distribution-based Time Representation for Video Large Language Models | May 30, 2025 | Temporal LocalizationVideo Understanding | CodeCode Available | 1 |
| Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Boosting All-in-One Image Restoration via Self-Improved Privilege Learning | May 30, 2025 | AllImage Restoration | CodeCode Available | 1 |
| Beyond the LUMIR challenge: The pathway to foundational registration models | May 30, 2025 | Image RegistrationZero-shot Generalization | CodeCode Available | 1 |
| Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation | May 30, 2025 | 3D Anomaly DetectionAnomaly Detection | CodeCode Available | 0 |
| Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation | May 30, 2025 | AllBenchmarking | CodeCode Available | 1 |
| Optimal Density Functions for Weighted Convolution in Learning Models | May 30, 2025 | DenoisingImage Denoising | CodeCode Available | 2 |
| Proactive Guidance of Multi-Turn Conversation in Industrial Search | May 30, 2025 | Knowledge Distillationreinforcement-learning | —Unverified | 0 |
| Unifying Language Agent Algorithms with Graph-based Orchestration Engine for Reproducible Agent Research | May 30, 2025 | Mathematical Reasoning | CodeCode Available | 1 |
| HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts | May 30, 2025 | ARCGeneral Knowledge | CodeCode Available | 1 |
| Hyperbolic Dataset Distillation | May 30, 2025 | Computational EfficiencyDataset Distillation | —Unverified | 0 |
| ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation | May 30, 2025 | RAGRetrieval | CodeCode Available | 0 |
| A Mathematical Perspective On Contrastive Learning | May 30, 2025 | Contrastive LearningRetrieval | —Unverified | 0 |
| Explainable Depression Detection using Masked Hard Instance Mining | May 30, 2025 | Depression Detection | —Unverified | 0 |
| Adaptive LoRA Merge with Parameter Pruning for Low-Resource Generation | May 30, 2025 | Text Generation | CodeCode Available | 0 |
| Transformers Are Universally Consistent | May 30, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents | May 30, 2025 | | CodeCode Available | 0 |
| SALE : Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling | May 30, 2025 | Large Language Model | CodeCode Available | 0 |
| STAR-Net: An Interpretable Model-Aided Network for Remote Sensing Image Denoising | May 30, 2025 | DenoisingImage Denoising | CodeCode Available | 0 |
| Efficient Text Encoders for Labor Market Analysis | May 30, 2025 | Contrastive LearningExtreme Multi-Label Classification | —Unverified | 0 |
| TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis | May 30, 2025 | DiversityLanguage Modeling | CodeCode Available | 0 |
| Mamba Knockout for Unraveling Factual Information Flow | May 30, 2025 | Mamba | CodeCode Available | 0 |
| Reading Recognition in the Wild | May 30, 2025 | Diversity | —Unverified | 0 |
| CoRet: Improved Retriever for Code Editing | May 30, 2025 | Natural Language QueriesRetrieval | —Unverified | 0 |
| Supervised Quantum Machine Learning: A Future Outlook from Qubits to Enterprise Applications | May 30, 2025 | Quantum Machine Learning | —Unverified | 0 |
| HLSAD: Hodge Laplacian-based Simplicial Anomaly Detection | May 30, 2025 | Anomaly DetectionComputational Efficiency | —Unverified | 0 |
| Model Informed Flows for Bayesian Inference of Probabilistic Programs | May 30, 2025 | Bayesian InferenceTranslation | —Unverified | 0 |
| Rehearsal with Auxiliary-Informed Sampling for Audio Deepfake Detection | May 30, 2025 | Audio Deepfake DetectionContinual Learning | CodeCode Available | 0 |
| Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck | May 30, 2025 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Efficient Estimation of Regularized Tyler's M-Estimator Using Approximate LOOCV | May 30, 2025 | Face RecognitionObject Recognition | —Unverified | 0 |
| Light as Deception: GPT-driven Natural Relighting Against Vision-Language Pre-training Models | May 30, 2025 | Image CaptioningQuestion Answering | —Unverified | 0 |
| LPASS: Linear Probes as Stepping Stones for vulnerability detection using compressed LLMs | May 30, 2025 | Vulnerability Detection | —Unverified | 0 |
| Breaking the Gold Standard: Extracting Forgotten Data under Exact Unlearning in Large Language Models | May 30, 2025 | Medical Diagnosis | —Unverified | 0 |
| A Reward-driven Automated Webshell Malicious-code Generator for Red-teaming | May 30, 2025 | Code GenerationDiversity | —Unverified | 0 |