| Cross-Channel Unlabeled Sensing over a Union of Signal Subspaces | Jun 11, 2025 | compressed sensing | Unverified | 0 |
| Wavelet Scattering Transform and Fourier Representation for Offline Detection of Malicious Clients in Federated Learning | Jun 11, 2025 | Anomaly Detection, Federated Learning | Unverified | 0 |
| FedVLMBench: Benchmarking Federated Fine-Tuning of Vision-Language Models | Jun 11, 2025 | Benchmarking, Federated Learning | Unverified | 0 |
| A Survey on the Role of Artificial Intelligence and Machine Learning in 6G-V2X Applications | Jun 11, 2025 | Autonomous Vehicles, Federated Learning | Unverified | 0 |
| MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Jun 11, 2025 | D4RL, Deep Reinforcement Learning | Unverified | 0 |
| Towards Efficient and Effective Alignment of Large Language Models | Jun 11, 2025 | Mathematical Reasoning, Meta-Learning | Unverified | 0 |
| Foundation Model-Aided Deep Reinforcement Learning for RIS-Assisted Wireless Communication | Jun 11, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| Advancing Exchange Rate Forecasting: Leveraging Machine Learning and AI for Enhanced Accuracy in Global Financial Markets | Jun 11, 2025 | Sentiment Analysis | Unverified | 0 |
| Vision Generalist Model: A Survey | Jun 11, 2025 | model, Survey | Unverified | 0 |
| Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era | Jun 11, 2025 | General Knowledge | Unverified | 0 |
| Wasserstein Hypergraph Neural Network | Jun 11, 2025 | Graph Representation Learning, Node Classification | Unverified | 0 |
| UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images | Jun 11, 2025 | Novel View Synthesis | Unverified | 0 |
| Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning | Jun 11, 2025 | In-Context Learning, Question Answering | Unverified | 0 |
| Hidden in Plain Sight: Evaluation of the Deception Detection Capabilities of LLMs in Multimodal Settings | Jun 11, 2025 | Deception Detection | Unverified | 0 |
| An Effective End-to-End Solution for Multimodal Action Recognition | Jun 11, 2025 | Action Recognition, Computational Efficiency | Unverified | 0 |
| ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model | Jun 11, 2025 | cross-modal alignment, Descriptive | Code Available | 2 |
| DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding | Jun 11, 2025 | image-classification, Image Classification | Unverified | 0 |
| Leveraging LLMs for Mission Planning in Precision Agriculture | Jun 11, 2025 | Spatial Reasoning | Unverified | 0 |
| A Novel Lightweight Transformer with Edge-Aware Fusion for Remote Sensing Image Captioning | Jun 11, 2025 | Decoder, Image Captioning | Unverified | 0 |
| Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective | Jun 11, 2025 | Brain Tumor Segmentation, Computational Efficiency | Code Available | 1 |
| ComfyUI-R1: Exploring Reasoning Models for Workflow Generation | Jun 11, 2025 | 4k | Code Available | 7 |
| Towards Practical Alzheimer's Disease Diagnosis: A Lightweight and Interpretable Spiking Neural Model | Jun 11, 2025 | Diagnostic | Code Available | 1 |
| ScaleLSD: Scalable Deep Line Segment Detection Streamlined | Jun 11, 2025 | 3D geometry, Line Segment Detection | Code Available | 1 |
| Evasion Attacks Against Bayesian Predictive Models | Jun 11, 2025 | | Code Available | 0 |
| HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person Scenarios | Jun 11, 2025 | Action Recognition, Action Segmentation | Code Available | 0 |
| On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention | Jun 11, 2025 | Text Summarization | Code Available | 0 |
| Improving Personalized Search with Regularized Low-Rank Parameter Updates | Jun 11, 2025 | General Knowledge, Image Retrieval | Code Available | 0 |
| Consistent Story Generation with Asymmetry Zigzag Sampling | Jun 11, 2025 | Image Generation, Story Generation | Code Available | 0 |
| MMME: A Spontaneous Multi-Modal Micro-Expression Dataset Enabling Visual-Physiological Fusion | Jun 11, 2025 | EEG | Code Available | 0 |
| SRPL-SFDA: SAM-Guided Reliable Pseudo-Labels for Source-Free Domain Adaptation in Medical Image Segmentation | Jun 11, 2025 | Domain Adaptation, Image Segmentation | Code Available | 0 |
| Apollo: A Posteriori Label-Only Membership Inference Attack Towards Machine Unlearning | Jun 11, 2025 | Inference Attack, Machine Unlearning | Code Available | 0 |
| Discrete Scale-invariant Metric Learning for Efficient Collaborative Filtering | Jun 11, 2025 | Collaborative Filtering, Metric Learning | Code Available | 0 |
| IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic Environments | Jun 11, 2025 | Benchmarking | Code Available | 2 |
| Non-Contact Health Monitoring During Daily Personal Care Routines | Jun 11, 2025 | Heart rate estimation, Multi-Task Learning | Code Available | 1 |
| VerIF: Verification Engineering for Reinforcement Learning in Instruction Following | Jun 11, 2025 | Instruction Following, reinforcement-learning | Code Available | 2 |
| DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt | Jun 11, 2025 | Safety Alignment | Code Available | 1 |
| Unmasking real-world audio deepfakes: A data-centric approach | Jun 11, 2025 | DeepFake Detection, Face Swapping | Code Available | 1 |
| OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment | Jun 11, 2025 | cross-modal alignment, Question Answering | Code Available | 0 |
| ScoreMix: Improving Face Recognition via Score Composition in Diffusion Generators | Jun 11, 2025 | Data Augmentation, Face Recognition | Unverified | 0 |
| MetricHMR: Metric Human Mesh Recovery from Monocular Images | Jun 11, 2025 | Human Mesh Recovery, Translation | Unverified | 0 |
| Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers | Jun 11, 2025 | image-classification, Image Classification | Unverified | 0 |
| Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question Answering | Jun 11, 2025 | Graph Question Answering, Knowledge Graphs | Code Available | 0 |
| Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs | Jun 11, 2025 | Dependency Parsing, Hallucination | Code Available | 0 |
| Empirical and computer-aided robustness analysis of long-step and accelerated methods in smooth convex optimization | Jun 11, 2025 | | Code Available | 0 |
| Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction | Jun 11, 2025 | Speech Extraction, Target Speaker Extraction | Unverified | 0 |
| Prompt Variability Effects On LLM Code Generation | Jun 11, 2025 | Code Generation | Unverified | 0 |
| Auto-Compressing Networks | Jun 11, 2025 | Transfer Learning | Unverified | 0 |
| UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting | Jun 11, 2025 | Diversity, Representation Learning | Code Available | 2 |
| CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | Jun 11, 2025 | counterfactual, Descriptive | Code Available | 2 |
| LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge | Jun 11, 2025 | | Code Available | 1 |