| VERITAS: Verification and Explanation of Realness in Images for Transparency in AI Systems | Jul 7, 2025 | Decision MakingSynthetic Image Detection | CodeCode Available | 0 |
| Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations | Jul 7, 2025 | AttributeMMLU | CodeCode Available | 0 |
| AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics | Jul 7, 2025 | Image GenerationMedical Diagnosis | CodeCode Available | 0 |
| AXLearn: Modular Large Model Training on Heterogeneous Infrastructure | Jul 7, 2025 | Deep Learning | —Unverified | 0 |
| Differential Attention for Multimodal Crisis Event Analysis | Jul 7, 2025 | Disaster ResponseHumanitarian | CodeCode Available | 0 |
| CLIP-Guided Backdoor Defense through Entropy-Based Poisoned Dataset Separation | Jul 7, 2025 | backdoor defense | CodeCode Available | 0 |
| Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts | Jul 7, 2025 | Inductive BiasMixture-of-Experts | CodeCode Available | 2 |
| Enhancing Spatial Reasoning in Vision-Language Models via Chain-of-Thought Prompting and Reinforcement Learning | Jul 6, 2025 | | CodeCode Available | 0 |
| Just Add Geometry: Gradient-Free Open-Vocabulary 3D Detection Without Human-in-the-Loop | Jul 6, 2025 | | CodeCode Available | 0 |
| ARMR: Adaptively Responsive Network for Medication Recommendation | Jul 6, 2025 | | CodeCode Available | 0 |
| Anomalous Decision Discovery using Inverse Reinforcement Learning | Jul 6, 2025 | | CodeCode Available | 0 |
| Diffusion Explorer: Interactive Exploration of Diffusion Models | Jul 6, 2025 | | —Unverified | 0 |
| Towards Understanding the Cognitive Habits of Large Reasoning Models | Jul 6, 2025 | | CodeCode Available | 0 |
| Efficient Training of Deep Networks using Guided Spectral Data Selection: A Step Toward Learning What You Need | Jul 6, 2025 | | CodeCode Available | 0 |
| HKCanto-Eval: A Benchmark for Evaluating Cantonese Language Understanding and Cultural Comprehension in LLMs | Jul 6, 2025 | | CodeCode Available | 0 |
| SCAWaveNet: A Spatial-Channel Attention-Based Network for Global Significant Wave Height Retrieval | Jul 6, 2025 | | CodeCode Available | 0 |
| Inertial Quadratic Majorization Minimization with Application to Kernel Regularized Learning | Jul 6, 2025 | | CodeCode Available | 0 |
| Dynamic Frequency Feature Fusion Network for Multi-Source Remote Sensing Data Classification | Jul 6, 2025 | | CodeCode Available | 0 |
| ViTaL: A Multimodality Dataset and Benchmark for Multi-pathological Ovarian Tumor Recognition | Jul 6, 2025 | | CodeCode Available | 0 |
| LearnLens: LLM-Enabled Personalised, Curriculum-Grounded Feedback with Educators in the Loop | Jul 6, 2025 | Retrieval | —Unverified | 0 |
| A Hybrid Machine Learning Framework for Optimizing Crop Selection via Agronomic and Economic Forecasting | Jul 6, 2025 | Hybrid Machine Learningspeech-recognition | —Unverified | 0 |
| Transferring Visual Explainability of Self-Explaining Models through Task Arithmetic | Jul 6, 2025 | image-classificationImage Classification | —Unverified | 0 |
| DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Jul 6, 2025 | Image GenerationMultimodal Reasoning | CodeCode Available | 3 |
| Grid-Reg: Grid-Based SAR and Optical Image Registration Across Platforms | Jul 6, 2025 | geo-localizationImage Registration | —Unverified | 0 |
| CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step | Jul 6, 2025 | DenoisingLarge Language Model | —Unverified | 0 |
| Tail-aware Adversarial Attacks: A Distributional Approach to Efficient LLM Jailbreaking | Jul 6, 2025 | Adversarial Robustness | —Unverified | 0 |
| MoReMouse: Monocular Reconstruction of Laboratory Mouse | Jul 6, 2025 | 3D Reconstruction3D Surface Generation | —Unverified | 0 |
| Visual Hand Gesture Recognition with Deep Learning: A Comprehensive Review of Methods, Datasets, Challenges and Future Research Directions | Jul 6, 2025 | Gesture RecognitionHand Gesture Recognition | —Unverified | 0 |
| TinyProto: Communication-Efficient Federated Learning with Sparse Prototypes in Resource-Constrained Environments | Jul 6, 2025 | Federated Learning | CodeCode Available | 0 |
| BiFair: A Fairness-aware Training Framework for LLM-enhanced Recommender Systems via Bi-level Optimization | Jul 6, 2025 | FairnessLarge Language Model | —Unverified | 0 |
| A Training-Free Style-Personalization via Scale-wise Autoregressive Model | Jul 6, 2025 | Image GenerationPersonalized Image Generation | —Unverified | 0 |
| Heterogeneous Federated Learning with Prototype Alignment and Upscaling | Jul 6, 2025 | Federated Learning | CodeCode Available | 0 |
| MVNet: Hyperspectral Remote Sensing Image Classification Based on Hybrid Mamba-Transformer Vision Backbone Architecture | Jul 6, 2025 | Computational EfficiencyHyperspectral Image Classification | CodeCode Available | 0 |
| Model Inversion Attacks on Llama 3: Extracting PII from Large Language Models | Jul 6, 2025 | Privacy Preserving | —Unverified | 0 |
| Fuzzy Classification Aggregation for a Continuum of Agents | Jul 6, 2025 | Classification | —Unverified | 0 |
| Just Enough Shifts: Mitigating Over-Refusal in Aligned Language Models with Targeted Representation Fine-Tuning | Jul 6, 2025 | Safety Alignment | —Unverified | 0 |
| LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization | Jul 6, 2025 | Common Sense Reasoningparameter-efficient fine-tuning | CodeCode Available | 0 |
| FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection | Jul 6, 2025 | Out-of-Distribution DetectionOut of Distribution (OOD) Detection | CodeCode Available | 0 |
| MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection | Jul 6, 2025 | 3D Object DetectionAttribute | CodeCode Available | 2 |
| Exploring Remote Physiological Signal Measurement under Dynamic Lighting Conditions at Night: Dataset, Experiment, and Analysis | Jul 6, 2025 | Emotion Recognition | CodeCode Available | 1 |
| FinTeam: A Multi-Agent Collaborative Intelligence System for Comprehensive Financial Scenarios | Jul 5, 2025 | | CodeCode Available | 0 |
| Handling Korean Out-of-Vocabulary Words with Phoneme Representation Learning | Jul 5, 2025 | | CodeCode Available | 0 |
| AFUNet: Cross-Iterative Alignment-Fusion Synergy for HDR Reconstruction via Deep Unfolding Paradigm | Jul 5, 2025 | | CodeCode Available | 0 |
| Enhanced accuracy through ensembling of randomly initialized auto-regressive models for time-dependent PDEs | Jul 5, 2025 | | CodeCode Available | 0 |
| Consistency-Aware Padding for Incomplete Multi-Modal Alignment Clustering Based on Self-Repellent Greedy Anchor Search | Jul 5, 2025 | | CodeCode Available | 0 |
| Evaluating Adversarial Protections for Diffusion Personalization: A Comprehensive Study | Jul 5, 2025 | | CodeCode Available | 0 |
| Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents | Jul 5, 2025 | | CodeCode Available | 0 |
| BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering | Jul 5, 2025 | | CodeCode Available | 0 |
| PromptSR: Cascade Prompting for Lightweight Image Super-Resolution | Jul 5, 2025 | | CodeCode Available | 0 |
| When Data-Free Knowledge Distillation Meets Non-Transferable Teacher: Escaping Out-of-Distribution Trap is All You Need | Jul 5, 2025 | | CodeCode Available | 0 |