| MOVi: Training-free Text-conditioned Multi-Object Video Generation | May 29, 2025 | Object, Video Generation | Unverified | 0 |
| Zero-P-to-3: Zero-Shot Partial-View Images to 3D Object | May 29, 2025 | 3D Reconstruction, Image Reconstruction | Unverified | 0 |
| EAD: An EEG Adapter for Automated Classification | May 29, 2025 | Classification, EEG | Unverified | 0 |
| Identification of Patterns of Cognitive Impairment for Early Detection of Dementia | May 29, 2025 | feature selection | Unverified | 0 |
| TextSR: Diffusion Super-Resolution with Multilingual OCR Guidance | May 29, 2025 | Image Super-Resolution, Optical Character Recognition | Unverified | 0 |
| WTEFNet: Real-Time Low-Light Object Detection for Advanced Driver-Assistance Systems | May 29, 2025 | Denoising, object-detection | Unverified | 0 |
| Image Aesthetic Reasoning: A New Benchmark for Medical Image Screening with MLLMs | May 29, 2025 | Image Generation, Multiple-choice | Unverified | 0 |
| RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries | May 29, 2025 | Image Generation | Unverified | 0 |
| GenCAD-Self-Repairing: Feasibility Enhancement for 3D CAD Generation | May 29, 2025 | Contrastive Learning, Denoising | Unverified | 0 |
| Federated Unsupervised Semantic Segmentation | May 29, 2025 | Federated Learning, Image Segmentation | Unverified | 0 |
| TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models | May 29, 2025 | Denoising, Fairness | Unverified | 0 |
| Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis | May 29, 2025 | Dimensionality Reduction, Image Generation | Unverified | 0 |
| Fine-Tuning Next-Scale Visual Autoregressive Models with Group Relative Policy Optimization | May 29, 2025 | Reinforcement Learning (RL) | Unverified | 0 |
| DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification | May 29, 2025 | image-classification, Image Classification | Unverified | 0 |
| Beam-Guided Knowledge Replay for Knowledge-Rich Image Captioning using Vision-Language Model | May 29, 2025 | Image Captioning, Language Modeling | Unverified | 0 |
| MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification | May 29, 2025 | Classification, image-classification | Unverified | 0 |
| Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings | May 29, 2025 | graph construction | Unverified | 0 |
| Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation | May 29, 2025 | Depth Estimation, Monocular Depth Estimation | Unverified | 0 |
| Wav2Sem: Plug-and-Play Audio Semantic Decoupling for 3D Speech-Driven Facial Animation | May 29, 2025 | Motion Generation | Code Available | 1 |
| EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge | May 29, 2025 | text-to-speech, Text to Speech | Code Available | 3 |
| Toward Memory-Aided World Models: Benchmarking via Spatial Consistency | May 29, 2025 | Benchmarking, Minecraft | Code Available | 1 |
| Proximal Algorithm Unrolling: Flexible and Efficient Reconstruction Networks for Single-Pixel Imaging | May 29, 2025 | | Code Available | 1 |
| UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning | May 29, 2025 | | Code Available | 2 |
| Deep Modeling and Optimization of Medical Image Classification | May 29, 2025 | Avg, Classification | Code Available | 0 |
| From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems | May 29, 2025 | Decision Making, Management | Unverified | 0 |
| Self-Correcting Code Generation Using Small Language Models | May 29, 2025 | Code Generation, HumanEval | Code Available | 0 |
| Computational Algebra with Attention: Transformer Oracles for Border Basis Algorithms | May 29, 2025 | | Code Available | 0 |
| LLM Performance for Code Generation on Noisy Tasks | May 29, 2025 | Benchmarking, Code Generation | Code Available | 0 |
| What About Emotions? Guiding Fine-Grained Emotion Extraction from Mobile App Reviews | May 29, 2025 | Emotion Classification, Emotion Recognition | Code Available | 0 |
| A Descriptor Is All You Need: Accurate Machine Learning of Nonadiabatic Coupling Vectors | May 29, 2025 | All | Code Available | 0 |
| Self-supervised feature learning for cardiac Cine MR image reconstruction | May 29, 2025 | Image Reconstruction, MRI Reconstruction | Code Available | 0 |
| Synthetic Generation and Latent Projection Denoising of Rim Lesions in Multiple Sclerosis | May 29, 2025 | Denoising, Diagnostic | Code Available | 0 |
| ZIPA: A family of efficient models for multilingual phone recognition | May 29, 2025 | Diversity | Code Available | 2 |
| Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement | May 29, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Optimization-Free Diffusion Model -- A Perturbation Theory Approach | May 29, 2025 | model | Unverified | 0 |
| Sensitivity of DC Network Representation for GIC Analysis | May 29, 2025 | Blocking, Sensitivity | Unverified | 0 |
| VERINA: Benchmarking Verifiable Code Generation | May 29, 2025 | Benchmarking, Code Generation | Code Available | 2 |
| On Transferring Transferability: Towards a Theory for Size Generalization | May 29, 2025 | | Code Available | 0 |
| Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems | May 29, 2025 | Decision Making, Language Modeling | Code Available | 0 |
| FSL-SAGE: Accelerating Federated Split Learning via Smashed Activation Gradient Estimation | May 29, 2025 | Federated Learning | Code Available | 0 |
| AMBER: Adaptive Mesh Generation by Iterative Mesh Resolution Prediction | May 29, 2025 | Computational Efficiency, Data Augmentation | Code Available | 0 |
| AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models | May 29, 2025 | Safety Alignment | Code Available | 0 |
| HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions | May 29, 2025 | Image Animation, Video Generation | Code Available | 2 |
| Revisit CP Tensor Decomposition: Statistical Optimality and Fast Convergence | May 29, 2025 | Tensor Decomposition | Code Available | 0 |
| Evaluating AI capabilities in detecting conspiracy theories on YouTube | May 29, 2025 | Data Integration | Code Available | 0 |
| OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software | May 29, 2025 | Code Generation | Code Available | 0 |
| Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition | May 29, 2025 | Action Classification, Action Recognition | Code Available | 0 |
| Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning | May 29, 2025 | Diagnostic, Question Answering | Code Available | 1 |
| Dual-Task Graph Neural Network for Joint Seizure Onset Zone Localization and Outcome Prediction using Stereo EEG | May 29, 2025 | EEG, Feature Importance | Unverified | 0 |
| Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking | May 29, 2025 | Decoder | Unverified | 0 |