| OTSurv: A Novel Multiple Instance Learning Framework for Survival Prediction with Heterogeneity-aware Optimal Transport | Jun 25, 2025 | Multiple Instance LearningSurvival Prediction | CodeCode Available | 1 |
| Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models | Jun 25, 2025 | Semantic SimilaritySemantic Textual Similarity | —Unverified | 0 |
| Vector Contrastive Learning For Pixel-Wise Pretraining In Medical Vision | Jun 25, 2025 | Contrastive LearningFeature Correlation | CodeCode Available | 1 |
| IMC-PINN-FE: A Physics-Informed Neural Network for Patient-Specific Left Ventricular Finite Element Modeling with Image Motion Consistency and Biomechanical Parameter Estimation | Jun 25, 2025 | parameter estimationSpecificity | CodeCode Available | 0 |
| Latent-space Field Tension for Astrophysical Component Detection An application to X-ray imaging | Jun 25, 2025 | Uncertainty Quantification | —Unverified | 0 |
| Cooperative Sensing and Communication Beamforming Design for Low-Altitude Economy | Jun 25, 2025 | Integrated sensing and communicationISAC | —Unverified | 0 |
| Uncertainty-Aware Machine-Learning Framework for Predicting Dislocation Plasticity and Stress-Strain Response in FCC Alloys | Jun 25, 2025 | Uncertainty Quantification | —Unverified | 0 |
| Volumetric segmentation of muscle compartments using in vivo imaging and architectural validation in human finger flexors | Jun 25, 2025 | AnatomySegmentation | —Unverified | 0 |
| Building Lightweight Semantic Segmentation Models for Aerial Images Using Dual Relation Distillation | Jun 25, 2025 | Knowledge DistillationRelation | —Unverified | 0 |
| DPLib: A Standard Benchmark Library for Distributed Power System Analysis and Optimization | Jun 25, 2025 | Distributed Optimization | CodeCode Available | 1 |
| Distributed Lyapunov Functions for Nonlinear Networks | Jun 25, 2025 | Dimensionality ReductionLEMMA | CodeCode Available | 0 |
| Papanicolaou Stain Unmixing for RGB Image Using Weighted Nucleus Sparsity and Total Variation Regularization | Jun 25, 2025 | | CodeCode Available | 0 |
| Noise-Tolerant Hybrid Approach for Data-Driven Predictive Control | Jun 25, 2025 | Sensitivity | —Unverified | 0 |
| Identifiability and Maximum Likelihood Estimation for System Identification of Networks of Dynamical Systems | Jun 25, 2025 | | CodeCode Available | 0 |
| Test-time Scaling Techniques in Theoretical Physics -- A Comparison of Methods on the TPBench Dataset | Jun 25, 2025 | Mathematical Reasoning | —Unverified | 0 |
| U-R-VEDA: Integrating UNET, Residual Links, Edge and Dual Attention, and Vision Transformer for Accurate Semantic Segmentation of CMRs | Jun 25, 2025 | Edge DetectionMedical Image Analysis | —Unverified | 0 |
| Structural System Identification via Validation and Adaptation | Jun 25, 2025 | parameter estimationUncertainty Quantification | —Unverified | 0 |
| Analytic inference with two-way clustering | Jun 25, 2025 | Clustering | —Unverified | 0 |
| Engineering RAG Systems for Real-World Applications: Design, Development, and Evaluation | Jun 25, 2025 | Optical Character Recognition (OCR)RAG | —Unverified | 0 |
| Generating Reliable Adverse event Profiles for Health through Automated Integrated Data (GRAPH-AID): A Semi-Automated Ontology Building Approach | Jun 25, 2025 | Knowledge Graphs | —Unverified | 0 |
| Brain2Model Transfer: Training sensory and decision models with human neural activity as a teacher | Jun 25, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Spiking Neural Networks for SAR Interferometric Phase Unwrapping: A Theoretical Framework for Energy-Efficient Processing | Jun 25, 2025 | Earth Observation | —Unverified | 0 |
| 3DGH: 3D Head Generation with Composable Hair and Face | Jun 25, 2025 | Image Generation | —Unverified | 0 |
| How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction? | Jun 25, 2025 | Gesture Recognition | —Unverified | 0 |
| Exploring the Effects of Chatbot Anthropomorphism and Human Empathy on Human Prosocial Behavior Toward Chatbots | Jun 25, 2025 | Chatbot | —Unverified | 0 |
| GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization | Jun 25, 2025 | GPU | —Unverified | 0 |
| An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS | Jun 25, 2025 | Speaker Recognitiontext-to-speech | —Unverified | 0 |
| The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas | Jun 25, 2025 | | CodeCode Available | 3 |
| Model-Based Real-Time Pose and Sag Estimation of Overhead Power Lines Using LiDAR for Drone Inspection | Jun 25, 2025 | valid | CodeCode Available | 0 |
| Communicating Smartly in the Molecular Domain: Neural Networks in the Internet of Bio-Nano Things | Jun 25, 2025 | Dataset GenerationExplainable artificial intelligence | CodeCode Available | 0 |
| The role of audio-visual integration in the time course of phonetic encoding in self-supervised speech models | Jun 25, 2025 | Self-Supervised Learning | —Unverified | 0 |
| ConViTac: Aligning Visual-Tactile Fusion with Contrastive Representations | Jun 25, 2025 | Contrastive LearningMaterial Classification | —Unverified | 0 |
| Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR | Jun 25, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| RAG-VisualRec: An Open Resource for Vision- and Text-Enhanced Retrieval-Augmented Generation in Recommendation | Jun 25, 2025 | Collaborative FilteringData Augmentation | CodeCode Available | 0 |
| Complex Model Transformations by Reinforcement Learning with Uncertain Human Guidance | Jun 25, 2025 | Reinforcement Learning (RL) | CodeCode Available | 0 |
| Video Perception Models for 3D Scene Synthesis | Jun 25, 2025 | 3D ReconstructionImage Generation | —Unverified | 0 |
| POLAR: A Pessimistic Model-based Policy Learning Algorithm for Dynamic Treatment Regimes | Jun 25, 2025 | Sequential Decision Making | —Unverified | 0 |
| FINN-GL: Generalized Mixed-Precision Extensions for FPGA-Accelerated LSTMs | Jun 25, 2025 | Sentiment AnalysisStock Prediction | —Unverified | 0 |
| MS-IQA: A Multi-Scale Feature Fusion Network for PET/CT Image Quality Assessment | Jun 25, 2025 | DiagnosticImage Quality Assessment | CodeCode Available | 0 |
| Med-Art: Diffusion Transformer for 2D Medical Text-to-Image Generation | Jun 25, 2025 | Image GenerationMedical Image Generation | —Unverified | 0 |
| A Transformer Based Handwriting Recognition System Jointly Using Online and Offline Features | Jun 25, 2025 | Handwriting RecognitionRepresentation Learning | —Unverified | 0 |
| X-SiT: Inherently Interpretable Surface Vision Transformers for Dementia Diagnosis | Jun 25, 2025 | AnatomyDecision Making | —Unverified | 0 |
| HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling | Jun 25, 2025 | Image Generation | —Unverified | 0 |
| Semantic-enhanced Modality-asymmetric Retrieval for Online E-commerce Search | Jun 25, 2025 | Question AnsweringRetrieval | —Unverified | 0 |
| A Literature Review on Simulation in Conversational Recommender Systems | Jun 25, 2025 | Conversational RecommendationRecommendation Systems | —Unverified | 0 |
| From 2D to 3D Cognition: A Brief Survey of General World Models | Jun 25, 2025 | Autonomous DrivingScene Generation | —Unverified | 0 |
| UniCode^2: Cascaded Large-scale Codebooks for Unified Multimodal Understanding and Generation | Jun 25, 2025 | 16k | —Unverified | 0 |
| Dynamic Bandwidth Allocation for Hybrid Event-RGB Transmission | Jun 25, 2025 | DeblurringInformativeness | —Unverified | 0 |
| From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios | Jun 25, 2025 | Prediction | —Unverified | 0 |
| Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement | Jun 25, 2025 | Surgical phase recognitionTest-time Adaptation | CodeCode Available | 0 |