| Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems | Jun 16, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Logical Expressiveness of Graph Neural Networks with Hierarchical Node Individualization | Jun 16, 2025 | Isomorphism Testing | CodeCode Available | 0 |
| Delving Into the Psychology of Machines: Exploring the Structure of Self-Regulated Learning via LLM-Generated Survey Responses | Jun 16, 2025 | Survey | —Unverified | 0 |
| AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning | Jun 16, 2025 | Action GenerationAutonomous Driving | CodeCode Available | 3 |
| Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability | Jun 16, 2025 | ClassificationContrastive Learning | CodeCode Available | 0 |
| A Survey on World Models Grounded in Acoustic Physical Information | Jun 16, 2025 | Autonomous DrivingSurvey | CodeCode Available | 0 |
| Fake it till You Make it: Reward Modeling as Discriminative Prediction | Jun 16, 2025 | | —Unverified | 0 |
| Towards Pervasive Distributed Agentic Generative AI -- A State of The Art | Jun 16, 2025 | Natural Language UnderstandingSurvey | —Unverified | 0 |
| OPTIMUS: Observing Persistent Transformations in Multi-temporal Unlabeled Satellite-data | Jun 16, 2025 | Change Point DetectionSelf-Supervised Learning | —Unverified | 0 |
| GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining | Jun 16, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models | Jun 16, 2025 | Math | —Unverified | 0 |
| Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes | Jun 16, 2025 | Reinforcement Learning (RL) | —Unverified | 0 |
| A Survey on Imitation Learning for Contact-Rich Tasks in Robotics | Jun 16, 2025 | Contact-rich ManipulationImitation Learning | —Unverified | 0 |
| From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars | Jun 16, 2025 | GPUSpeech Synthesis | —Unverified | 0 |
| SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists | Jun 16, 2025 | Fact CheckingTAG | —Unverified | 0 |
| Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders | Jun 16, 2025 | | —Unverified | 0 |
| DoA Estimation using MUSIC with Range/Doppler Multiplexing for MIMO-OFDM Radar | Jun 16, 2025 | parameter estimationSuper-Resolution | —Unverified | 0 |
| Stability Analysis of Physics-Informed Neural Networks via Variational Coercivity, Perturbation Bounds, and Concentration Estimates | Jun 16, 2025 | Generalization Bounds | —Unverified | 0 |
| Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management | Jun 16, 2025 | ManagementMulti-Objective Reinforcement Learning | —Unverified | 0 |
| IKDiffuser: A Generative Inverse Kinematics Solver for Multi-arm Robots via Diffusion Model | Jun 16, 2025 | Computational EfficiencyDiversity | —Unverified | 0 |
| ROSA: Harnessing Robot States for Vision-Language and Action Alignment | Jun 16, 2025 | State EstimationVision-Language-Action | —Unverified | 0 |
| Agent Capability Negotiation and Binding Protocol (ACNBP) | Jun 16, 2025 | Document Translation | CodeCode Available | 0 |
| TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting | Jun 16, 2025 | GPUInverse Rendering | CodeCode Available | 0 |
| Polyra Swarms: A Shape-Based Approach to Machine Learning | Jun 16, 2025 | Anomaly Detection | —Unverified | 0 |
| JENGA: Object selection and pose estimation for robotic grasping from a stack | Jun 16, 2025 | BenchmarkingObject | —Unverified | 0 |
| VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models | Jun 16, 2025 | Computational EfficiencyMissing Values | —Unverified | 0 |
| Block-wise Adaptive Caching for Accelerating Diffusion Policy | Jun 16, 2025 | Action GenerationDenoising | —Unverified | 0 |
| FrontendBench: A Benchmark for Evaluating LLMs on Front-End Development via Automatic Evaluation | Jun 16, 2025 | Code Generation | —Unverified | 0 |
| Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models | Jun 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning | Jun 16, 2025 | Deep LearningGraph structure learning | CodeCode Available | 1 |
| Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model | Jun 16, 2025 | Large Language Modelmultimodal interaction | CodeCode Available | 5 |
| Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble | Jun 16, 2025 | Machine Unlearning | CodeCode Available | 1 |
| ZipVoice: Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching | Jun 16, 2025 | DecoderSpeech Synthesis | CodeCode Available | 4 |
| SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure | Jun 16, 2025 | Simultaneous Localization and Mapping | CodeCode Available | 2 |
| Global Convergence of Adjoint-Optimized Neural PDEs | Jun 16, 2025 | | CodeCode Available | 0 |
| EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization | Jun 16, 2025 | Mixture-of-ExpertsModel Compression | CodeCode Available | 0 |
| Probing Deep into Temporal Profile Makes the Infrared Small Target Detector Much Better | Jun 15, 2025 | Anomaly Detection | CodeCode Available | 1 |
| SMPL Normal Map Is All You Need for Single-view Textured Human Reconstruction | Jun 15, 2025 | 3D Human Reconstruction3D Reconstruction | —Unverified | 0 |
| ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies | Jun 15, 2025 | Benchmarking | CodeCode Available | 1 |
| Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context | Jun 15, 2025 | image-classificationImage Classification | CodeCode Available | 0 |
| Dynamic Scheduling for Enhanced Performance in RIS-assisted Cooperative Network with Interference | Jun 15, 2025 | ManagementScheduling | —Unverified | 0 |
| Effect Decomposition of Functional-Output Computer Experiments via Orthogonal Additive Gaussian Processes | Jun 15, 2025 | Gaussian ProcessesSensitivity | —Unverified | 0 |
| PDCNet: a benchmark and general deep learning framework for activity prediction of peptide-drug conjugates | Jun 15, 2025 | Activity Prediction | —Unverified | 0 |
| MORIC: CSI Delay-Doppler Decomposition for Robust Wi-Fi-based Human Activity Recognition | Jun 15, 2025 | Activity RecognitionHuman Activity Recognition | —Unverified | 0 |
| Improving spliced alignment by modeling splice sites with deep learning | Jun 15, 2025 | | CodeCode Available | 2 |
| Uncovering Social Network Activity Using Joint User and Topic Interaction | Jun 15, 2025 | Point Processes | —Unverified | 0 |
| KCLNet: Physics-Informed Power Flow Prediction via Constraints Projections | Jun 15, 2025 | Graph Neural NetworkPrediction | —Unverified | 0 |
| Nonlinear Model Order Reduction of Dynamical Systems in Process Engineering: Review and Comparison | Jun 15, 2025 | Chemical Process | —Unverified | 0 |
| GM-LDM: Latent Diffusion Model for Brain Biomarker Identification through Functional Data-Driven Gray Matter Synthesis | Jun 15, 2025 | DecoderDenoising | —Unverified | 0 |
| Predicting Genetic Mutations from Single-Cell Bone Marrow Images in Acute Myeloid Leukemia Using Noise-Robust Deep Learning Models | Jun 15, 2025 | Diagnostic | —Unverified | 0 |