| Writing-Zero: Bridge the Gap Between Non-verifiable Tasks and Verifiable Rewards | May 30, 2025 | Code Generation | —Unverified | 0 |
| Empirical Validation of the Independent Chip Model | May 30, 2025 | model | —Unverified | 0 |
| Multi-Analyte, Swab-based Automated Wound Monitor with AI | May 30, 2025 | Diagnostic | —Unverified | 0 |
| Artificial Empathy: AI based Mental Health | May 30, 2025 | Large Language Model | —Unverified | 0 |
| PersianMedQA: Language-Centric Evaluation of LLMs in the Persian Medical Domain | May 30, 2025 | Instruction FollowingMultiple-choice | —Unverified | 0 |
| A Reinforcement Learning-Based Telematic Routing Protocol for the Internet of Underwater Things | May 30, 2025 | Decision Making | —Unverified | 0 |
| RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward | May 30, 2025 | Large Language Model | —Unverified | 0 |
| Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes | May 30, 2025 | counterfactualVideo Generation | —Unverified | 0 |
| Understanding while Exploring: Semantics-driven Active Mapping | May 30, 2025 | 3DGSInformativeness | —Unverified | 0 |
| Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces | May 30, 2025 | Spatial Reasoning | —Unverified | 0 |
| Interactive Imitation Learning for Dexterous Robotic Manipulation: Challenges and Perspectives -- A Survey | May 30, 2025 | Imitation Learning | —Unverified | 0 |
| Hi-Dyna Graph: Hierarchical Dynamic Scene Graph for Robotic Autonomy in Human-Centric Environments | May 30, 2025 | Graph GenerationHuman-Object Interaction Detection | —Unverified | 0 |
| Supporting architecture evaluation for ATAM scenarios with LLMs | May 30, 2025 | Attribute | —Unverified | 0 |
| Applying Large Language Models to Issue Classification: Revisiting with Extended Data and New Models | May 30, 2025 | Classification | —Unverified | 0 |
| A Causation-Based Framework for Pricing and Cost Allocation of Energy, Reserves, and Transmission in Modern Power Systems | May 30, 2025 | Scheduling | —Unverified | 0 |
| MRDust: Wireless Implant Data Uplink & Localization via Magnetic Resonance Image Modulation | May 30, 2025 | Image Registration | —Unverified | 0 |
| Tensor Network for Anomaly Detection in the Latent Space of Proton Collision Events at the LHC | May 30, 2025 | Anomaly DetectionQuantum Machine Learning | CodeCode Available | 0 |
| Input-Power-to-State Stability of Time-Varying Systems | May 30, 2025 | Form | —Unverified | 0 |
| MOFGPT: Generative Design of Metal-Organic Frameworks using Language Models | May 30, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| Sorrel: A simple and flexible framework for multi-agent reinforcement learning | May 30, 2025 | Multi-agent Reinforcement LearningPhilosophy | CodeCode Available | 1 |
| Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit Poetry | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Generator Based Inference (GBI) | May 30, 2025 | Anomaly Detectionparameter estimation | CodeCode Available | 0 |
| Pushing the Limits of Beam Search Decoding for Transducer-based ASR models | May 30, 2025 | GPU | —Unverified | 0 |
| Applying Vision Transformers on Spectral Analysis of Astronomical Objects | May 30, 2025 | | CodeCode Available | 0 |
| Beyond Atomic Geometry Representations in Materials Science: A Human-in-the-Loop Multimodal Framework | May 30, 2025 | Benchmarking | CodeCode Available | 0 |
| Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings | May 30, 2025 | ArticlesClustering | —Unverified | 0 |
| SoundSculpt: Direction and Semantics Driven Ambisonic Target Sound Extraction | May 30, 2025 | Image SegmentationSemantic Segmentation | —Unverified | 0 |
| Structure-Aware Fill-in-the-Middle Pretraining for Code | May 30, 2025 | | CodeCode Available | 0 |
| Optimal Weighted Convolution for Classification and Denosing | May 30, 2025 | ClassificationDenoising | CodeCode Available | 2 |
| Segmenting France Across Four Centuries | May 30, 2025 | BenchmarkingImage-to-Image Translation | CodeCode Available | 0 |
| GARLIC: GAussian Representation LearnIng for spaCe partitioning | May 30, 2025 | Representation Learning | —Unverified | 0 |
| Tackling View-Dependent Semantics in 3D Language Gaussian Splatting | May 30, 2025 | 3D Scene ReconstructionScene Understanding | CodeCode Available | 2 |
| un^2CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP | May 30, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| Model-Guided Network with Cluster-Based Operators for Spatio-Spectral Super-Resolution | May 30, 2025 | Spectral ReconstructionSpectral Super-Resolution | CodeCode Available | 0 |
| Category-Level 6D Object Pose Estimation in Agricultural Settings Using a Lattice-Deformation Framework and Diffusion-Augmented Synthetic Data | May 30, 2025 | 6D Pose Estimation6D Pose Estimation using RGB | —Unverified | 0 |
| UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation | May 30, 2025 | Video Generation | —Unverified | 0 |
| 3D Gaussian Splat Vulnerabilities | May 30, 2025 | 3DGSAdversarial Attack | CodeCode Available | 1 |
| Pretraining Deformable Image Registration Networks with Random Images | May 30, 2025 | Computational EfficiencyImage Registration | CodeCode Available | 0 |
| Consistent line clustering using geometric hypergraphs | May 30, 2025 | Clustering | —Unverified | 0 |
| 6D Pose Estimation on Point Cloud Data through Prior Knowledge Integration: A Case Study in Autonomous Disassembly | May 30, 2025 | 6D Pose EstimationPose Estimation | —Unverified | 0 |
| ComposeAnything: Composite Object Priors for Text-to-Image Generation | May 30, 2025 | DenoisingImage Generation | —Unverified | 0 |
| Threading Keyframe with Narratives: MLLMs as Strong Long Video Comprehenders | May 30, 2025 | Video Understanding | —Unverified | 0 |
| 50 Years of Automated Face Recognition | May 30, 2025 | Face RecognitionFace Verification | —Unverified | 0 |
| Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization | May 30, 2025 | Depth Estimation | —Unverified | 0 |
| Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames | May 30, 2025 | ObjectSpatial Reasoning | —Unverified | 0 |
| LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization | May 30, 2025 | Action Localization | —Unverified | 0 |
| Progressive Class-level Distillation | May 30, 2025 | BenchmarkingKnowledge Distillation | —Unverified | 0 |
| Leadership Assessment in Pediatric Intensive Care Unit Team Training | May 30, 2025 | Contact Detectionobject-detection | —Unverified | 0 |
| InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing | May 30, 2025 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| VUDG: A Dataset for Video Understanding Domain Generalization | May 30, 2025 | Domain GeneralizationMultiple-choice | —Unverified | 0 |