| Flexible Tool Selection through Low-dimensional Attribute Alignment of Vision and Language | May 28, 2025 | Attribute | —Unverified | 0 |
| ATI: Any Trajectory Instruction for Controllable Video Generation | May 28, 2025 | Image to Video GenerationVideo Generation | —Unverified | 0 |
| Multi-period Mean-Buffered Probability of Exceedance in Defined Contribution Portfolio Optimization | May 28, 2025 | Bilevel OptimizationPortfolio Optimization | —Unverified | 0 |
| Frequency-Adaptive Discrete Cosine-ViT-ResNet Architecture for Sparse-Data Vision | May 28, 2025 | image-classificationImage Classification | —Unverified | 0 |
| Rhetorical Text-to-Image Generation via Two-layer Diffusion Policy Optimization | May 28, 2025 | DenoisingImage Generation | —Unverified | 0 |
| 3DGS Compression with Sparsity-guided Hierarchical Transform Coding | May 28, 2025 | 3DGS | —Unverified | 0 |
| GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking | May 28, 2025 | BenchmarkingText Spotting | CodeCode Available | 1 |
| Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge | May 28, 2025 | Depression DetectionDiagnostic | CodeCode Available | 1 |
| VidText: Towards Comprehensive Evaluation for Video Text Understanding | May 28, 2025 | Multimodal ReasoningOptical Character Recognition (OCR) | CodeCode Available | 1 |
| CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting | May 28, 2025 | Style Transfer | CodeCode Available | 1 |
| FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian | May 28, 2025 | 16k | —Unverified | 0 |
| Topological Machine Learning for Protein-Nucleic Acid Binding Affinity Changes Upon Mutation | May 28, 2025 | Topological Data Analysis | CodeCode Available | 0 |
| Fast Isotropic Median Filtering | May 28, 2025 | Allimage smoothing | CodeCode Available | 1 |
| Shapley Value-driven Data Pruning for Recommender Systems | May 28, 2025 | DenoisingRecommendation Systems | CodeCode Available | 0 |
| IMTS is Worth Time Channel Patches: Visual Masked Autoencoders for Irregular Multivariate Time Series Prediction | May 28, 2025 | Missing ValuesSelf-Supervised Learning | CodeCode Available | 1 |
| CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Distribution free M-estimation | May 28, 2025 | Learning TheoryStochastic Optimization | —Unverified | 0 |
| 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians | May 28, 2025 | Camera LocalizationCamera Pose Estimation | —Unverified | 0 |
| A Tool for Generating Exceptional Behavior Tests With Large Language Models | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| IRS: Incremental Relationship-guided Segmentation for Digital Pathology | May 28, 2025 | Continual LearningDomain Generalization | CodeCode Available | 0 |
| Learning-Based Robust Fixed-Time Terminal Sliding Mode Control | May 28, 2025 | Gaussian Processes | —Unverified | 0 |
| PGLearn -- An Open-Source Learning Toolkit for Optimal Power Flow | May 28, 2025 | Benchmarking | —Unverified | 0 |
| Smart Surrogate Losses for Contextual Stochastic Linear Optimization with Robust Constraints | May 28, 2025 | Conformal PredictionSelection bias | —Unverified | 0 |
| A Probabilistic Jump-Diffusion Framework for Open-World Egocentric Activity Recognition | May 28, 2025 | Activity RecognitionEgocentric Activity Recognition | —Unverified | 0 |
| Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games | May 28, 2025 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems | May 28, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Generative Social Choice: The Next Generation | May 28, 2025 | | CodeCode Available | 0 |
| Improving Contrastive Learning for Referring Expression Counting | May 28, 2025 | Contrastive LearningObject Counting | CodeCode Available | 0 |
| WebDancer: Towards Autonomous Information Seeking Agency | May 28, 2025 | | CodeCode Available | 11 |
| Hierarchical Material Recognition from Local Appearance | May 28, 2025 | Few-Shot LearningGraph Attention | —Unverified | 0 |
| First Steps Towards Overhearing LLM Agents: A Case Study With Dungeons & Dragons Gameplay | May 28, 2025 | | CodeCode Available | 0 |
| Handling bounded response in high dimensions: a Horseshoe prior Bayesian Beta regression approach | May 28, 2025 | regressionVariable Selection | CodeCode Available | 0 |
| VME: A Satellite Imagery Dataset and Benchmark for Detecting Vehicles in the Middle East and Beyond | May 28, 2025 | Disaster ResponseDiversity | CodeCode Available | 0 |
| VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language Models | May 28, 2025 | Decision MakingQuestion Answering | CodeCode Available | 0 |
| Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory | May 28, 2025 | DiversityPosition | —Unverified | 0 |
| RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation | May 28, 2025 | Automated Theorem ProvingRetrieval | —Unverified | 0 |
| Higher-Order Group Synchronization | May 28, 2025 | | CodeCode Available | 0 |
| Conversational Alignment with Artificial Intelligence in Context | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BugWhisperer: Fine-Tuning LLMs for SoC Hardware Vulnerability Detection | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents | May 28, 2025 | Q-Learning | —Unverified | 0 |
| EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse | May 28, 2025 | | CodeCode Available | 0 |
| Self-orthogonalizing attractor neural networks emerging from the free energy principle | May 28, 2025 | | CodeCode Available | 1 |
| LoKI: Low-damage Knowledge Implanting of Large Language Models | May 28, 2025 | parameter-efficient fine-tuning | CodeCode Available | 1 |
| cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning | May 28, 2025 | CAD ReconstructionLarge Language Model | CodeCode Available | 2 |
| MIAS-SAM: Medical Image Anomaly Segmentation without thresholding | May 28, 2025 | Anomaly SegmentationDecoder | CodeCode Available | 0 |
| A Provable Approach for End-to-End Safe Reinforcement Learning | May 28, 2025 | Gaussian ProcessesReinforcement Learning (RL) | —Unverified | 0 |
| Hyperbolic recurrent neural network as the first type of non-Euclidean neural quantum state ansatz | May 28, 2025 | Variational Monte Carlo | CodeCode Available | 0 |
| Assessing Quantum Advantage for Gaussian Process Regression | May 28, 2025 | regression | —Unverified | 0 |
| Chest Disease Detection In X-Ray Images Using Deep Learning Classification Method | May 28, 2025 | ClassificationTransfer Learning | —Unverified | 0 |
| PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization | May 28, 2025 | 3D geometry3D Reconstruction | —Unverified | 0 |