| Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence | Jun 18, 2025 | | —Unverified | 0 |
| MEGC2025: Micro-Expression Grand Challenge on Spot Then Recognize and Visual Question Answering | Jun 18, 2025 | Multimodal ReasoningQuestion Answering | —Unverified | 0 |
| MSNeRV: Neural Video Representation with Multi-Scale Feature Fusion | Jun 18, 2025 | DecoderVideo Compression | —Unverified | 0 |
| PRISM-Loc: a Lightweight Long-range LiDAR Localization in Urban Environments with Topological Maps | Jun 18, 2025 | Pose Estimation | —Unverified | 0 |
| Context-Aware Deep Lagrangian Networks for Model Predictive Control | Jun 18, 2025 | Model Predictive Control | —Unverified | 0 |
| Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation | Jun 18, 2025 | Multi-Object TrackingObject Tracking | —Unverified | 0 |
| Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models | Jun 18, 2025 | Music GenerationText-to-Music Generation | —Unverified | 0 |
| Factorized RVQ-GAN For Disentangled Speech Tokenization | Jun 18, 2025 | DisentanglementKnowledge Distillation | —Unverified | 0 |
| Uncovering Intention through LLM-Driven Code Snippet Description Generation | Jun 18, 2025 | Descriptive | —Unverified | 0 |
| Code Rate Optimization via Neural Polar Decoders | Jun 18, 2025 | Capacity Estimation | —Unverified | 0 |
| One-shot Face Sketch Synthesis in the Wild via Generative Diffusion Prior and Instruction Tuning | Jun 18, 2025 | Face Sketch Synthesis | CodeCode Available | 0 |
| ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression | Jun 18, 2025 | Image Compression | CodeCode Available | 0 |
| Fair Contracts in Principal-Agent Games with Heterogeneous Types | Jun 18, 2025 | Fairness | —Unverified | 0 |
| MAARTA:Multi-Agentic Adaptive Radiology Teaching Assistant | Jun 18, 2025 | Diagnostic | —Unverified | 0 |
| Centroid Approximation for Byzantine-Tolerant Federated Learning | Jun 18, 2025 | Distributed ComputingFederated Learning | —Unverified | 0 |
| RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments | Jun 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| All is Not Lost: LLM Recovery without Checkpoints | Jun 18, 2025 | AllScheduling | CodeCode Available | 1 |
| SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning | Jun 18, 2025 | Caption GenerationDescriptive | CodeCode Available | 2 |
| Evaluation Pipeline for systematically searching for Anomaly Detection Systems | Jun 18, 2025 | Anomaly Detection | —Unverified | 0 |
| Multi-Agent Reinforcement Learning for Autonomous Multi-Satellite Earth Observation: A Realistic Case Study | Jun 18, 2025 | Earth ObservationManagement | —Unverified | 0 |
| PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction | Jun 18, 2025 | Sentencetext-to-speech | —Unverified | 0 |
| video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models | Jun 18, 2025 | Audio captioningLarge Language Model | CodeCode Available | 2 |
| Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach | Jun 18, 2025 | Prompt EngineeringRetrieval | —Unverified | 0 |
| Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles | Jun 18, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses | Jun 18, 2025 | Large Language Model | —Unverified | 0 |
| PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning | Jun 18, 2025 | LLM-generated Text DetectionMisinformation | —Unverified | 0 |
| Transit for All: Mapping Equitable Bike2Subway Connection using Region Representation Learning | Jun 18, 2025 | AllRepresentation Learning | —Unverified | 0 |
| Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers | Jun 18, 2025 | Chatbot | —Unverified | 0 |
| Accessible Gesture-Driven Augmented Reality Interaction System | Jun 18, 2025 | Federated LearningGesture Recognition | —Unverified | 0 |
| An Empirical Study of Bugs in Data Visualization Libraries | Jun 18, 2025 | Data VisualizationDecision Making | —Unverified | 0 |
| Steering Your Diffusion Policy with Latent Space Reinforcement Learning | Jun 18, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos | Jun 18, 2025 | Object | —Unverified | 0 |
| RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation | Jun 18, 2025 | Depth EstimationDepth Prediction | —Unverified | 0 |
| Model Predictive Path-Following Control for a Quadrotor | Jun 18, 2025 | Model Predictive Control | —Unverified | 0 |
| MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System | Jun 18, 2025 | ObjectObject SLAM | —Unverified | 0 |
| Correspondence-Free Multiview Point Cloud Registration via Depth-Guided Joint Optimisation | Jun 18, 2025 | Point Cloud Registration | —Unverified | 0 |
| HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models | Jun 18, 2025 | Hallucination | —Unverified | 0 |
| An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW | Jun 18, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Jun 18, 2025 | Federated Learning | —Unverified | 0 |
| Veracity: An Open-Source AI Fact-Checking System | Jun 18, 2025 | Fact CheckingMisinformation | —Unverified | 0 |
| In-Context Learning for Gradient-Free Receiver Adaptation: Principles, Applications, and Theory | Jun 18, 2025 | In-Context LearningMeta-Learning | —Unverified | 0 |
| cAST: Enhancing Code Retrieval-Augmented Generation with Structural Chunking via Abstract Syntax Tree | Jun 18, 2025 | ChunkingCode Generation | CodeCode Available | 2 |
| I Know Which LLM Wrote Your Code Last Summer: LLM generated Code Stylometry for Authorship Attribution | Jun 18, 2025 | Authorship AttributionBinary Classification | —Unverified | 0 |
| 4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation | Jun 18, 2025 | 3D Reconstruction4D reconstruction | —Unverified | 0 |
| InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding | Jun 18, 2025 | GPUStreaming video understanding | —Unverified | 0 |
| LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning | Jun 18, 2025 | Attribute | CodeCode Available | 0 |
| Show-o2: Improved Native Unified Multimodal Models | Jun 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Mix-of-Language-Experts Architecture for Multilingual Programming | Jun 18, 2025 | | CodeCode Available | 0 |
| HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges | Jun 18, 2025 | Combinatorial Optimization | CodeCode Available | 2 |
| Finance Language Model Evaluation (FLaME) | Jun 18, 2025 | BenchmarkingLanguage Model Evaluation | —Unverified | 0 |