| AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models | Jul 7, 2025 | Articles, Large Language Model | Unverified | 0 |
| OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts | Jul 7, 2025 | Image Segmentation, Panoptic Segmentation | Unverified | 0 |
| Information-Guided Diffusion Sampling for Dataset Distillation | Jul 7, 2025 | Dataset Distillation | Unverified | 0 |
| Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning | Jul 7, 2025 | Backdoor Attack, Deep Reinforcement Learning | Unverified | 0 |
| Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking | Jul 7, 2025 | Autonomous Vehicles, Object Tracking | Unverified | 0 |
| Trojan Horse Prompting: Jailbreaking Conversational Multimodal Models by Forging Assistant Message | Jul 7, 2025 | Image Generation, Safety Alignment | Unverified | 0 |
| High Order Collaboration-Oriented Federated Graph Neural Network for Accurate QoS Prediction | Jul 7, 2025 | Computational Efficiency, Graph Neural Network | Unverified | 0 |
| A Federated Learning-based Lightweight Network with Zero Trust for UAV Authentication | Jul 7, 2025 | Federated Learning | Unverified | 0 |
| Interest Networks (iNETs) for Cities: Cross-Platform Insights and Urban Behavior Explanations | Jul 7, 2025 | Explainable Recommendation, Recommendation Systems | Unverified | 0 |
| Spatial and Semantic Embedding Integration for Stereo Sound Event Localization and Detection in Regular Videos | Jul 7, 2025 | Sound Event Localization and Detection | Unverified | 0 |
| Estimating Interventional Distributions with Uncertain Causal Graphs through Meta-Learning | Jul 7, 2025 | Bayesian Inference, Causal Inference | Unverified | 0 |
| Inaugural MOASEI Competition at AAMAS'2025: A Technical Report | Jul 7, 2025 | Benchmarking, Decision Making | Unverified | 0 |
| Meta-Learning Transformers to Improve In-Context Generalization | Jul 7, 2025 | In-Context Learning, Meta-Learning | Unverified | 0 |
| An analysis of vision-language models for fabric retrieval | Jul 7, 2025 | Attribute, Cross-Modal Retrieval | Unverified | 0 |
| Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR | Jul 7, 2025 | Loop Closure Detection, Point Cloud Generation | Unverified | 0 |
| Piggyback Camera: Easy-to-Deploy Visual Surveillance by Mobile Sensing on Commercial Robot Vacuums | Jul 7, 2025 | Data Augmentation | Unverified | 0 |
| 4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture | Jul 7, 2025 | 4D reconstruction | Unverified | 0 |
| ReLoop: "Seeing Twice and Thinking Backwards" via Closed-loop Training to Mitigate Hallucinations in Multimodal understanding | Jul 7, 2025 | Hallucination, Question Answering | Unverified | 0 |
| UGG-ReID: Uncertainty-Guided Graph Model for Multi-Modal Object Re-Identification | Jul 7, 2025 | Mixture-of-Experts | Unverified | 0 |
| Motion Generation: A Survey of Generative Approaches and Benchmarks | Jul 7, 2025 | Motion Generation, Survey | Unverified | 0 |
| Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures | Jul 7, 2025 | 3D Generation, Computational chemistry | Unverified | 0 |
| Kalman Filter Aided Federated Koopman Learning | Jul 7, 2025 | Federated Learning | Unverified | 0 |
| Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Jul 7, 2025 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Neural-Driven Image Editing | Jul 7, 2025 | Contrastive Learning, Multimodel-guided image editing | Code Available | 2 |
| Heterogeneous User Modeling for LLM-based Recommendation | Jul 7, 2025 | | Code Available | 0 |
| Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning | Jul 7, 2025 | Image Generation | Code Available | 0 |
| Blind Targeting: Personalization under Third-Party Privacy Constraints | Jul 7, 2025 | Bayesian Optimization, Privacy Preserving | Unverified | 0 |
| RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction | Jul 7, 2025 | | Code Available | 2 |
| Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model | Jul 7, 2025 | Image Retrieval, Language Modeling | Unverified | 0 |
| any4: Learned 4-bit Numeric Representation for LLMs | Jul 7, 2025 | GPU, GSM8K | Code Available | 2 |
| FedPall: Prototype-based Adversarial and Collaborative Learning for Federated Learning with Feature Drift | Jul 7, 2025 | Federated Learning | Code Available | 0 |
| LoomNet: Enhancing Multi-View Image Generation via Latent Space Weaving | Jul 7, 2025 | Image Generation, Surface Reconstruction | Unverified | 0 |
| Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training | Jul 7, 2025 | General Knowledge, MMLU | Unverified | 0 |
| Modeling (Deontic) Modal Operators With the s(CASP) Goal-directed Predicate Answer Set Programming System | Jul 7, 2025 | Negation | Unverified | 0 |
| CorrDetail: Visual Detail Enhanced Self-Correction for Face Forgery Detection | Jul 7, 2025 | Face Swapping, Image Generation | Unverified | 0 |
| YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries | Jul 7, 2025 | Autonomous Navigation, Domain Adaptation | Unverified | 0 |
| LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework | Jul 7, 2025 | | Code Available | 1 |
| BackFed: An Efficient & Standardized Benchmark Suite for Backdoor Attacks in Federated Learning | Jul 7, 2025 | Federated Learning | Code Available | 2 |
| The Extended SONICOM HRTF Dataset and Spatial Audio Metrics Toolbox | Jul 7, 2025 | | Code Available | 1 |
| SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model | Jul 7, 2025 | Anatomy, Image Generation | Code Available | 1 |
| DESIGN AND IMPLEMENTATION OF ONLINE CLEARANCE REPORT. | Jul 7, 2025 | All, Management | Unverified | 0 |
| ModelCitizens: Representing Community Voices in Online Safety | Jul 7, 2025 | | Unverified | 0 |
| Digital clock and calender management system project report. | Jul 7, 2025 | Decoder, Management | Unverified | 0 |
| S^2Edit: Text-Guided Image Editing with Precise Semantic and Spatial Control | Jul 7, 2025 | text-guided-image-editing | Unverified | 0 |
| Evolutionary and Coevolutionary Multi-Agent Design Choices and Dynamics | Jul 7, 2025 | Evolutionary Algorithms | Unverified | 0 |
| DeepRetro: Retrosynthetic Pathway Discovery using Iterative LLM Reasoning | Jul 7, 2025 | Hallucination, Large Language Model | Unverified | 0 |
| CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering | Jul 7, 2025 | Text Generation | Unverified | 0 |
| XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL | Jul 7, 2025 | Text to SQL, Text-To-SQL | Code Available | 4 |
| VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting | Jul 7, 2025 | Depth Estimation, Vision-Language-Action | Code Available | 1 |
| EduCoder: An Open-Source Annotation System for Education Transcript Data | Jul 7, 2025 | text annotation | Code Available | 0 |