| Mitigating Object Hallucinations via Sentence-Level Early Intervention | Jul 16, 2025 | HallucinationMM-Vet | CodeCode Available | 1 |
| SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation | Jul 16, 2025 | 3DGSCamera Pose Estimation | CodeCode Available | 0 |
| Similarity Memory Prior is All You Need for Medical Image Segmentation | Jul 15, 2025 | | CodeCode Available | 0 |
| EditGen: Harnessing Cross-Attention Control for Instruction-Based Auto-Regressive Audio Editing | Jul 15, 2025 | | CodeCode Available | 0 |
| Interpretable Prediction of Lymph Node Metastasis in Rectal Cancer MRI Using Variational Autoencoders | Jul 15, 2025 | | CodeCode Available | 0 |
| Subgraph Generation for Generalizing on Out-of-Distribution Links | Jul 15, 2025 | | CodeCode Available | 0 |
| Internal Value Alignment in Large Language Models through Controlled Value Vector Activation | Jul 15, 2025 | | CodeCode Available | 0 |
| Bridging Literature and the Universe Via A Multi-Agent Large Language Model System | Jul 15, 2025 | | CodeCode Available | 0 |
| ProactiveVideoQA: A Comprehensive Benchmark Evaluating Proactive Interactions in Video Large Language Models | Jul 15, 2025 | | CodeCode Available | 0 |
| Trexplorer Super: Topologically Correct Centerline Tree Tracking of Tubular Objects in CT Volumes | Jul 15, 2025 | | CodeCode Available | 0 |
| On the Effect of Instruction Tuning Loss on Generalization | Jul 15, 2025 | | CodeCode Available | 0 |
| Conceptualizing Multi-scale Wavelet Attention and Ray-based Encoding for Human-Object Interaction Detection | Jul 15, 2025 | | CodeCode Available | 0 |
| Team HUMANE at AVeriTeC 2025: HerO 2 for Efficient Fact Verification | Jul 15, 2025 | | CodeCode Available | 0 |
| Focus on Texture: Rethinking Pre-training in Masked Autoencoders for Medical Image Classification | Jul 15, 2025 | | CodeCode Available | 0 |
| Function-to-Style Guidance of LLMs for Code Translation | Jul 15, 2025 | Code TranslationTranslation | —Unverified | 0 |
| A Graph-in-Graph Learning Framework for Drug-Target Interaction Prediction | Jul 15, 2025 | Drug DiscoveryGraph Learning | —Unverified | 0 |
| Data-Driven Meta-Analysis and Public-Dataset Evaluation for Sensor-Based Gait Age Estimation | Jul 15, 2025 | Age EstimationSensor Fusion | —Unverified | 0 |
| Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis | Jul 15, 2025 | MarketingOptical Character Recognition | —Unverified | 0 |
| ZKP-FedEval: Verifiable and Privacy-Preserving Federated Evaluation using Zero-Knowledge Proofs | Jul 15, 2025 | Activity RecognitionFederated Learning | —Unverified | 0 |
| Sporadic Federated Learning Approach in Quantum Environment to Tackle Quantum Noise | Jul 15, 2025 | Federated Learning | —Unverified | 0 |
| Physically Based Neural LiDAR Resimulation | Jul 15, 2025 | 3D Scene ReconstructionNovel View Synthesis | CodeCode Available | 0 |
| Langevin Flows for Modeling Neural Latent Dynamics | Jul 15, 2025 | | CodeCode Available | 0 |
| Streaming 4D Visual Geometry Transformer | Jul 15, 2025 | 4D reconstructionPhilosophy | CodeCode Available | 4 |
| Generative Click-through Rate Prediction with Applications to Search Advertising | Jul 15, 2025 | Click-Through Rate PredictionPrediction | —Unverified | 0 |
| Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping | Jul 15, 2025 | Instance Segmentationobject-detection | —Unverified | 0 |
| Optimal Sensor Scheduling and Selection for Continuous-Discrete Kalman Filtering with Auxiliary Dynamics | Jul 15, 2025 | SchedulingState Space Models | CodeCode Available | 0 |
| Implementing Adaptations for Vision AutoRegressive Model | Jul 15, 2025 | Image Generationmodel | CodeCode Available | 0 |
| Acting and Planning with Hierarchical Operational Models on a Mobile Robot: A Study with RAE+UPOM | Jul 15, 2025 | Decision Making | —Unverified | 0 |
| Real-Time Bayesian Detection of Drift-Evasive GNSS Spoofing in Reinforcement Learning Based UAV Deconfliction | Jul 15, 2025 | Change Point DetectionReinforcement Learning (RL) | —Unverified | 0 |
| Illuminating the Three Dogmas of Reinforcement Learning under Evolutionary Light | Jul 15, 2025 | Reinforcement Learning (RL) | —Unverified | 0 |
| COLI: A Hierarchical Efficient Compressor for Large Images | Jul 15, 2025 | SSIM | —Unverified | 0 |
| CATVis: Context-Aware Thought Visualization | Jul 15, 2025 | cross-modal alignmentEEG | —Unverified | 0 |
| CogDDN: A Cognitive Demand-Driven Navigation with Decision Optimization and Dual-Process Thinking | Jul 15, 2025 | Decision MakingNavigate | —Unverified | 0 |
| A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction | Jul 15, 2025 | Representation LearningSurface Reconstruction | —Unverified | 0 |
| P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge | Jul 15, 2025 | Speech Enhancementtext-to-speech | —Unverified | 0 |
| 3C-FBI: A Combinatorial method using Convolutions for Circle Fitting in Blurry Images | Jul 15, 2025 | CPUDensity Estimation | —Unverified | 0 |
| The model is the message: Lightweight convolutional autoencoders applied to noisy imaging data for planetary science and astrobiology | Jul 15, 2025 | Image Reconstruction | —Unverified | 0 |
| ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition | Jul 15, 2025 | 3D visual groundingVisual Grounding | —Unverified | 0 |
| Detección y Cuantificación de Erosión Fluvial con Visión Artificial | Jul 15, 2025 | Decision Making | —Unverified | 0 |
| 3D Magnetic Inverse Routine for Single-Segment Magnetic Field Images | Jul 15, 2025 | Image Reconstruction | —Unverified | 0 |
| Fine-Grained Chinese Hate Speech Understanding: Span-Level Resources, Coded Term Lexicon, and Enhanced Detection Frameworks | Jul 15, 2025 | Hate Speech Detection | —Unverified | 0 |
| Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding | Jul 15, 2025 | Math | —Unverified | 0 |
| RMAU-NET: A Residual-Multihead-Attention U-Net Architecture for Landslide Segmentation and Detection from Remote Sensing Images | Jul 15, 2025 | Landslide segmentation | —Unverified | 0 |
| KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model | Jul 15, 2025 | Keypoint DetectionLanguage Modeling | —Unverified | 0 |
| Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander | Jul 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling | Jul 15, 2025 | 3D geometry | —Unverified | 0 |
| Journalism-Guided Agentic In-Context Learning for News Stance Detection | Jul 15, 2025 | ArticlesIn-Context Learning | —Unverified | 0 |
| A Multi-View High-Resolution Foot-Ankle Complex Point Cloud Dataset During Gait for Occlusion-Robust 3D Completion | Jul 15, 2025 | BenchmarkingPoint Cloud Completion | —Unverified | 0 |
| Personalized OVSS: Understanding Personal Concept in Open-Vocabulary Semantic Segmentation | Jul 15, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | —Unverified | 0 |
| Bridge Feature Matching and Cross-Modal Alignment with Mutual-filtering for Zero-shot Anomaly Detection | Jul 15, 2025 | Anomaly ClassificationAnomaly Detection | —Unverified | 0 |