| Data-Efficient Challenges in Visual Inductive Priors: A Retrospective | Jun 10, 2025 | Data AugmentationDeep Learning | —Unverified | 0 |
| Draft-based Approximate Inference for LLMs | Jun 10, 2025 | | CodeCode Available | 1 |
| Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework | Jun 10, 2025 | Domain AdaptationIntent Detection | CodeCode Available | 0 |
| Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models | Jun 10, 2025 | 3D Lane Detection3D Object Detection | CodeCode Available | 3 |
| From Legal Texts to Defeasible Deontic Logic via LLMs: A Study in Automated Semantic Analysis | Jun 10, 2025 | Prompt Engineering | —Unverified | 0 |
| Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations | Jun 10, 2025 | cross-modal alignmentNavigate | —Unverified | 0 |
| Enhancing Video Memorability Prediction with Text-Motion Cross-modal Contrastive Loss and Its Application in Video Summarization | Jun 10, 2025 | PredictionVideo Summarization | —Unverified | 0 |
| RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping | Jun 10, 2025 | Video Editing | —Unverified | 0 |
| SurfR: Surface Reconstruction with Multi-scale Attention | Jun 10, 2025 | Surface Reconstruction | —Unverified | 0 |
| LLaVA-c: Continual Improved Visual Instruction Tuning | Jun 10, 2025 | Continual LearningContinual Pretraining | —Unverified | 0 |
| CanadaFireSat: Toward high-resolution wildfire forecasting with multiple modalities | Jun 10, 2025 | Deep LearningEarth Observation | —Unverified | 0 |
| Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting | Jun 10, 2025 | 3DGS3D Object Detection | —Unverified | 0 |
| HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation | Jun 10, 2025 | Human AnimationHuman-Object Interaction Detection | —Unverified | 0 |
| Cross-Spectral Body Recognition with Side Information Embedding: Benchmarks on LLCM and Analyzing Range-Induced Occlusions on IJB-MDF | Jun 10, 2025 | Occlusion HandlingPerson Re-Identification | —Unverified | 0 |
| ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations | Jun 10, 2025 | Objectobject-detection | —Unverified | 0 |
| Towards Robust Real-World Multivariate Time Series Forecasting: A Unified Framework for Dependency, Asynchrony, and Missingness | Jun 10, 2025 | Missing ValuesMultivariate Time Series Forecasting | —Unverified | 0 |
| Rethinking Range-View LiDAR Segmentation in Adverse Weather | Jun 10, 2025 | Computational EfficiencySegmentation | —Unverified | 0 |
| Generalizable Articulated Object Reconstruction from Casually Captured RGBD Videos | Jun 10, 2025 | ObjectObject Reconstruction | —Unverified | 0 |
| MAMBO: High-Resolution Generative Approach for Mammography Images | Jun 10, 2025 | Anomaly DetectionImage Generation | —Unverified | 0 |
| Can LLMs Ground when they (Don't) Know: A Study on Direct and Loaded Political Questions | Jun 10, 2025 | Misinformation | —Unverified | 0 |
| Structured Variational D-Decomposition for Accurate and Stable Low-Rank Approximation | Jun 10, 2025 | 2k | —Unverified | 0 |
| PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly | Jun 10, 2025 | Question AnsweringScene Understanding | —Unverified | 0 |
| On The Impact of Merge Request Deviations on Code Review Practices | Jun 10, 2025 | Feature ImportanceFew-Shot Learning | —Unverified | 0 |
| Explainable Compliance Detection with Multi-Hop Natural Language Inference on Assurance Case Structure | Jun 10, 2025 | Natural Language Inference | —Unverified | 0 |
| Teaching Physical Awareness to LLMs through Sounds | Jun 10, 2025 | Direction of Arrival Estimation | —Unverified | 0 |
| FloorplanMAE:A self-supervised framework for complete floorplan generation from partial inputs | Jun 10, 2025 | Self-Supervised Learning | —Unverified | 0 |
| RHealthTwin: Towards Responsible and Multimodal Digital Twins for Personalized Well-being | Jun 10, 2025 | HallucinationInstruction Following | —Unverified | 0 |
| Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task | Jun 10, 2025 | EEG | —Unverified | 0 |
| Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation | Jun 10, 2025 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| IntTrajSim: Trajectory Prediction for Simulating Multi-Vehicle driving at Signalized Intersections | Jun 10, 2025 | Trajectory Prediction | —Unverified | 0 |
| Evaluating Generative Vehicle Trajectory Models for Traffic Intersection Dynamics | Jun 10, 2025 | Trajectory Forecasting | —Unverified | 0 |
| VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Jun 10, 2025 | Task PlanningVisual Reasoning | —Unverified | 0 |
| FROST-EMA: Finnish and Russian Oral Speech Dataset of Electromagnetic Articulography Measurements with L1, L2 and Imitated L2 Accents | Jun 10, 2025 | Speaker Verification | —Unverified | 0 |
| SEMA: a Scalable and Efficient Mamba like Attention via Token Localization and Averaging | Jun 10, 2025 | Mamba | —Unverified | 0 |
| Your Agent Can Defend Itself against Backdoor Attacks | Jun 10, 2025 | Large Language Model | —Unverified | 0 |
| SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models | Jun 10, 2025 | Backdoor AttackKeyword Spotting | —Unverified | 0 |
| Reinforce LLM Reasoning through Multi-Agent Reflection | Jun 10, 2025 | MathOut-of-Distribution Generalization | —Unverified | 0 |
| Spatiotemporal deep learning models for detection of rapid intensification in cyclones | Jun 10, 2025 | Data AugmentationDeep Learning | —Unverified | 0 |
| HASFL: Heterogeneity-aware Split Federated Learning over Edge Computing Systems | Jun 10, 2025 | Edge-computingFederated Learning | —Unverified | 0 |
| Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive-k | Jun 10, 2025 | Open-Domain Question AnsweringQuestion Answering | —Unverified | 0 |
| Re-Thinking the Automatic Evaluation of Image-Text Alignment in Text-to-Image Models | Jun 10, 2025 | Image GenerationText to Image Generation | —Unverified | 0 |
| DCD: A Semantic Segmentation Model for Fetal Ultrasound Four-Chamber View | Jun 10, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Fairness is Not Silence: Unmasking Vacuous Neutrality in Small Language Models | Jun 10, 2025 | Fairness | —Unverified | 0 |
| MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding | Jun 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TrajFlow: Multi-modal Motion Prediction via Flow Matching | Jun 10, 2025 | Autonomous Drivingmotion prediction | —Unverified | 0 |
| Flow-Lenia: Emergent evolutionary dynamics in mass conservative continuous cellular automata | Jun 10, 2025 | Artificial Life | —Unverified | 0 |
| Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation | Jun 10, 2025 | Audio inpaintingMusic Generation | —Unverified | 0 |
| Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization | Jun 10, 2025 | Image CompressionQuantization | —Unverified | 0 |
| Societal AI Research Has Become Less Interdisciplinary | Jun 10, 2025 | FairnessMisinformation | —Unverified | 0 |
| Multimodal Representation Alignment for Cross-modal Information Retrieval | Jun 10, 2025 | Cross-Modal Information RetrievalInformation Retrieval | —Unverified | 0 |