| Times2D: Multi-Period Decomposition and Derivative Mapping for General Time Series Forecasting | Mar 31, 2025 | energy managementTime Series | CodeCode Available | 1 |
| TuRTLe: A Unified Evaluation of LLMs for RTL Generation | Mar 31, 2025 | Code Generation | CodeCode Available | 1 |
| ZeroMimic: Distilling Robotic Manipulation Skills from Web Videos | Mar 31, 2025 | Imitation Learning | CodeCode Available | 1 |
| MaintainCoder: Maintainable Code Generation Under Dynamic Requirements | Mar 31, 2025 | Code Generation | CodeCode Available | 1 |
| Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute | Mar 31, 2025 | Fault localization | CodeCode Available | 1 |
| IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration | Mar 31, 2025 | Deformable Medical Image RegistrationImage Registration | CodeCode Available | 1 |
| SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers | Mar 31, 2025 | Benchmarking | CodeCode Available | 1 |
| InteractiveSurvey: An LLM-based Personalized and Interactive Survey Paper Generation System | Mar 31, 2025 | Paper generationRAG | CodeCode Available | 1 |
| Rethinking Key-Value Cache Compression Techniques for Large Language Model Serving | Mar 31, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| 3D Dental Model Segmentation with Geometrical Boundary Preserving | Mar 31, 2025 | Segmentation | CodeCode Available | 1 |
| Boosting MLLM Reasoning with Text-Debiased Hint-GRPO | Mar 31, 2025 | Mathematical ReasoningMultimodal Reasoning | CodeCode Available | 1 |
| Exploring Temporal Dynamics in Event-based Eye Tracker | Mar 31, 2025 | Mamba | CodeCode Available | 1 |
| Spectral-Adaptive Modulation Networks for Visual Perception | Mar 31, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Towards Understanding How Knowledge Evolves in Large Vision-Language Models | Mar 31, 2025 | | CodeCode Available | 1 |
| Can Test-Time Scaling Improve World Foundation Model? | Mar 31, 2025 | Autonomous Driving | CodeCode Available | 1 |
| It's a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data | Mar 31, 2025 | text annotation | CodeCode Available | 1 |
| Universal Zero-shot Embedding Inversion | Mar 31, 2025 | | CodeCode Available | 1 |
| GenSwarm: Scalable Multi-Robot Code-Policy Generation and Deployment via Language Models | Mar 31, 2025 | Zero-Shot Learning | CodeCode Available | 1 |
| EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Mar 30, 2025 | AttributeDisentanglement | CodeCode Available | 1 |
| DASH: Detection and Assessment of Systematic Hallucinations of VLMs | Mar 30, 2025 | Object | CodeCode Available | 1 |
| Enhancing Creative Generation on Stable Diffusion-based Models | Mar 30, 2025 | Denoising | CodeCode Available | 1 |
| LaViC: Adapting Large Vision-Language Models to Visually-Aware Conversational Recommendation | Mar 30, 2025 | Conversational RecommendationRecommendation Systems | CodeCode Available | 1 |
| Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages | Mar 30, 2025 | Automatic Speech RecognitionLanguage Modeling | CodeCode Available | 1 |
| COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation | Mar 30, 2025 | Test-time Adaptation | CodeCode Available | 1 |
| A Survey on Unlearnable Data | Mar 30, 2025 | Machine UnlearningSurvey | CodeCode Available | 1 |
| Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model | Mar 30, 2025 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 1 |
| A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control | Mar 30, 2025 | FairnessMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| POINT^2: A Polymer Informatics Training and Testing Database | Mar 30, 2025 | Uncertainty Quantification | CodeCode Available | 1 |
| LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search | Mar 30, 2025 | Information Retrieval | CodeCode Available | 1 |
| Language Guided Concept Bottleneck Models for Interpretable Continual Learning | Mar 30, 2025 | Continual LearningDecision Making | CodeCode Available | 1 |
| BiPVL-Seg: Bidirectional Progressive Vision-Language Fusion with Global-Local Alignment for Medical Image Segmentation | Mar 30, 2025 | cross-modal alignmentImage Segmentation | CodeCode Available | 1 |
| Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction | Mar 29, 2025 | Autonomous VehiclesDecoder | CodeCode Available | 1 |
| AstroAgents: A Multi-Agent AI for Hypothesis Generation from Mass Spectrometry Data | Mar 29, 2025 | Large Language Model | CodeCode Available | 1 |
| Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval | Mar 29, 2025 | AllLanguage Modeling | CodeCode Available | 1 |
| SuperEIO: Self-Supervised Event Feature Learning for Event Inertial Odometry | Mar 29, 2025 | Graph Neural NetworkLow-latency processing | CodeCode Available | 1 |
| ShiftLIC: Lightweight Learned Image Compression with Spatial-Channel Shift Operations | Mar 29, 2025 | Image Compression | CodeCode Available | 1 |
| Aurelia: Test-time Reasoning Distillation in Audio-Visual LLMs | Mar 29, 2025 | | CodeCode Available | 1 |
| STSA: Spatial-Temporal Semantic Alignment for Visual Dubbing | Mar 29, 2025 | | CodeCode Available | 1 |
| RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning | Mar 29, 2025 | Chart Question AnsweringChart Understanding | CodeCode Available | 1 |
| Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization | Mar 28, 2025 | Audio GenerationFAD | CodeCode Available | 1 |
| QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks? | Mar 28, 2025 | Logical ReasoningMath | CodeCode Available | 1 |
| tempdisagg: A Python Framework for Temporal Disaggregation of Time Series Data | Mar 28, 2025 | ImputationMissing Values | CodeCode Available | 1 |
| Multi-modal Knowledge Distillation-based Human Trajectory Forecasting | Mar 28, 2025 | Autonomous DrivingKnowledge Distillation | CodeCode Available | 1 |
| Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy Prediction | Mar 28, 2025 | Autonomous DrivingScene Understanding | CodeCode Available | 1 |
| Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis | Mar 28, 2025 | 3D Question Answering (3D-QA)3D visual grounding | CodeCode Available | 1 |
| DIFFER: Disentangling Identity Features via Semantic Cues for Clothes-Changing Person Re-ID | Mar 28, 2025 | Clothes Changing Person Re-IdentificationDisentanglement | CodeCode Available | 1 |
| Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes | Mar 28, 2025 | Audio TaggingSemantic Segmentation | CodeCode Available | 1 |
| FLIP: Towards Comprehensive and Reliable Evaluation of Federated Prompt Learning | Mar 28, 2025 | Federated LearningPrompt Learning | CodeCode Available | 1 |
| EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric Videos | Mar 28, 2025 | BenchmarkingQuestion Answering | CodeCode Available | 1 |
| VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow | Mar 28, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 1 |