| Dynamics-incorporated Modeling Framework for Stability Constrained Scheduling Under High-penetration of Renewable Energy | Jan 10, 2025 | Scheduling | CodeCode Available | 1 |
| MS-Temba : Multi-Scale Temporal Mamba for Efficient Temporal Action Detection | Jan 10, 2025 | Action DetectionGPU | CodeCode Available | 1 |
| DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information | Jan 10, 2025 | BenchmarkingData Augmentation | CodeCode Available | 1 |
| From Mesh Completion to AI Designed Crown | Jan 9, 2025 | | CodeCode Available | 1 |
| Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence | Jan 9, 2025 | Change DetectionZero-shot Generalization | CodeCode Available | 1 |
| Uncertainty-aware Knowledge Tracing | Jan 9, 2025 | Contrastive LearningKnowledge Tracing | CodeCode Available | 1 |
| Battling the Non-stationarity in Time Series Forecasting via Test-time Adaptation | Jan 9, 2025 | Test-time AdaptationTime Series | CodeCode Available | 1 |
| AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder | Jan 9, 2025 | Pitch ClassificationPitch control | CodeCode Available | 1 |
| AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Jan 9, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation | Jan 9, 2025 | DecoderReferring Expression | CodeCode Available | 1 |
| VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models | Jan 9, 2025 | BenchmarkingMathematical Problem-Solving | CodeCode Available | 1 |
| A Flexible and Scalable Framework for Video Moment Search | Jan 9, 2025 | Moment RetrievalRe-Ranking | CodeCode Available | 1 |
| SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution | Jan 9, 2025 | GitHub issue resolutionRetrieval | CodeCode Available | 1 |
| Continuous Knowledge-Preserving Decomposition for Few-Shot Continual Learning | Jan 9, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning | Jan 9, 2025 | | CodeCode Available | 1 |
| Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks | Jan 9, 2025 | Continual Learningimage-classification | CodeCode Available | 1 |
| Load Forecasting for Households and Energy Communities: Are Deep Learning Models Worth the Effort? | Jan 9, 2025 | Deep LearningLoad Forecasting | CodeCode Available | 1 |
| SensorQA: A Question Answering Benchmark for Daily-Life Monitoring | Jan 9, 2025 | Question Answering | CodeCode Available | 1 |
| Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration | Jan 9, 2025 | | CodeCode Available | 1 |
| D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription | Jan 9, 2025 | DenoisingImage Segmentation | CodeCode Available | 1 |
| Solving the Catastrophic Forgetting Problem in Generalized Category Discovery | Jan 9, 2025 | | CodeCode Available | 1 |
| Demystifying Domain-adaptive Post-training for Financial LLMs | Jan 9, 2025 | Continual PretrainingDomain Adaptation | CodeCode Available | 1 |
| Progressive Supervision via Label Decomposition: An Long-Term and Large-Scale Wireless Traffic Forecasting Method | Jan 9, 2025 | | CodeCode Available | 1 |
| Plug-and-Play DISep: Separating Dense Instances for Scene-to-Pixel Weakly-Supervised Change Detection in High-Resolution Remote Sensing Images | Jan 9, 2025 | Change Detection | CodeCode Available | 1 |
| Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes | Jan 9, 2025 | NeRF | CodeCode Available | 1 |
| ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Jan 9, 2025 | FairnessHallucination | CodeCode Available | 1 |
| EDMB: Edge Detector with Mamba | Jan 8, 2025 | Edge DetectionMamba | CodeCode Available | 1 |
| S2 Chunking: A Hybrid Framework for Document Segmentation Through Integrated Spatial and Semantic Analysis | Jan 8, 2025 | ArticlesChunking | CodeCode Available | 1 |
| ContextMRI: Enhancing Compressed Sensing MRI through Metadata Conditioning | Jan 8, 2025 | Anatomycompressed sensing | CodeCode Available | 1 |
| Rethinking High-speed Image Reconstruction Framework with Spike Camera | Jan 8, 2025 | Image Reconstruction | CodeCode Available | 1 |
| Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting | Jan 8, 2025 | | CodeCode Available | 1 |
| Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs | Jan 8, 2025 | Contrastive Learning | CodeCode Available | 1 |
| Unified Coding for Both Human Perception and Generalized Machine Analytics with CLIP Supervision | Jan 8, 2025 | Image Compression | CodeCode Available | 1 |
| DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models | Jan 8, 2025 | Quantization | CodeCode Available | 1 |
| Neural Parameter Estimation with Incomplete Data | Jan 8, 2025 | parameter estimation | CodeCode Available | 1 |
| Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts | Jan 8, 2025 | | CodeCode Available | 1 |
| Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion | Jan 8, 2025 | Video Editing | CodeCode Available | 1 |
| Histologic Dataset of Normal and Atypical Mitotic Figures on Human Breast Cancer (AMi-Br) | Jan 8, 2025 | | CodeCode Available | 1 |
| Online Gaussian Test-Time Adaptation of Vision-Language Models | Jan 8, 2025 | Test-time Adaptation | CodeCode Available | 1 |
| DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications | Jan 8, 2025 | Computational Efficiency | CodeCode Available | 1 |
| Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A Survey | Jan 7, 2025 | ArticlesAutonomous Driving | CodeCode Available | 1 |
| Can LLMs Design Good Questions Based on Context? | Jan 7, 2025 | | CodeCode Available | 1 |
| RecKG: Knowledge Graph for Recommender Systems | Jan 7, 2025 | AttributeData Integration | CodeCode Available | 1 |
| Entropy-Guided Attention for Private LLMs | Jan 7, 2025 | | CodeCode Available | 1 |
| Stochastic Process Learning via Operator Flow Matching | Jan 7, 2025 | Density Estimationregression | CodeCode Available | 1 |
| LM-Net: A Light-weight and Multi-scale Network for Medical Image Segmentation | Jan 7, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Dual-level Adaptive Incongruity-enhanced Model for Multimodal Sarcasm Detection | Jan 7, 2025 | Contrastive LearningSarcasm Detection | CodeCode Available | 1 |
| FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis | Jan 7, 2025 | DenoisingSSIM | CodeCode Available | 1 |
| Unsupervised Speech Segmentation: A General Approach Using Speech Language Models | Jan 7, 2025 | Boundary DetectionSegmentation | CodeCode Available | 1 |
| VLM-driven Behavior Tree for Context-aware Task Planning | Jan 7, 2025 | Task Planning | CodeCode Available | 1 |