| OmniStereo: Real-time Omnidireactional Depth Estimation with Multiview Fisheye Cameras | Jan 1, 2025 | Autonomous DrivingDepth Estimation | CodeCode Available | 1 |
| Revisiting Generative Replay for Class Incremental Object Detection | Jan 1, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images | Jan 1, 2025 | 3D CanonicalizationZero-shot Generalization | CodeCode Available | 1 |
| Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding | Jan 1, 2025 | Hallucination | CodeCode Available | 1 |
| Blood Flow Speed Estimation with Optical Coherence Tomography Angiography Images | Jan 1, 2025 | Decoder | CodeCode Available | 1 |
| Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater | Jan 1, 2025 | Deep Reinforcement LearningSegmentation | CodeCode Available | 1 |
| Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering | Jan 1, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| DV-Matcher: Deformation-based Non-rigid Point Cloud Matching Guided by Pre-trained Visual Features | Jan 1, 2025 | | CodeCode Available | 1 |
| Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning | Jan 1, 2025 | cross-modal alignmentDenoising | CodeCode Available | 1 |
| Point Cloud Upsampling Using Conditional Diffusion Module with Adaptive Noise Suppression | Jan 1, 2025 | point cloud upsampling | CodeCode Available | 1 |
| SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation | Dec 31, 2024 | Cross-Domain Few-ShotGPR | CodeCode Available | 1 |
| TinyHelen's First Curriculum: Training and Evaluating Tiny Language Models in a Simpler Language Environment | Dec 31, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| KAE: Kolmogorov-Arnold Auto-Encoder for Representation Learning | Dec 31, 2024 | DenoisingRepresentation Learning | CodeCode Available | 1 |
| LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Dec 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing | Dec 31, 2024 | State Space Models | CodeCode Available | 1 |
| Lightweight G-YOLOv11: Advancing Efficient Fracture Detection in Pediatric Wrist X-rays | Dec 31, 2024 | Fracture detectionGPU | CodeCode Available | 1 |
| HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation | Dec 30, 2024 | Code GenerationHumanEval | CodeCode Available | 1 |
| A novel deep learning approach for facial emotion recognition: application to detecting emotional responses in elderly individuals with Alzheimer’s disease | Dec 30, 2024 | Emotion RecognitionFacial Emotion Recognition | CodeCode Available | 1 |
| Insights on Galaxy Evolution from Interpretable Sparse Feature Networks | Dec 30, 2024 | | CodeCode Available | 1 |
| Low-Light Image Enhancement via Generative Perceptual Priors | Dec 30, 2024 | Image EnhancementLow-Light Image Enhancement | CodeCode Available | 1 |
| Plancraft: an evaluation dataset for planning with LLM agents | Dec 30, 2024 | Decision MakingMinecraft | CodeCode Available | 1 |
| A Large-Scale Study on Video Action Dataset Condensation | Dec 30, 2024 | Action RecognitionDataset Condensation | CodeCode Available | 1 |
| Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner | Dec 30, 2024 | Question AnsweringTable Recognition | CodeCode Available | 1 |
| Length-Aware DETR for Robust Moment Retrieval | Dec 30, 2024 | Information RetrievalMoment Retrieval | CodeCode Available | 1 |
| Facilitating large language model Russian adaptation with Learned Embedding Propagation | Dec 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PyG-SSL: A Graph Self-Supervised Learning Toolkit | Dec 30, 2024 | Self-Supervised Learning | CodeCode Available | 1 |
| Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense | Dec 30, 2024 | Cloud ComputingCode Generation | CodeCode Available | 1 |
| TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation | Dec 30, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Frequency-Masked Embedding Inference: A Non-Contrastive Approach for Time Series Representation Learning | Dec 30, 2024 | Contrastive LearningLinear evaluation | CodeCode Available | 1 |
| GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search | Dec 30, 2024 | RAGRetrieval | CodeCode Available | 1 |
| TrajLearn: Trajectory Prediction Learning using Deep Generative Models | Dec 30, 2024 | Autonomous NavigationBenchmarking | CodeCode Available | 1 |
| Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration | Dec 30, 2024 | Blind Face RestorationPrompt Learning | CodeCode Available | 1 |
| DDIM sampling for Generative AIBIM, a faster intelligent structural design framework | Dec 30, 2024 | DenoisingLearning Theory | CodeCode Available | 1 |
| Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond) | Dec 29, 2024 | DeblurringImage Generation | CodeCode Available | 1 |
| Training-free Heterogeneous Model Merging | Dec 29, 2024 | model | CodeCode Available | 1 |
| ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding | Dec 29, 2024 | Video CompressionVideo Understanding | CodeCode Available | 1 |
| EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers | Dec 29, 2024 | Contrastive Learning | CodeCode Available | 1 |
| FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation | Dec 29, 2024 | FairnessImage Generation | CodeCode Available | 1 |
| Diminishing Return of Value Expansion Methods | Dec 29, 2024 | Model-based Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition | Dec 29, 2024 | Action Recognition | CodeCode Available | 1 |
| Stochastic gradient descent estimation of generalized matrix factorization models with application to single-cell RNA sequencing data | Dec 29, 2024 | Dimensionality ReductionMissing Values | CodeCode Available | 1 |
| PTQ4VM: Post-Training Quantization for Visual Mamba | Dec 29, 2024 | MambaQuantization | CodeCode Available | 1 |
| Exploiting Hybrid Policy in Reinforcement Learning for Interpretable Temporal Logic Manipulation | Dec 29, 2024 | Reinforcement Learning (RL) | CodeCode Available | 1 |
| The Fifth International Verification of Neural Networks Competition (VNN-COMP 2024): Summary and Results | Dec 28, 2024 | | CodeCode Available | 1 |
| TeLU Activation Function for Fast and Stable Deep Learning | Dec 28, 2024 | Computational EfficiencyDeep Learning | CodeCode Available | 1 |
| BaiJia: A Large-Scale Role-Playing Agent Corpus of Chinese Historical Characters | Dec 28, 2024 | | CodeCode Available | 1 |
| M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation | Dec 28, 2024 | Machine Translation | CodeCode Available | 1 |
| SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection | Dec 28, 2024 | Few-Shot Object DetectionLong-tailed Object Detection | CodeCode Available | 1 |
| On the Compositional Generalization of Multimodal LLMs for Medical Imaging | Dec 28, 2024 | | CodeCode Available | 1 |
| Federated Unlearning with Gradient Descent and Conflict Mitigation | Dec 28, 2024 | Federated Learning | CodeCode Available | 1 |