| Reasoning Language Models: A Blueprint | Jan 20, 2025 | Reinforcement Learning (RL)Retrieval-augmented Generation | CodeCode Available | 2 |
| Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training | Jan 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Avoiding Shortcuts: Enhancing Channel-Robust Specific Emitter Identification via Single-Source Domain Generalization | Jan 20, 2025 | Contrastive LearningDomain Generalization | CodeCode Available | 2 |
| Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling | Jan 20, 2025 | Imitation LearningLanguage Modeling | CodeCode Available | 2 |
| A Survey on Diffusion Models for Anomaly Detection | Jan 20, 2025 | Anomaly DetectionComputational Efficiency | CodeCode Available | 2 |
| A generalizable 3D framework and model for self-supervised learning in medical imaging | Jan 20, 2025 | Medical Image SegmentationSelf-Supervised Learning | CodeCode Available | 2 |
| Recurrent Diffusion for Large-Scale Parameter Generation | Jan 20, 2025 | GPU | CodeCode Available | 2 |
| Investigating the Scalability of Approximate Sparse Retrieval Algorithms to Massive Datasets | Jan 20, 2025 | Retrieval | CodeCode Available | 2 |
| Beyond Any-Shot Adaptation: Predicting Optimization Outcome for Robustness Gains without Extra Pay | Jan 19, 2025 | | CodeCode Available | 2 |
| ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario | Jan 17, 2025 | | CodeCode Available | 2 |
| FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization | Jan 17, 2025 | Anomaly DetectionImage-text matching | CodeCode Available | 2 |
| Discrete Prior-based Temporal-coherent Content Prediction for Blind Face Video Restoration | Jan 17, 2025 | Video Restoration | CodeCode Available | 2 |
| Diffusion Models in Recommendation Systems: A Survey | Jan 17, 2025 | Collaborative FilteringRecommendation Systems | CodeCode Available | 2 |
| Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems | Jan 17, 2025 | Response Generation | CodeCode Available | 2 |
| LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Jan 17, 2025 | Change DetectionImage Classification | CodeCode Available | 2 |
| Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search | Jan 16, 2025 | Quantization | CodeCode Available | 2 |
| Scaling up self-supervised learning for improved surgical foundation models | Jan 16, 2025 | Self-Supervised LearningSemantic Segmentation | CodeCode Available | 2 |
| Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key | Jan 16, 2025 | 16kHallucination | CodeCode Available | 2 |
| Prompt-CAM: A Simpler Interpretable Transformer for Fine-Grained Analysis | Jan 16, 2025 | Explainable Artificial Intelligence (XAI)Explainable Models | CodeCode Available | 2 |
| CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation | Jan 16, 2025 | 3D Generation4k | CodeCode Available | 2 |
| Practical Continual Forgetting for Pre-trained Vision Models | Jan 16, 2025 | Continual ForgettingFace Recognition | CodeCode Available | 2 |
| AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation | Jan 16, 2025 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| A Simple Aerial Detection Baseline of Multimodal Language Models | Jan 16, 2025 | object-detectionObject Detection | CodeCode Available | 2 |
| The Devil is in Temporal Token: High Quality Video Reasoning Segmentation | Jan 15, 2025 | Reasoning SegmentationReferring Expression Segmentation | CodeCode Available | 2 |
| What Limits LLM-based Human Simulation: LLMs or Our Design? | Jan 15, 2025 | | CodeCode Available | 2 |