| Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models | May 5, 2025 | Active Learning | Code Available | 2 |
| SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing | May 5, 2025 | Triplet | Code Available | 2 |
| RM-R1: Reward Modeling as Reasoning | May 5, 2025 | Math, Reinforcement Learning (RL) | Code Available | 2 |
| T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models | May 5, 2025 | Time Series, Time Series Generation | Code Available | 2 |
| No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves | May 5, 2025 | Image Generation, Representation Learning | Code Available | 2 |
| FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models | May 5, 2025 | Benchmarking, Mathematical Reasoning | Code Available | 2 |
| Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation | May 4, 2025 | Knowledge Distillation, Multivariate Time Series Forecasting | Code Available | 2 |
| MemEngine: A Unified and Modular Library for Developing Advanced Memory of LLM-based Agents | May 4, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| An Empirical Study of Qwen3 Quantization | May 4, 2025 | Natural Language Understanding, Quantization | Code Available | 2 |
| SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy Demonstrations | May 4, 2025 | Data Augmentation | Code Available | 2 |
| PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking | May 3, 2025 | Blind Docking, Molecular Docking | Code Available | 2 |
| A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency | May 3, 2025 | | Code Available | 2 |
| CostFilter-AD: Enhancing Anomaly Detection through Matching Cost Filtering | May 2, 2025 | Anomaly Detection, Unsupervised Anomaly Detection | Code Available | 2 |
| Don't be lazy: CompleteP enables compute-efficient deep transformers | May 2, 2025 | | Code Available | 2 |
| CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking | May 2, 2025 | Multi-Object Tracking, Object Tracking | Code Available | 2 |
| MINERVA: Evaluating Complex Video Reasoning | May 1, 2025 | Benchmarking, Temporal Localization | Code Available | 2 |
| Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook | May 1, 2025 | Benchmarking, Change Detection | Code Available | 2 |
| LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving | May 1, 2025 | Autonomous Driving | Code Available | 2 |
| Explainable AI in Spatial Analysis | May 1, 2025 | Bias Detection, Explainable artificial intelligence | Code Available | 2 |
| One Net to Rule Them All: Domain Randomization in Quadcopter Racing Across Different Platforms | Apr 30, 2025 | All | Code Available | 2 |
| Noise Modeling in One Hour: Minimizing Preparation Efforts for Self-supervised Low-Light RAW Image Denoising | Apr 30, 2025 | Denoising, Image Denoising | Code Available | 2 |
| mAIstro: an open-source multi-agentic system for automated end-to-end development of radiomics and deep learning models for medical imaging | Apr 30, 2025 | AI Agent, Classification | Code Available | 2 |
| HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation | Apr 30, 2025 | Depth Estimation, Scene Generation | Code Available | 2 |
| GPU Performance Portability needs Autotuning | Apr 30, 2025 | GPU | Code Available | 2 |
| RWKV-X: A Linear Complexity Hybrid Language Model | Apr 30, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Visual Text Processing: A Comprehensive Review and Unified Evaluation | Apr 30, 2025 | Image Manipulation, Image Reconstruction | Code Available | 2 |
| Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey | Apr 29, 2025 | Decision Making, Multi-agent Reinforcement Learning | Code Available | 2 |
| Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views | Apr 29, 2025 | NeRF, Surface Reconstruction | Code Available | 2 |
| UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities | Apr 29, 2025 | Question Answering, RAG | Code Available | 2 |
| GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction | Apr 29, 2025 | 3DGS, 3D Reconstruction | Code Available | 2 |
| RuleKit 2: Faster and simpler rule learning | Apr 29, 2025 | Descriptive | Code Available | 2 |
| Softpick: No Attention Sink, No Massive Activations with Rectified Softmax | Apr 29, 2025 | Quantization | Code Available | 2 |
| Rulebook: bringing co-routines to reinforcement learning environments | Apr 28, 2025 | reinforcement-learning, Reinforcement Learning | Code Available | 2 |
| STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction | Apr 28, 2025 | GPU | Code Available | 2 |
| Adaptive Dual-domain Learning for Underwater Image Enhancement | Apr 27, 2025 | Image Enhancement, UIE | Code Available | 2 |
| BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese | Apr 27, 2025 | Benchmarking, Proper Noun | Code Available | 2 |
| Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions | Apr 27, 2025 | Image Generation, Motion Synthesis | Code Available | 2 |
| Towards Practical Second-Order Optimizers in Deep Learning: Insights from Fisher Information Analysis | Apr 26, 2025 | Computational Efficiency, image-classification | Code Available | 2 |
| SPD Learning for Covariance-Based Neuroimaging Analysis: Perspectives, Methods, and Challenges | Apr 26, 2025 | | Code Available | 2 |
| SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models | Apr 25, 2025 | Spatial Reasoning, Text to 3D | Code Available | 2 |
| DiMeR: Disentangled Mesh Reconstruction Model | Apr 24, 2025 | Image to 3D, model | Code Available | 2 |
| FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models | Apr 24, 2025 | Answer Selection, Information Retrieval | Code Available | 2 |
| GotenNet: Rethinking Efficient 3D Equivariant Graph Neural Networks | Apr 24, 2025 | Atomic Forces, Computational Efficiency | Code Available | 2 |
| LiDPM: Rethinking Point Diffusion for Lidar Scene Completion | Apr 24, 2025 | Lidar Scene Completion | Code Available | 2 |
| CaRL: Learning Scalable Planning Policies with Simple Rewards | Apr 24, 2025 | Autonomous Driving, CARLA longest6 | Code Available | 2 |
| Process Reward Models That Think | Apr 23, 2025 | Math | Code Available | 2 |
| AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling Engine | Apr 23, 2025 | | Code Available | 2 |
| Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark | Apr 23, 2025 | | Code Available | 2 |
| Dynamic Early Exit in Reasoning Models | Apr 22, 2025 | GSM8K, Math | Code Available | 2 |
| CAPO: Cost-Aware Prompt Optimization | Apr 22, 2025 | Arithmetic Reasoning, AutoML | Code Available | 2 |