| Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection | May 12, 2025 | Anomaly Detection | CodeCode Available | 2 | 5 |
| A Tutorial on Structural Identifiability of Epidemic Models Using StructuralIdentifiability.jl | May 15, 2025 | parameter estimation | CodeCode Available | 2 | 5 |
| DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy | May 16, 2025 | Reinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning | May 19, 2025 | | CodeCode Available | 2 | 5 |
| CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming | May 19, 2025 | FairnessLarge Language Model | CodeCode Available | 2 | 5 |
| Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos | May 19, 2025 | 3D geometryCamera Pose Estimation | CodeCode Available | 2 | 5 |
| Neurosymbolic Diffusion Models | May 19, 2025 | Autonomous DrivingUncertainty Quantification | CodeCode Available | 2 | 5 |
| Temporal Query Network for Efficient Multivariate Time Series Forecasting | May 19, 2025 | Correlated Time Series ForecastingMultivariate Time Series Forecasting | CodeCode Available | 2 | 5 |
| Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space | May 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens | May 20, 2025 | | CodeCode Available | 2 | 5 |
| KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation | May 20, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 | 5 |
| MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis | Jul 10, 2024 | GPUImage Generation | CodeCode Available | 2 | 5 |
| QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design | May 22, 2025 | CPUGPU | CodeCode Available | 2 | 5 |
| Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models | May 22, 2025 | Reinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| LiteCUA: Computer as MCP Server for Computer-Use Agent on AIOS | May 24, 2025 | | CodeCode Available | 2 | 5 |
| Improved Immiscible Diffusion: Accelerate Diffusion Training by Reducing Its Miscibility | May 24, 2025 | Denoising | CodeCode Available | 2 | 5 |
| Shifting AI Efficiency From Model-Centric to Data-Centric Compression | May 25, 2025 | Position | CodeCode Available | 2 | 5 |
| DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue | May 26, 2025 | DiagnosticQuestion Answering | CodeCode Available | 2 | 5 |
| Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression | May 26, 2025 | Zero-shot Generalization | CodeCode Available | 2 | 5 |
| Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects | May 26, 2025 | Autonomous DrivingLogical Reasoning | CodeCode Available | 2 | 5 |
| Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model | May 29, 2025 | DecoderImage Generation | CodeCode Available | 2 | 5 |
| Aligning Modalities in Vision Large Language Models via Preference Fine-tuning | Feb 18, 2024 | HallucinationInstruction Following | CodeCode Available | 2 | 5 |
| Vision Language Models are Biased | May 29, 2025 | Board Gamescounterfactual | CodeCode Available | 2 | 5 |
| Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting | Jun 5, 2025 | Autonomous DrivingNeRF | CodeCode Available | 2 | 5 |
| CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale | Jun 3, 2025 | Large Language Model | CodeCode Available | 2 | 5 |
| GSCodec Studio: A Modular Framework for Gaussian Splat Compression | Jun 2, 2025 | Benchmarking | CodeCode Available | 2 | 5 |
| MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection | Jun 12, 2024 | Computational EfficiencySelf-Supervised Learning | CodeCode Available | 2 | 5 |
| ShiftwiseConv: Small Convolutional Kernel with Large Kernel Effect | Jan 1, 2025 | | CodeCode Available | 2 | 5 |
| Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning | Jun 10, 2025 | Model SelectionReinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| Institutional Books 1.0: A 242B token dataset from Harvard Library's collections, refined for accuracy and usability | Jun 10, 2025 | Optical Character Recognition (OCR) | CodeCode Available | 2 | 5 |
| Do MIL Models Transfer? | Jun 10, 2025 | Multiple Instance LearningTransfer Learning | CodeCode Available | 2 | 5 |
| SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis | Jun 12, 2025 | BenchmarkingDialogue Generation | CodeCode Available | 2 | 5 |
| Vision Transformers Don't Need Trained Registers | Jun 9, 2025 | | CodeCode Available | 2 | 5 |
| AutoMind: Adaptive Knowledgeable Agent for Automated Data Science | Jun 12, 2025 | Code GenerationLarge Language Model | CodeCode Available | 2 | 5 |
| UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents | May 27, 2025 | 16k | CodeCode Available | 2 | 5 |
| IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic Environments | Jun 11, 2025 | Benchmarking | CodeCode Available | 2 | 5 |
| VerIF: Verification Engineering for Reinforcement Learning in Instruction Following | Jun 11, 2025 | Instruction Followingreinforcement-learning | CodeCode Available | 2 | 5 |
| Solving the Job Shop Scheduling Problem with Graph Neural Networks: A Customizable Reinforcement Learning Environment | Jun 10, 2025 | Combinatorial OptimizationImitation Learning | CodeCode Available | 2 | 5 |
| AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory Computing | Jun 23, 2025 | Neural Architecture SearchQuantization | CodeCode Available | 2 | 5 |
| Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Jun 23, 2025 | GPULarge Language Model | CodeCode Available | 2 | 5 |
| Towards In-the-wild 3D Plane Reconstruction from a Single Image | Jun 3, 2025 | 3D Plane Detection | CodeCode Available | 2 | 5 |
| Test3R: Learning to Reconstruct 3D at Test Time | Jun 16, 2025 | 3D ReconstructionDepth Estimation | CodeCode Available | 2 | 5 |
| Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends | Jun 26, 2025 | Action GenerationVision-Language-Action | CodeCode Available | 2 | 5 |
| Flow-Anchored Consistency Models | Jul 4, 2025 | Image Generation | CodeCode Available | 2 | 5 |
| Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion | Jul 8, 2025 | 3D geometryDomain Generalization | CodeCode Available | 2 | 5 |
| EAMamba: Efficient All-Around Vision State Space Model for Image Restoration | Jun 27, 2025 | AllDeblurring | CodeCode Available | 2 | 5 |
| Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Jul 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| CaRL: Learning Scalable Planning Policies with Simple Rewards | Apr 24, 2025 | Autonomous DrivingCARLA longest6 | CodeCode Available | 2 | 5 |
| HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model | Mar 17, 2025 | Image SegmentationSegmentation | CodeCode Available | 2 | 5 |