| Diffusion-TS: Interpretable Diffusion for General Time Series Generation | Mar 4, 2024 | Audio SynthesisDecoder | CodeCode Available | 3 | 5 |
| TapeAgents: a Holistic Framework for Agent Development and Optimization | Dec 11, 2024 | | CodeCode Available | 3 | 5 |
| MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters | Oct 2, 2024 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 3 | 5 |
| DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks | Apr 15, 2025 | | CodeCode Available | 3 | 5 |
| Adversarial Cheap Talk | Nov 20, 2022 | Meta-LearningReinforcement Learning (RL) | CodeCode Available | 3 | 5 |
| Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Jun 6, 2024 | 3D Scene ReconstructionDepth Estimation | CodeCode Available | 3 | 5 |
| EscherNet: A Generative Model for Scalable View Synthesis | Feb 6, 2024 | 3D ReconstructionGPU | CodeCode Available | 3 | 5 |
| 3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering | Jan 9, 2025 | Image GenerationText to Image Generation | CodeCode Available | 3 | 5 |
| Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation | Mar 4, 2025 | Contact-rich ManipulationImitation Learning | CodeCode Available | 3 | 5 |
| Rethinking the Evaluation of Visible and Infrared Image Fusion | Oct 9, 2024 | object-detectionObject Detection | CodeCode Available | 3 | 5 |
| Training Verifiers to Solve Math Word Problems | Oct 27, 2021 | GSM8KMath | CodeCode Available | 3 | 5 |
| Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline | Nov 19, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 3 | 5 |
| Generating Long Sequences with Sparse Transformers | Apr 23, 2019 | DiversityImage Generation | CodeCode Available | 3 | 5 |
| Towards Generalizable Tumor Synthesis | Feb 29, 2024 | Computed Tomography (CT) | CodeCode Available | 3 | 5 |
| Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning | Apr 15, 2025 | Automated Theorem ProvingLarge Language Model | CodeCode Available | 3 | 5 |
| Pipeline Parallelism with Controllable Memory | May 24, 2024 | | CodeCode Available | 3 | 5 |
| SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline | May 25, 2025 | Speech ExtractionSpeech Separation | CodeCode Available | 3 | 5 |
| L0: Reinforcement Learning to Become General Agents | Jun 30, 2025 | Question Answeringreinforcement-learning | CodeCode Available | 3 | 5 |
| MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Oct 12, 2024 | Anomaly Detection | CodeCode Available | 3 | 5 |
| ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood | Sep 14, 2024 | Instruction FollowingText Generation | CodeCode Available | 3 | 5 |
| AdaWorld: Learning Adaptable World Models with Latent Actions | Mar 24, 2025 | Future prediction | CodeCode Available | 3 | 5 |
| SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving | Feb 4, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 3 | 5 |
| cmaes : A Simple yet Practical Python Library for CMA-ES | Feb 2, 2024 | Transfer Learning | CodeCode Available | 3 | 5 |
| Emu: Generative Pretraining in Multimodality | Jul 11, 2023 | Image CaptioningImage Generation | CodeCode Available | 3 | 5 |
| BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement | Dec 16, 2024 | Script GenerationText to 3D | CodeCode Available | 3 | 5 |
| Automatically Interpreting Millions of Features in Large Language Models | Oct 17, 2024 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 3 | 5 |
| GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks | Sep 20, 2024 | AllSinging Voice Synthesis | CodeCode Available | 3 | 5 |
| KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache | Feb 5, 2024 | Quantization | CodeCode Available | 3 | 5 |
| AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents | Jun 19, 2024 | | CodeCode Available | 3 | 5 |
| AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents | Oct 31, 2024 | Benchmarking | CodeCode Available | 3 | 5 |
| HAC++: Towards 100X Compression of 3D Gaussian Splatting | Jan 21, 2025 | 3DGSAttribute | CodeCode Available | 3 | 5 |
| Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation | Sep 27, 2023 | GPUText-to-Video Generation | CodeCode Available | 3 | 5 |
| Deep Reasoning Translation via Reinforcement Learning | Apr 14, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 3 | 5 |
| Segment Anything in 3D with Radiance Fields | Apr 24, 2023 | Inverse RenderingSegmentation | CodeCode Available | 3 | 5 |
| Consistency Flow Matching: Defining Straight Flows with Velocity Consistency | Jul 2, 2024 | Image Generation | CodeCode Available | 3 | 5 |
| PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data | Feb 20, 2025 | Style Transfer | CodeCode Available | 3 | 5 |
| Deep Learning-Based Object Pose Estimation: A Comprehensive Survey | May 13, 2024 | Deep LearningObject | CodeCode Available | 3 | 5 |
| MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion | May 30, 2024 | DenoisingGPU | CodeCode Available | 3 | 5 |
| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Mar 23, 2022 | 4kAction Classification | CodeCode Available | 3 | 5 |
| AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction | Apr 1, 2025 | Image Generation | CodeCode Available | 3 | 5 |
| PE3R: Perception-Efficient 3D Reconstruction | Mar 10, 2025 | 3D ReconstructionZero-shot Generalization | CodeCode Available | 3 | 5 |
| The Mighty ToRR: A Benchmark for Table Reasoning and Robustness | Feb 26, 2025 | | CodeCode Available | 3 | 5 |
| Baichuan-Omni Technical Report | Oct 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments | Sep 9, 2024 | Imitation Learning | CodeCode Available | 3 | 5 |
| RLVR-World: Training World Models with Reinforcement Learning | May 20, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 3 | 5 |
| Tool Learning with Large Language Models: A Survey | May 28, 2024 | Response GenerationSurvey | CodeCode Available | 3 | 5 |
| DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing | Jun 26, 2023 | | CodeCode Available | 3 | 5 |
| Step-level Value Preference Optimization for Mathematical Reasoning | Jun 16, 2024 | Learning-To-RankMath | CodeCode Available | 3 | 5 |
| Middle Architecture Criteria | Apr 27, 2024 | | CodeCode Available | 3 | 5 |
| TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones | Dec 28, 2023 | Computational EfficiencyImage Captioning | CodeCode Available | 3 | 5 |