| Sequencer: Deep LSTM for Image Classification | May 4, 2022 | Domain Generalization, image-classification | Code Available | 5 |
| OPT: Open Pre-trained Transformer Language Models | May 2, 2022 | Decoder, Hate Speech Detection | Code Available | 5 |
| GAM(e) changer or not? An evaluation of interpretable machine learning models based on additive model constraints | Apr 19, 2022 | Additive models, Explainable artificial intelligence | Code Available | 5 |
| MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction | Apr 17, 2022 | Image Restoration, Spectral Reconstruction | Code Available | 5 |
| BigDL 2.0: Seamless Scaling of AI Pipelines from Laptops to Distributed Cluster | Apr 3, 2022 | AutoML, Distributed Computing | Code Available | 5 |
| WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit | Mar 29, 2022 | Decoder, Language Modelling | Code Available | 5 |
| SuperAnimal pretrained pose estimation models for behavioral analysis | Mar 14, 2022 | 2D Pose Estimation, Animal Pose Estimation | Code Available | 5 |
| BERTopic: Neural topic modeling with a class-based TF-IDF procedure | Mar 11, 2022 | Clustering, Document Embedding | Code Available | 5 |
| Slicing Aided Hyper Inference and Fine-tuning for Small Object Detection | Feb 14, 2022 | Object, object-detection | Code Available | 5 |
| On Neural Differential Equations | Feb 4, 2022 | Irregular Time Series, Symbolic Regression | Code Available | 5 |
| Flashlight: Enabling Innovation in Tools for Machine Learning | Jan 29, 2022 | BIG-bench Machine Learning | Code Available | 5 |
| BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Jan 28, 2022 | Image Captioning, Image-text matching | Code Available | 5 |
| Visual Identification of Problematic Bias in Large Label Spaces | Jan 17, 2022 | Fairness | Code Available | 5 |
| AugLy: Data Augmentations for Robustness | Jan 17, 2022 | Adversarial Robustness, Data Augmentation | Code Available | 5 |
| A ConvNet for the 2020s | Jan 10, 2022 | Classification, Domain Generalization | Code Available | 5 |
| Hyperagents | Mar 19, 2026 | | Unverified | 4 |
| MOSS-TTS Technical Report | Mar 18, 2026 | | Unverified | 4 |
| Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery | Mar 18, 2026 | | Unverified | 4 |
| Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching | Mar 17, 2026 | | Unverified | 4 |
| MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator | Mar 16, 2026 | | Unverified | 4 |
| Precise Object and Effect Removal with Adaptive Target-Aware Attention | Mar 16, 2026 | | Unverified | 4 |
| SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks | Mar 13, 2026 | | Unverified | 4 |
| On the Theoretical Limitations of Embedding-Based Retrieval | Mar 12, 2026 | | Unverified | 4 |
| Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills | Mar 9, 2026 | | Unverified | 4 |
| MotionStream: Real-Time Video Generation with Interactive Motion Controls | Mar 5, 2026 | | Unverified | 4 |
| Utonia: Toward One Encoder for All Point Clouds | Mar 3, 2026 | | Unverified | 4 |
| TTT3R: 3D Reconstruction as Test-Time Training | Mar 3, 2026 | | Unverified | 4 |
| OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens | Mar 2, 2026 | | Unverified | 4 |
| UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers | Mar 1, 2026 | | Unverified | 4 |
| Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations | Feb 28, 2026 | | Unverified | 4 |
| A Pragmatic VLA Foundation Model | Feb 26, 2026 | | Unverified | 4 |
| SkillNet: Create, Evaluate, and Connect AI Skills | Feb 26, 2026 | | Unverified | 4 |
| Cautious Weight Decay | Feb 24, 2026 | | Unverified | 4 |
| ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models | Feb 18, 2026 | | Unverified | 4 |
| Reinforcement Learning via Self-Distillation | Feb 16, 2026 | | Unverified | 4 |
| R-Zero: Self-Evolving Reasoning LLM from Zero Data | Feb 13, 2026 | | Unverified | 4 |
| VideoWorld 2: Learning Transferable Knowledge from Real-world Videos | Feb 10, 2026 | | Unverified | 4 |
| MOVA: Towards Scalable and Synchronized Video-Audio Generation | Feb 10, 2026 | | Unverified | 4 |
| Unified Personalized Reward Model for Vision Generation | Feb 10, 2026 | | Unverified | 4 |
| QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining | Feb 6, 2026 | | Unverified | 4 |
| Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation | Feb 6, 2026 | | Unverified | 4 |
| AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research | Feb 6, 2026 | | Unverified | 4 |
| DFlash: Block Diffusion for Flash Speculative Decoding | Feb 5, 2026 | | Unverified | 4 |
| Learning to Discover at Test Time | Feb 5, 2026 | | Unverified | 4 |
| On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models | Feb 3, 2026 | | Unverified | 4 |
| Closing the Loop: Universal Repository Representation with RPG-Encoder | Feb 3, 2026 | | Unverified | 4 |
| SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations | Feb 2, 2026 | | Unverified | 4 |
| Causal World Modeling for Robot Control | Jan 29, 2026 | | Unverified | 4 |
| Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models | Jan 29, 2026 | | Unverified | 4 |
| Masked Depth Modeling for Spatial Perception | Jan 25, 2026 | | Unverified | 4 |