| Sequencer: Deep LSTM for Image Classification | May 4, 2022 | Domain Generalizationimage-classification | CodeCode Available | 5 |
| Differentiable Tree Search Network | Jan 22, 2024 | Decision MakingInductive Bias | CodeCode Available | 5 |
| When Does Perceptual Alignment Benefit Vision Representations? | Oct 14, 2024 | Depth EstimationImage Generation | CodeCode Available | 4 |
| Attention on the Sphere | May 16, 2025 | Depth EstimationImage Segmentation | CodeCode Available | 4 |
| Brain-inspired Multilayer Perceptron with Spiking Neurons | Mar 28, 2022 | Inductive Bias | CodeCode Available | 4 |
| Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Jun 1, 2023 | Audio SynthesisComputational Efficiency | CodeCode Available | 4 |
| Bird-Eye Transformers for Text Generation Models | Oct 8, 2022 | AttributeInductive Bias | CodeCode Available | 3 |
| U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers | May 4, 2024 | Image GenerationInductive Bias | CodeCode Available | 3 |
| Unveiling Encoder-Free Vision-Language Models | Jun 17, 2024 | DecoderInductive Bias | CodeCode Available | 3 |
| Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation | May 30, 2024 | DiversityDrug Design | CodeCode Available | 3 |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Jun 9, 2022 | Audio GenerationAudio Synthesis | CodeCode Available | 3 |
| Neural Message Passing Induced by Energy-Constrained Diffusion | Sep 13, 2024 | Inductive Bias | CodeCode Available | 3 |
| SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series | Mar 22, 2024 | Inductive BiasMamba | CodeCode Available | 3 |
| AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers | Jan 11, 2023 | DenoisingInductive Bias | CodeCode Available | 2 |
| TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images | Jan 30, 2022 | Brain Tumor SegmentationImage Segmentation | CodeCode Available | 2 |
| Video Swin Transformer | Jun 24, 2021 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis | Jun 6, 2024 | DecoderInductive Bias | CodeCode Available | 2 |
| A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases | Sep 22, 2022 | Inductive Bias | CodeCode Available | 2 |
| Time Series Diffusion in the Frequency Domain | Feb 8, 2024 | DenoisingInductive Bias | CodeCode Available | 2 |
| ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond | Feb 21, 2022 | Image ClassificationInductive Bias | CodeCode Available | 2 |
| Neural Markov Random Field for Stereo Matching | Mar 17, 2024 | Domain GeneralizationInductive Bias | CodeCode Available | 2 |
| Masked Autoencoders As Spatiotemporal Learners | May 18, 2022 | Inductive BiasRepresentation Learning | CodeCode Available | 2 |
| Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Oct 15, 2024 | DisentanglementInductive Bias | CodeCode Available | 2 |
| Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation | Jun 10, 2025 | FoveationImage Segmentation | CodeCode Available | 2 |
| Longitudinal Segmentation of MS Lesions via Temporal Difference Weighting | Sep 20, 2024 | Inductive BiasLesion Detection | CodeCode Available | 2 |
| Is Attention All That NeRF Needs? | Jul 27, 2022 | AllGeneralizable Novel View Synthesis | CodeCode Available | 2 |
| XTrack: Multimodal Training Boosts RGB-X Video Object Trackers | May 28, 2024 | Inductive BiasMixture-of-Experts | CodeCode Available | 2 |
| Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation | Aug 27, 2021 | Inductive BiasPlaying the Game of 2048 | CodeCode Available | 2 |
| Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges | Apr 24, 2024 | Drug DesignInductive Bias | CodeCode Available | 2 |
| NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation | Apr 17, 2025 | Data AugmentationDiversity | CodeCode Available | 2 |
| Gradformer: Graph Transformer with Exponential Decay | Apr 24, 2024 | Graph ClassificationGraph Neural Network | CodeCode Available | 2 |
| DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting | Jun 3, 2023 | Computational EfficiencyInductive Bias | CodeCode Available | 2 |
| Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders | Aug 19, 2023 | Inductive BiasMotion Forecasting | CodeCode Available | 2 |
| Hidden Biases of End-to-End Driving Models | Jun 13, 2023 | Autonomous DrivingBench2Drive | CodeCode Available | 2 |
| Differentiable Convex Optimization Layers | Oct 28, 2019 | Inductive Bias | CodeCode Available | 2 |
| Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning | Mar 19, 2024 | Inductive BiasReinforcement Learning (RL) | CodeCode Available | 2 |
| A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and Benchmark | Feb 28, 2022 | Image SegmentationInductive Bias | CodeCode Available | 2 |
| Global Context Vision Transformers | Jun 20, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Discrete Event, Continuous Time RNNs | Oct 11, 2017 | Inductive BiasRetrieval | CodeCode Available | 2 |
| Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology | Sep 14, 2024 | Inductive BiasPrognosis | CodeCode Available | 2 |
| Learning General-Purpose Biomedical Volume Representations using Randomized Synthesis | Nov 4, 2024 | Contrastive LearningDiversity | CodeCode Available | 2 |
| Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts | Jul 7, 2025 | Inductive BiasMixture-of-Experts | CodeCode Available | 2 |
| Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration | May 31, 2024 | Deformable Medical Image RegistrationImage Registration | CodeCode Available | 2 |
| Mega: Moving Average Equipped Gated Attention | Sep 21, 2022 | Image ClassificationInductive Bias | CodeCode Available | 2 |
| Model-Based Imitation Learning for Urban Driving | Oct 14, 2022 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Multi-Task Learning as Multi-Objective Optimization | Oct 10, 2018 | Depth EstimationGeneral Classification | CodeCode Available | 2 |
| Learning Deep Time-index Models for Time Series Forecasting | Jul 13, 2022 | Inductive BiasMeta-Learning | CodeCode Available | 2 |
| DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer | Jul 10, 2022 | FormInductive Bias | CodeCode Available | 2 |
| Prototypical Networks for Few-shot Learning | Mar 15, 2017 | Category-Agnostic Pose EstimationFew-Shot Image Classification | CodeCode Available | 2 |
| HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical Image Classification | Sep 21, 2022 | Classificationimage-classification | CodeCode Available | 2 |