| Differentiable Tree Search Network | Jan 22, 2024 | Decision Making, Inductive Bias | Code Available | 5 |
| Sequencer: Deep LSTM for Image Classification | May 4, 2022 | Domain Generalization, image-classification | Code Available | 5 |
| Attention on the Sphere | May 16, 2025 | Depth Estimation, Image Segmentation | Code Available | 4 |
| When Does Perceptual Alignment Benefit Vision Representations? | Oct 14, 2024 | Depth Estimation, Image Generation | Code Available | 4 |
| Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Jun 1, 2023 | Audio Synthesis, Computational Efficiency | Code Available | 4 |
| Brain-inspired Multilayer Perceptron with Spiking Neurons | Mar 28, 2022 | Inductive Bias | Code Available | 4 |
| Neural Message Passing Induced by Energy-Constrained Diffusion | Sep 13, 2024 | Inductive Bias | Code Available | 3 |
| Unveiling Encoder-Free Vision-Language Models | Jun 17, 2024 | Decoder, Inductive Bias | Code Available | 3 |
| Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation | May 30, 2024 | Diversity, Drug Design | Code Available | 3 |
| U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers | May 4, 2024 | Image Generation, Inductive Bias | Code Available | 3 |
| SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series | Mar 22, 2024 | Inductive Bias, Mamba | Code Available | 3 |
| Bird-Eye Transformers for Text Generation Models | Oct 8, 2022 | Attribute, Inductive Bias | Code Available | 3 |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Jun 9, 2022 | Audio Generation, Audio Synthesis | Code Available | 3 |
| Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts | Jul 7, 2025 | Inductive Bias, Mixture-of-Experts | Code Available | 2 |
| Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation | Jun 10, 2025 | Foveation, Image Segmentation | Code Available | 2 |
| NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation | Apr 17, 2025 | Data Augmentation, Diversity | Code Available | 2 |
| Learning General-Purpose Biomedical Volume Representations using Randomized Synthesis | Nov 4, 2024 | Contrastive Learning, Diversity | Code Available | 2 |
| Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Oct 15, 2024 | Disentanglement, Inductive Bias | Code Available | 2 |
| Longitudinal Segmentation of MS Lesions via Temporal Difference Weighting | Sep 20, 2024 | Inductive Bias, Lesion Detection | Code Available | 2 |
| Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology | Sep 14, 2024 | Inductive Bias, Prognosis | Code Available | 2 |
| PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration | Jul 14, 2024 | Inductive Bias, Point Cloud Registration | Code Available | 2 |
| Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis | Jun 6, 2024 | Decoder, Inductive Bias | Code Available | 2 |
| Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration | May 31, 2024 | Deformable Medical Image Registration, Image Registration | Code Available | 2 |
| XTrack: Multimodal Training Boosts RGB-X Video Object Trackers | May 28, 2024 | Inductive Bias, Mixture-of-Experts | Code Available | 2 |
| Gradformer: Graph Transformer with Exponential Decay | Apr 24, 2024 | Graph Classification, Graph Neural Network | Code Available | 2 |