| Block-Recurrent Transformers | Mar 11, 2022 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Global birdsong embeddings enable superior transfer learning for bioacoustic classification | Jul 12, 2023 | Audio Classification, Decision Making | Code Available | 2 | 5 |
| Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture | Jan 19, 2023 | Depth Estimation, Depth Prediction | Code Available | 2 | 5 |
| MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism | Mar 3, 2025 | Object Detection | Code Available | 2 | 5 |
| TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks | Sep 16, 2020 | Anomaly Detection, Benchmarking | Code Available | 2 | 5 |
| When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset | Jul 14, 2024 | 3D Object Detection, Multispectral Object Detection | Code Available | 2 | 5 |
| Differentiable All-pole Filters for Time-varying Audio Systems | Apr 11, 2024 | All, Audio Effects Modeling | Code Available | 2 | 5 |
| TorchOpt: An Efficient Library for Differentiable Optimization | Nov 13, 2022 | CPU, GPU | Code Available | 2 | 5 |
| Progressive Growing of GANs for Improved Quality, Stability, and Variation | Oct 27, 2017 | Face Generation, Image Generation | Code Available | 2 | 5 |
| CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities | Aug 23, 2024 | Denoising, Motion Generation | Code Available | 2 | 5 |
| ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization | May 16, 2025 | All, DeepFake Detection | Code Available | 2 | 5 |
| Cross-Task Generalization via Natural Language Crowdsourcing Instructions | Apr 18, 2021 | Question Answering | Code Available | 2 | 5 |
| MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control | Jan 4, 2025 | Attribute, Denoising | Code Available | 2 | 5 |
| From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization | Mar 2, 2025 | Cross-Modal Person Re-Identification, Person Re-Identification | Code Available | 2 | 5 |
| Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions | Jun 9, 2025 | Large Language Model, Reinforcement Learning (RL) | Code Available | 2 | 5 |
| UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery | Jan 3, 2025 | object-detection, Object Detection | Code Available | 2 | 5 |
| RWKV-UNet: Improving UNet with Long-Range Cooperation for Effective Medical Image Segmentation | Jan 14, 2025 | Computational Efficiency, Image Segmentation | Code Available | 2 | 5 |
| Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning | Oct 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| You Only Segment Once: Towards Real-Time Panoptic Segmentation | Mar 26, 2023 | Decoder, Panoptic Segmentation | Code Available | 2 | 5 |
| GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation | Dec 4, 2023 | Novel View Synthesis | Code Available | 2 | 5 |
| Supervised Learning for Analog and RF Circuit Design: Benchmarks and Comparative Insights | Jan 21, 2025 | | Code Available | 2 | 5 |
| Synchformer: Efficient Synchronization from Sparse Cues | Jan 29, 2024 | Audio-Visual Synchronization | Code Available | 2 | 5 |
| GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation | Nov 29, 2023 | 3D Generation, Text to 3D | Code Available | 2 | 5 |
| Implicit Diffusion Models for Continuous Super-Resolution | Mar 29, 2023 | Denoising, Image Super-Resolution | Code Available | 2 | 5 |
| Spherical Transformer for LiDAR-based 3D Recognition | Mar 22, 2023 | 3D Object Detection, 3D Semantic Segmentation | Code Available | 2 | 5 |