| Vivim: a Video Vision Mamba for Medical Video Segmentation | Jan 25, 2024 | Lesion SegmentationMamba | CodeCode Available | 2 | 5 |
| MobileOne: An Improved One millisecond Mobile Backbone | Jun 8, 2022 | Efficient Neural NetworkGaze Estimation | CodeCode Available | 2 | 5 |
| Sparse R-CNN: End-to-End Object Detection with Learnable Proposals | Nov 25, 2020 | 2D Object DetectionObject | CodeCode Available | 2 | 5 |
| MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures | Jul 30, 2022 | NeRFNovel View Synthesis | CodeCode Available | 2 | 5 |
| Learning explanations that are hard to vary | Sep 1, 2020 | Memorization | CodeCode Available | 2 | 5 |
| Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments | Jul 15, 2022 | blind source separationSpeech Enhancement | CodeCode Available | 2 | 5 |
| BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation | Apr 5, 2020 | Real-Time Semantic SegmentationSegmentation | CodeCode Available | 2 | 5 |
| A Large Scale Homography Benchmark | Feb 20, 2023 | Homography EstimationSurface Normal Estimation | CodeCode Available | 2 | 5 |
| External Knowledge Injection for CLIP-Based Class-Incremental Learning | Mar 11, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 2 | 5 |
| NetMamba: Efficient Network Traffic Classification via Pre-training Unidirectional Mamba | May 19, 2024 | ClassificationFew-Shot Learning | CodeCode Available | 2 | 5 |
| Mega: Moving Average Equipped Gated Attention | Sep 21, 2022 | Image ClassificationInductive Bias | CodeCode Available | 2 | 5 |
| DaViT: Dual Attention Vision Transformers | Apr 7, 2022 | Computational EfficiencyImage Classification | CodeCode Available | 2 | 5 |
| AiTLAS: Artificial Intelligence Toolbox for Earth Observation | Jan 21, 2022 | BenchmarkingEarth Observation | CodeCode Available | 2 | 5 |
| Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions | Oct 9, 2024 | Semantic Compression | CodeCode Available | 2 | 5 |
| T-GCN: A Temporal Graph ConvolutionalNetwork for Traffic Prediction | Nov 12, 2018 | ManagementPrediction | CodeCode Available | 2 | 5 |
| CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution | Jan 1, 2024 | DiversityImage Super-Resolution | CodeCode Available | 2 | 5 |
| HGRN2: Gated Linear RNNs with State Expansion | Apr 11, 2024 | Image ClassificationLanguage Modeling | CodeCode Available | 2 | 5 |
| Earthformer: Exploring Space-Time Transformers for Earth System Forecasting | Jul 12, 2022 | Earth ObservationEarth Surface Forecasting | CodeCode Available | 2 | 5 |
| Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models | Aug 13, 2022 | Deep Learning | CodeCode Available | 2 | 5 |
| Masked-attention Mask Transformer for Universal Image Segmentation | Dec 2, 2021 | 2D Semantic SegmentationImage Segmentation | CodeCode Available | 2 | 5 |
| Cross-Tokenizer Distillation via Approximate Likelihood Matching | Mar 25, 2025 | Large Language Model | CodeCode Available | 2 | 5 |
| SOLOv2: Dynamic and Fast Instance Segmentation | Mar 23, 2020 | Instance Segmentationobject-detection | CodeCode Available | 2 | 5 |
| Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors | Apr 7, 2025 | GPU | CodeCode Available | 2 | 5 |
| MVSTER: Epipolar Transformer for Efficient Multi-View Stereo | Apr 15, 2022 | | CodeCode Available | 2 | 5 |
| Understanding self-supervised Learning Dynamics without Contrastive Pairs | Feb 12, 2021 | Self-Supervised Learning | CodeCode Available | 2 | 5 |
| Motion Transformer with Global Intention Localization and Local Movement Refinement | Sep 27, 2022 | motion predictionPrediction | CodeCode Available | 2 | 5 |
| Well-Read Students Learn Better: On the Importance of Pre-training Compact Models | Aug 23, 2019 | Knowledge DistillationLanguage Modelling | CodeCode Available | 2 | 5 |
| LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models | Nov 28, 2023 | Image CaptioningQuestion Answering | CodeCode Available | 2 | 5 |
| Learning without Forgetting | Jun 29, 2016 | Class Incremental LearningContinual Learning | CodeCode Available | 2 | 5 |
| MetaBox-v2: A Unified Benchmark Platform for Meta-Black-Box Optimization | May 23, 2025 | Meta-Learning | CodeCode Available | 2 | 5 |
| Learning Multi-dimensional Edge Feature-based AU Relation Graph for Facial Action Unit Recognition | May 2, 2022 | Facial Action Unit DetectionRelation | CodeCode Available | 2 | 5 |
| Automated Deep Learning: Neural Architecture Search Is Not the End | Dec 16, 2021 | Deep LearningMachine Translation | CodeCode Available | 2 | 5 |
| Reformer: The Efficient Transformer | Jan 13, 2020 | D4RLImage Generation | CodeCode Available | 2 | 5 |
| SimpleNet: A Simple Network for Image Anomaly Detection and Localization | Mar 27, 2023 | Anomaly ClassificationAnomaly Detection | CodeCode Available | 2 | 5 |
| CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation | May 20, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 | 5 |
| TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights | Oct 6, 2024 | | CodeCode Available | 2 | 5 |
| Language Modelling with Pixels | Jul 14, 2022 | Language ModellingNamed Entity Recognition | CodeCode Available | 2 | 5 |
| Panoptic Scene Graph Generation | Jul 22, 2022 | BenchmarkingPanoptic Scene Graph Generation | CodeCode Available | 2 | 5 |
| EmoBank: Studying the Impact of Annotation Perspective and Representation Format on Dimensional Emotion Analysis | May 4, 2022 | Emotion Recognition | CodeCode Available | 2 | 5 |
| Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level | Jun 22, 2024 | Machine TranslationTranslation | CodeCode Available | 2 | 5 |
| WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing | Jan 24, 2024 | Activity Recognition | CodeCode Available | 2 | 5 |
| Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading | Nov 26, 2024 | Offline RLparameter-efficient fine-tuning | CodeCode Available | 2 | 5 |
| RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models | Sep 26, 2023 | Information RetrievalReranking | CodeCode Available | 2 | 5 |
| SynJax: Structured Probability Distributions for JAX | Aug 7, 2023 | | CodeCode Available | 2 | 5 |
| GPTopic: Dynamic and Interactive Topic Representations | Mar 6, 2024 | | CodeCode Available | 2 | 5 |
| A Length-Extrapolatable Transformer | Dec 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs | Mar 13, 2022 | Image Classification | CodeCode Available | 2 | 5 |
| Democratizing Neural Machine Translation with OPUS-MT | Dec 4, 2022 | Machine TranslationTranslation | CodeCode Available | 2 | 5 |
| Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training | Jan 10, 2024 | | CodeCode Available | 2 | 5 |
| YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection | Feb 14, 2024 | Fracture detectionmedical image detection | CodeCode Available | 2 | 5 |