| Tracking Objects as Points | Apr 2, 2020 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 2 |
| A Survey on Neural Topic Models: Methods, Applications, and Challenges | Jan 27, 2024 | SurveyTopic Models | CodeCode Available | 2 |
| FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models | May 5, 2025 | BenchmarkingMathematical Reasoning | CodeCode Available | 2 |
| Frame-level Prediction of Facial Expressions, Valence, Arousal and Action Units for Mobile Devices | Mar 25, 2022 | Arousal EstimationEmotion Recognition | CodeCode Available | 2 |
| Improved iterative methods for solving risk parity portfolio | Feb 28, 2022 | | CodeCode Available | 2 |
| SEED-Bench-2: Benchmarking Multimodal Large Language Models | Nov 28, 2023 | BenchmarkingImage Generation | CodeCode Available | 2 |
| SSSegmenation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch | May 26, 2023 | Image SegmentationSegmentation | CodeCode Available | 2 |
| Multiscale Positive-Unlabeled Detection of AI-Generated Texts | May 29, 2023 | Language Modellingtext-classification | CodeCode Available | 2 |
| Learning to Learn with Generative Models of Neural Network Checkpoints | Sep 26, 2022 | | CodeCode Available | 2 |
| ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks | Oct 8, 2019 | Dimensionality Reductionimage-classification | CodeCode Available | 2 |
| Masked Generative Distillation | May 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Fast Multi-Level Foreground Estimation | Jun 26, 2020 | Image Matting | CodeCode Available | 2 |
| DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly Detection | Jun 17, 2023 | Anomaly DetectionContrastive Learning | CodeCode Available | 2 |
| TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation | Feb 8, 2021 | Cardiac SegmentationDecoder | CodeCode Available | 2 |
| NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection | Jul 27, 2023 | 3D geometry3D Object Detection | CodeCode Available | 2 |
| Generalized Parametric Contrastive Learning | Sep 26, 2022 | Contrastive LearningDomain Generalization | CodeCode Available | 2 |
| Fine-Tuned Language Models Generate Stable Inorganic Materials as Text | Feb 6, 2024 | | CodeCode Available | 2 |
| Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers | Dec 31, 2020 | DecoderMedical Image Segmentation | CodeCode Available | 2 |
| Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Aug 1, 2024 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| OpenFACADES: An Open Framework for Architectural Caption and Attribute Data Enrichment via Street View Imagery | Apr 1, 2025 | Attribute | CodeCode Available | 2 |
| DarkGS: Learning Neural Illumination and 3D Gaussians Relighting for Robotic Exploration in the Dark | Mar 16, 2024 | | CodeCode Available | 2 |
| TranAD: Deep Transformer Networks for Anomaly Detection in Multivariate Time Series Data | Jan 18, 2022 | Anomaly DetectionMeta-Learning | CodeCode Available | 2 |
| Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning | May 20, 2024 | BenchmarkingMRI segmentation | CodeCode Available | 2 |
| ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning | Dec 11, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 2 |
| MetaFormer Is Actually What You Need for Vision | Nov 22, 2021 | Image ClassificationObject Detection | CodeCode Available | 2 |
| PhyX: Does Your Model Have the "Wits" for Physical Reasoning? | May 21, 2025 | | CodeCode Available | 2 |
| Variational Bayesian Last Layers | Apr 17, 2024 | Out-of-Distribution DetectionVariational Inference | CodeCode Available | 2 |
| Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis | Oct 25, 2023 | Text Spotting | CodeCode Available | 2 |
| Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding | Oct 7, 2022 | Chart Question AnsweringDiversity | CodeCode Available | 2 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | Feb 22, 2022 | Object DetectionScene Understanding | CodeCode Available | 2 |
| GhostFaceNets: Lightweight Face Recognition Model From Cheap Operations | Apr 10, 2023 | Face IdentificationFace Recognition | CodeCode Available | 2 |
| SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward | May 22, 2025 | Reinforcement Learning (RL) | CodeCode Available | 2 |
| Tutel: Adaptive Mixture-of-Experts at Scale | Jun 7, 2022 | Mixture-of-ExpertsObject Detection | CodeCode Available | 2 |
| Vivim: a Video Vision Mamba for Medical Video Segmentation | Jan 25, 2024 | Lesion SegmentationMamba | CodeCode Available | 2 |
| MobileOne: An Improved One millisecond Mobile Backbone | Jun 8, 2022 | Efficient Neural NetworkGaze Estimation | CodeCode Available | 2 |
| Sparse R-CNN: End-to-End Object Detection with Learnable Proposals | Nov 25, 2020 | 2D Object DetectionObject | CodeCode Available | 2 |
| MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures | Jul 30, 2022 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| Learning explanations that are hard to vary | Sep 1, 2020 | Memorization | CodeCode Available | 2 |
| Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments | Jul 15, 2022 | blind source separationSpeech Enhancement | CodeCode Available | 2 |
| BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation | Apr 5, 2020 | Real-Time Semantic SegmentationSegmentation | CodeCode Available | 2 |
| A Large Scale Homography Benchmark | Feb 20, 2023 | Homography EstimationSurface Normal Estimation | CodeCode Available | 2 |
| External Knowledge Injection for CLIP-Based Class-Incremental Learning | Mar 11, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 2 |
| NetMamba: Efficient Network Traffic Classification via Pre-training Unidirectional Mamba | May 19, 2024 | ClassificationFew-Shot Learning | CodeCode Available | 2 |
| Mega: Moving Average Equipped Gated Attention | Sep 21, 2022 | Image ClassificationInductive Bias | CodeCode Available | 2 |
| DaViT: Dual Attention Vision Transformers | Apr 7, 2022 | Computational EfficiencyImage Classification | CodeCode Available | 2 |
| AiTLAS: Artificial Intelligence Toolbox for Earth Observation | Jan 21, 2022 | BenchmarkingEarth Observation | CodeCode Available | 2 |
| Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions | Oct 9, 2024 | Semantic Compression | CodeCode Available | 2 |
| T-GCN: A Temporal Graph ConvolutionalNetwork for Traffic Prediction | Nov 12, 2018 | ManagementPrediction | CodeCode Available | 2 |
| CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution | Jan 1, 2024 | DiversityImage Super-Resolution | CodeCode Available | 2 |
| HGRN2: Gated Linear RNNs with State Expansion | Apr 11, 2024 | Image ClassificationLanguage Modeling | CodeCode Available | 2 |