| SoftGroup for 3D Instance Segmentation on Point Clouds | Mar 3, 2022 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 2 |
| Freeform Body Motion Generation from Speech | Mar 4, 2022 | DiversityMotion Generation | CodeCode Available | 2 |
| Class-incremental Learning for Time Series: Benchmark and Evaluation | Feb 19, 2024 | Activity RecognitionBenchmarking | CodeCode Available | 2 |
| SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection | May 16, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| Diffusion-based Generation, Optimization, and Planning in 3D Scenes | Jan 15, 2023 | DenoisingGrasp Generation | CodeCode Available | 2 |
| Real-time Object Detection for Streaming Perception | Mar 23, 2022 | Autonomous DrivingObject | CodeCode Available | 2 |
| Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation? | Nov 6, 2024 | | CodeCode Available | 2 |
| Latent Modulated Function for Computational Optimal Continuous Image Representation | Apr 25, 2024 | Computational EfficiencySuper-Resolution | CodeCode Available | 2 |
| ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | May 31, 2024 | 3DGSImage Compression | CodeCode Available | 2 |
| FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations | Sep 9, 2024 | Federated LearningPrivacy Preserving | CodeCode Available | 2 |
| Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results | Apr 7, 2022 | Image ClassificationKnowledge Distillation | CodeCode Available | 2 |
| Privacy Backdoors: Stealing Data with Corrupted Pretrained Models | Mar 30, 2024 | | CodeCode Available | 2 |
| A Keypoint-based Global Association Network for Lane Detection | Apr 15, 2022 | Keypoint EstimationLane Detection | CodeCode Available | 2 |
| LifeGPT: Topology-Agnostic Generative Pretrained Transformer Model for Cellular Automata | Sep 3, 2024 | Large Language Model | CodeCode Available | 2 |
| SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks | Jul 24, 2022 | | CodeCode Available | 2 |
| CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification | Apr 29, 2022 | AttributeClassification | CodeCode Available | 2 |
| ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation | Apr 28, 2022 | 3D ReconstructionObject | CodeCode Available | 2 |
| An Extensive Data Processing Pipeline for MIMIC-IV | Apr 29, 2022 | modelTime Series Analysis | CodeCode Available | 2 |
| Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching | May 6, 2022 | Ontology Matching | CodeCode Available | 2 |
| Lion: Adversarial Distillation of Proprietary Large Language Models | May 22, 2023 | Instruction FollowingKnowledge Distillation | CodeCode Available | 2 |
| Vision-based Anti-UAV Detection and Tracking | May 22, 2022 | | CodeCode Available | 2 |
| Feature Mapping in Physics-Informed Neural Networks (PINNs) | Feb 10, 2024 | 10-shot image generation | CodeCode Available | 2 |
| CoNT: Contrastive Neural Text Generation | May 29, 2022 | Code Comment GenerationComment Generation | CodeCode Available | 2 |
| Rethinking Graph Neural Networks for Anomaly Detection | May 31, 2022 | Anomaly DetectionGraph Anomaly Detection | CodeCode Available | 2 |
| Continual Forgetting for Pre-trained Vision Models | Mar 18, 2024 | Continual ForgettingFace Recognition | CodeCode Available | 2 |
| Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet | Oct 7, 2024 | DenoisingSpeech Denoising | CodeCode Available | 2 |
| HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction | Jan 1, 2022 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| StyTr2: Image Style Transfer With Transformers | Jan 1, 2022 | DecoderStyle Transfer | CodeCode Available | 2 |
| PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular Videos | Jun 15, 2022 | 3D Plane Detection | CodeCode Available | 2 |
| BokehMe: When Neural Rendering Meets Classical Rendering | Jun 25, 2022 | Neural Rendering | CodeCode Available | 2 |
| Parametric and Multivariate Uncertainty Calibration for Regression and Object Detection | Jul 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| HierarchicalForecast: A Reference Framework for Hierarchical Forecasting in Python | Jul 7, 2022 | BIG-bench Machine LearningDecision Making | CodeCode Available | 2 |
| Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics | Jul 11, 2022 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 2 |
| Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning | Jul 11, 2022 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| SenseFi: A Library and Benchmark on Deep-Learning-Empowered WiFi Human Sensing | Jul 16, 2022 | Activity RecognitionDeep Learning | CodeCode Available | 2 |
| ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild | Jul 19, 2022 | Camera Pose EstimationMotion Segmentation | CodeCode Available | 2 |
| CGVQM+D: Computer Graphics Video Quality Metric and Dataset | Jun 13, 2025 | DenoisingNovel View Synthesis | CodeCode Available | 2 |
| Language Models Can Teach Themselves to Program Better | Jul 29, 2022 | Code Generation | CodeCode Available | 2 |
| Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours | Aug 2, 2022 | Text Classification | CodeCode Available | 2 |
| MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth | Aug 4, 2022 | 3D ReconstructionPoint Clouds | CodeCode Available | 2 |
| No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects | Aug 7, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Sample-Efficient Diffusion for Text-To-Speech Synthesis | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CitySim: A Drone-Based Vehicle Trajectory Dataset for Safety Oriented Research and Digital Twins | Aug 23, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians | Dec 4, 2023 | Face Model | CodeCode Available | 2 |
| GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis | Dec 4, 2023 | 2kDepth Estimation | CodeCode Available | 2 |
| Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data | Aug 4, 2023 | Question AnsweringVisual Question Answering | CodeCode Available | 2 |
| CW-ERM: Improving Autonomous Driving Planning with Closed-loop Weighted Empirical Risk Minimization | Oct 5, 2022 | Autonomous DrivingImitation Learning | CodeCode Available | 2 |
| Text Detection Forgot About Document OCR | Oct 14, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 2 |
| Thinking Image Color Aesthetics Assessment: Models, Datasets and Benchmarks | Jan 1, 2023 | Image Quality Assessment | CodeCode Available | 2 |
| NVIDIA FLARE: Federated Learning from Simulation to Real-World | Oct 24, 2022 | Federated LearningPrivacy Preserving | CodeCode Available | 2 |