| STPLS3D: A Large-Scale Synthetic and Real Aerial Photogrammetry 3D Point Cloud Dataset | Mar 17, 2022 | 3D Instance Segmentation3D Semantic Segmentation | CodeCode Available | 2 |
| Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation | Mar 22, 2022 | Stereo Matching | CodeCode Available | 2 |
| UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection | Mar 23, 2022 | DecoderHighlight Detection | CodeCode Available | 2 |
| FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset | Mar 26, 2022 | 2k3D Face Reconstruction | CodeCode Available | 2 |
| Implementation of an Automated Learning System for Non-experts | Mar 26, 2022 | BIG-bench Machine LearningManagement | CodeCode Available | 2 |
| Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared Small Target Detection | Dec 22, 2024 | | CodeCode Available | 2 |
| Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data | Mar 30, 2022 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| CoRT: Code-integrated Reasoning within Thinking | Jun 11, 2025 | Mathematical Reasoning | CodeCode Available | 2 |
| DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection | Apr 12, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Localization Distillation for Object Detection | Apr 12, 2022 | Knowledge DistillationObject | CodeCode Available | 2 |
| ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers | Apr 20, 2022 | DisentanglementSelf-Supervised Learning | CodeCode Available | 2 |
| MuGER^2: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering | Oct 19, 2022 | NavigateQuestion Answering | CodeCode Available | 2 |
| BCI: Breast Cancer Immunohistochemical Image Generation through Pyramid Pix2pix | Apr 25, 2022 | Breast Cancer DetectionBreast Cancer Histology Image Classification | CodeCode Available | 2 |
| EMOCA: Emotion Driven Monocular Face Capture and Animation | Apr 24, 2022 | 3D Face Reconstruction3D geometry | CodeCode Available | 2 |
| Satellite Image Time Series Analysis for Big Earth Observation Data | Apr 24, 2022 | BIG-bench Machine LearningCloud Computing | CodeCode Available | 2 |
| AutoFi: Towards Automatic WiFi Human Sensing via Geometric Self-Supervised Learning | Apr 12, 2022 | Activity RecognitionDomain Adaptation | CodeCode Available | 2 |
| Self-focusing virtual screening with active design space pruning | May 3, 2022 | | CodeCode Available | 2 |
| ZIPA: A family of efficient models for multilingual phone recognition | May 29, 2025 | Diversity | CodeCode Available | 2 |
| Symphony Generation with Permutation Invariant Language Model | May 10, 2022 | Audio GenerationDecoder | CodeCode Available | 2 |
| Surface Representation for Point Clouds | May 11, 2022 | 3D Object Detection3D Point Cloud Classification | CodeCode Available | 2 |
| Learning A Sparse Transformer Network for Effective Image Deraining | Mar 21, 2023 | Image ReconstructionImage Restoration | CodeCode Available | 2 |
| Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models | Aug 19, 2022 | ImputationMissing Values | CodeCode Available | 2 |
| Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets | May 15, 2022 | Drug DesignGraph Neural Network | CodeCode Available | 2 |
| BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving | May 19, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Torchhd: An Open Source Python Library to Support Research on Hyperdimensional Computing and Vector Symbolic Architectures | May 18, 2022 | | CodeCode Available | 2 |
| OnePose: One-Shot Object Pose Estimation without CAD Models | May 24, 2022 | 6D Pose EstimationGraph Attention | CodeCode Available | 2 |
| Relighting4D: Neural Relightable Human from Videos | Jul 14, 2022 | | CodeCode Available | 2 |
| Weakly-supervised Audio Separation via Bi-modal Semantic Similarity | Apr 2, 2024 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 2 |
| TimePro: Efficient Multivariate Long-term Time Series Forecasting with Variable- and Time-Aware Hyper-state | May 27, 2025 | MambaTime Series | CodeCode Available | 2 |
| Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation | May 27, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| Improved Vector Quantized Diffusion Models | May 31, 2022 | DenoisingImage Generation | CodeCode Available | 2 |
| Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks | Jul 3, 2025 | Instruction Following | CodeCode Available | 2 |
| Structured prompt interrogation and recursive extraction of semantics (SPIRES): A method for populating knowledge bases using zero-shot learning | Apr 5, 2023 | Relation ExtractionZero-Shot Learning | CodeCode Available | 2 |
| Human-AI Shared Control via Policy Dissection | May 31, 2022 | Autonomous DrivingReinforcement Learning (RL) | CodeCode Available | 2 |
| Improving Diffusion Models for Inverse Problems using Manifold Constraints | Jun 2, 2022 | ColorizationImage Inpainting | CodeCode Available | 2 |
| HairMapper: Removing Hair From Portraits Using GANs | Jan 1, 2022 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 2 |
| Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning | Jun 6, 2022 | Self-Supervised LearningSurvival Prediction | CodeCode Available | 2 |
| Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation | Jan 29, 2025 | Red TeamingSafety Alignment | CodeCode Available | 2 |
| SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning | Jun 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world | Jun 20, 2022 | Imitation Learning | CodeCode Available | 2 |
| tntorch: Tensor Network Learning with PyTorch | Jun 22, 2022 | GPUtensor algebra | CodeCode Available | 2 |
| Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs | Jun 23, 2022 | Graph AttentionGraph Neural Network | CodeCode Available | 2 |
| General Scene Adaptation for Vision-and-Language Navigation | Jan 29, 2025 | DiversityVision and Language Navigation | CodeCode Available | 2 |
| eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers | Nov 2, 2022 | Image GenerationText-to-Image Generation | CodeCode Available | 2 |
| PVO: Panoptic Visual Odometry | Jul 4, 2022 | Camera Pose EstimationOptical Flow Estimation | CodeCode Available | 2 |
| Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters | Jul 4, 2022 | Autonomous DrivingScene Segmentation | CodeCode Available | 2 |
| Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation | Jul 5, 2022 | Autonomous DrivingCollision Avoidance | CodeCode Available | 2 |
| DadmaTools: Natural Language Processing Toolkit for Persian Language | Jul 1, 2022 | ChunkingConstituency Parsing | CodeCode Available | 2 |
| Pretraining a Neural Network before Knowing Its Architecture | Jul 20, 2022 | Diversity | CodeCode Available | 2 |
| Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion Models | Jul 29, 2022 | DenoisingImage Restoration | CodeCode Available | 2 |