| MC-Calib: A generic and robust calibration toolbox for multi-camera systems | Jan 12, 2022 | Camera Calibration | CodeCode Available | 2 |
| UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning | Jan 12, 2022 | Representation Learning | CodeCode Available | 2 |
| pymdp: A Python library for active inference in discrete state spaces | Jan 11, 2022 | Bayesian Inference | CodeCode Available | 2 |
| MobileFaceSwap: A Lightweight Framework for Video Face Swapping | Jan 11, 2022 | Face SwappingKnowledge Distillation | CodeCode Available | 2 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | SentenceSpeech-to-Speech Translation | CodeCode Available | 2 |
| HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video | Jan 11, 2022 | | CodeCode Available | 2 |
| Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond | Jan 10, 2022 | AttributeAutonomous Driving | CodeCode Available | 2 |
| Language-driven Semantic Segmentation | Jan 10, 2022 | DescriptiveFew-Shot Semantic Segmentation | CodeCode Available | 2 |
| Black-Box Tuning for Language-Model-as-a-Service | Jan 10, 2022 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| QuadTree Attention for Vision Transformers | Jan 8, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Equalized Focal Loss for Dense Long-Tailed Object Detection | Jan 7, 2022 | Long-tailed Object DetectionObject | CodeCode Available | 2 |
| Generalized Category Discovery | Jan 7, 2022 | Fine-Grained Visual RecognitionOpen-World Semi-Supervised Learning | CodeCode Available | 2 |
| BERN2: an advanced neural biomedical named entity recognition and normalization tool | Jan 6, 2022 | graph constructionnamed-entity-recognition | CodeCode Available | 2 |
| POCO: Point Convolution for Surface Reconstruction | Jan 5, 2022 | 3D ReconstructionSurface Reconstruction | CodeCode Available | 2 |
| Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation | Jan 5, 2022 | 3D ReconstructionClassification | CodeCode Available | 2 |
| Robust Self-Supervised Audio-Visual Speech Recognition | Jan 5, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 2 |
| Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction | Jan 5, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Multi-Representation Adaptation Network for Cross-domain Image Classification | Jan 4, 2022 | ClassificationDomain Adaptation | CodeCode Available | 2 |
| MDFEND: Multi-domain Fake News Detection | Jan 4, 2022 | Fake News DetectionMixture-of-Experts | CodeCode Available | 2 |
| Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images | Jan 4, 2022 | 3D Semantic SegmentationBrain Tumor Segmentation | CodeCode Available | 2 |
| A Transformer-Based Siamese Network for Change Detection | Jan 4, 2022 | Change DetectionDecoder | CodeCode Available | 2 |
| Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple Sources | Jan 4, 2022 | Domain Adaptationdomain classification | CodeCode Available | 2 |
| Vision Transformer with Deformable Attention | Jan 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space | Jan 3, 2022 | GPU | CodeCode Available | 2 |
| Language as Queries for Referring Video Object Segmentation | Jan 3, 2022 | ObjectObject Tracking | CodeCode Available | 2 |
| Integrating Artificial Intelligence and Augmented Reality in Robotic Surgery: An Initial dVRK Study Using a Surgical Education Scenario | Jan 2, 2022 | | CodeCode Available | 2 |
| Improving Out-of-Distribution Robustness via Selective Augmentation | Jan 2, 2022 | | CodeCode Available | 2 |
| DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents | Jan 2, 2022 | Image GenerationVocal Bursts Intensity Prediction | CodeCode Available | 2 |
| Splicing ViT Features for Semantic Appearance Transfer | Jan 2, 2022 | Appearance TransferImage Generation | CodeCode Available | 2 |
| 360MonoDepth: High-Resolution 360deg Monocular Depth Estimation | Jan 1, 2022 | 2kDepth Estimation | CodeCode Available | 2 |
| HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction | Jan 1, 2022 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| StyTr2: Image Style Transfer With Transformers | Jan 1, 2022 | DecoderStyle Transfer | CodeCode Available | 2 |
| URetinex-Net: Retinex-Based Deep Unfolding Network for Low-Light Image Enhancement | Jan 1, 2022 | Image EnhancementLow-Light Image Enhancement | CodeCode Available | 2 |
| A Simple Episodic Linear Probe Improves Visual Recognition in the Wild | Jan 1, 2022 | Fine-Grained Image ClassificationImage Classification | CodeCode Available | 2 |
| Generating Diverse and Natural 3D Human Motions From Text | Jan 1, 2022 | Motion Synthesis | CodeCode Available | 2 |
| Pix2NeRF: Unsupervised Conditional p-GAN for Single Image to Neural Radiance Fields Translation | Jan 1, 2022 | 3D-Aware Image SynthesisImage Generation | CodeCode Available | 2 |
| All-in-One Image Restoration for Unknown Corruption | Jan 1, 2022 | 5-Degradation Blind All-in-One Image RestorationAll | CodeCode Available | 2 |
| HairMapper: Removing Hair From Portraits Using GANs | Jan 1, 2022 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 2 |
| C2AM: Contrastive Learning of Class-Agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Jan 1, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AI | Dec 30, 2021 | | CodeCode Available | 2 |
| JoJoGAN: One Shot Face Stylization | Dec 22, 2021 | Image StylizationOne-Shot Face Stylization | CodeCode Available | 2 |
| BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View | Dec 22, 2021 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| NICE-SLAM: Neural Implicit Scalable Encoding for SLAM | Dec 22, 2021 | Simultaneous Localization and Mapping | CodeCode Available | 2 |
| GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models | Dec 20, 2021 | DiversityImage Generation | CodeCode Available | 2 |
| Mask2Former for Video Instance Segmentation | Dec 20, 2021 | Image SegmentationInstance Segmentation | CodeCode Available | 2 |
| M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots | Dec 19, 2021 | | CodeCode Available | 2 |
| Automated Deep Learning: Neural Architecture Search Is Not the End | Dec 16, 2021 | Deep LearningMachine Translation | CodeCode Available | 2 |
| ICON: Implicit Clothed humans Obtained from Normals | Dec 16, 2021 | 3D Human Pose Estimation3D Human Reconstruction | CodeCode Available | 2 |
| Are Graph Augmentations Necessary? Simple Graph Contrastive Learning for Recommendation | Dec 16, 2021 | Contrastive LearningRecommendation Systems | CodeCode Available | 2 |
| Putting People in their Place: Monocular Regression of 3D People in Depth | Dec 15, 2021 | 3D Depth Estimationregression | CodeCode Available | 2 |