| Accelerating Sparse Deep Neural Networks | Apr 16, 2021 | GPUMath | CodeCode Available | 2 |
| Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis | Apr 12, 2021 | Domain AdaptationImage-to-Image Translation | CodeCode Available | 2 |
| A Replication Study of Dense Passage Retriever | Apr 12, 2021 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| TenSEAL: A Library for Encrypted Tensor Operations Using Homomorphic Encryption | Apr 7, 2021 | BIG-bench Machine LearningPrivacy Preserving | CodeCode Available | 2 |
| ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement | Apr 6, 2021 | Image GenerationReal-to-Cartoon translation | CodeCode Available | 2 |
| AST: Audio Spectrogram Transformer | Apr 5, 2021 | Audio ClassificationAudio Tagging | CodeCode Available | 2 |
| AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control | Apr 5, 2021 | Imitation LearningReinforcement Learning (RL) | CodeCode Available | 2 |
| Russian Paraphrasers: Paraphrase with Transformers | Apr 1, 2021 | | CodeCode Available | 2 |
| Reconstructing 3D Human Pose by Watching Humans in the Mirror | Apr 1, 2021 | 3D Pose EstimationPose Estimation | CodeCode Available | 2 |
| StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery | Mar 31, 2021 | Image Manipulation | CodeCode Available | 2 |
| VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization | Mar 31, 2021 | Virtual Try-onVocal Bursts Intensity Prediction | CodeCode Available | 2 |
| Sparse Auxiliary Networks for Unified Monocular Depth Prediction and Completion | Mar 30, 2021 | Depth EstimationDepth Prediction | CodeCode Available | 2 |
| MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo | Mar 29, 2021 | NeRFNeural Rendering | CodeCode Available | 2 |
| Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Mar 25, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| FastMoE: A Fast Mixture-of-Expert Training System | Mar 24, 2021 | GPULanguage Modeling | CodeCode Available | 2 |
| Articulated Object Interaction in Unknown Scenes with Whole-Body Mobile Manipulation | Mar 18, 2021 | Object | CodeCode Available | 2 |
| GPT Understands, Too | Mar 18, 2021 | Knowledge ProbingLanguage Modeling | CodeCode Available | 2 |
| Full Page Handwriting Recognition via Image to Sequence Extraction | Mar 11, 2021 | Handwriting RecognitionHandwritten Text Recognition | CodeCode Available | 2 |
| Involution: Inverting the Inherence of Convolution for Visual Recognition | Mar 10, 2021 | Image Classification | CodeCode Available | 2 |
| hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices | Mar 9, 2021 | BIG-bench Machine LearningDiagnostic | CodeCode Available | 2 |
| Virtual Normal: Enforcing Geometric Constraints for Accurate and Robust Depth Prediction | Mar 7, 2021 | Depth EstimationDepth Prediction | CodeCode Available | 2 |
| Measuring Mathematical Problem Solving With the MATH Dataset | Mar 5, 2021 | MathMathematical Problem-Solving | CodeCode Available | 2 |
| CLAIMED, a visual and scalable component library for Trusted AI | Mar 4, 2021 | Adversarial RobustnessFairness | CodeCode Available | 2 |
| Coordinate Attention for Efficient Mobile Network Design | Mar 4, 2021 | object-detectionObject Detection | CodeCode Available | 2 |
| OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association | Mar 3, 2021 | Car Pose EstimationKeypoint Detection | CodeCode Available | 2 |
| Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control | Mar 3, 2021 | BenchmarkingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning | Mar 2, 2021 | BIG-bench Machine LearningImage Retrieval | CodeCode Available | 2 |
| CogDL: A Comprehensive Library for Graph Deep Learning | Mar 1, 2021 | Deep LearningGraph Classification | CodeCode Available | 2 |
| Generative Adversarial Transformers | Mar 1, 2021 | DisentanglementImage Generation | CodeCode Available | 2 |
| Learning Transferable Visual Models From Natural Language Supervision | Feb 26, 2021 | Action RecognitionBenchmarking | CodeCode Available | 2 |
| Fast Minimum-norm Adversarial Attacks through Adaptive Norm Constraints | Feb 25, 2021 | Adversarial AttackAdversarial Robustness | CodeCode Available | 2 |
| When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute | Feb 24, 2021 | GPULanguage Modeling | CodeCode Available | 2 |
| ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks | Feb 23, 2021 | Image Classification | CodeCode Available | 2 |
| Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research with Sparse and Dense Representations | Feb 19, 2021 | Cultural Vocal Bursts Intensity PredictionInformation Retrieval | CodeCode Available | 2 |
| Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development | Feb 18, 2021 | BIG-bench Machine LearningDrug Discovery | CodeCode Available | 2 |
| LambdaNetworks: Modeling Long-Range Interactions Without Attention | Feb 17, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives | Feb 12, 2021 | Deep Learning | CodeCode Available | 2 |
| Understanding self-supervised Learning Dynamics without Contrastive Pairs | Feb 12, 2021 | Self-Supervised Learning | CodeCode Available | 2 |
| Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision | Feb 11, 2021 | Cross-Modal RetrievalFine-Grained Image Classification | CodeCode Available | 2 |
| Scale Normalized Image Pyramids with AutoFocus for Object Detection | Feb 10, 2021 | Objectobject-detection | CodeCode Available | 2 |
| Is Space-Time Attention All You Need for Video Understanding? | Feb 9, 2021 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| Robust Motion In-betweening | Feb 9, 2021 | Human Pose Forecastingmotion in-betweening | CodeCode Available | 2 |
| TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation | Feb 8, 2021 | Cardiac SegmentationDecoder | CodeCode Available | 2 |
| Neural SDEs as Infinite-Dimensional GANs | Feb 6, 2021 | Time SeriesTime Series Analysis | CodeCode Available | 2 |
| Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses | Feb 3, 2021 | DecoderSpeech Denoising | CodeCode Available | 2 |
| Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details | Feb 1, 2021 | Benchmarkingobject-detection | CodeCode Available | 2 |
| Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet | Jan 28, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting | Jan 28, 2021 | Multivariate Time Series ForecastingProbabilistic Time Series Forecasting | CodeCode Available | 2 |
| BirdNET: A deep learning solution for avian diversity monitoring | Jan 27, 2021 | Data AugmentationDeep Learning | CodeCode Available | 2 |
| Bottleneck Transformers for Visual Recognition | Jan 27, 2021 | image-classificationImage Classification | CodeCode Available | 2 |