| Sintel: A Machine Learning Framework to Extract Insights from Signals | Apr 19, 2022 | Anomaly DetectionBIG-bench Machine Learning | CodeCode Available | 3 |
| SymForce: Symbolic Computation and Code Generation for Robotics | Apr 17, 2022 | Code GenerationMath | CodeCode Available | 3 |
| Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks | Apr 16, 2022 | BenchmarkingInstruction Following | CodeCode Available | 3 |
| MiniViT: Compressing Vision Transformers with Weight Multiplexing | Apr 14, 2022 | DiversityImage Classification | CodeCode Available | 3 |
| Hierarchical Text-Conditional Image Generation with CLIP Latents | Apr 13, 2022 | Conditional Image GenerationDecoder | CodeCode Available | 3 |
| VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration | Apr 12, 2022 | Speech DenoisingSpeech Enhancement | CodeCode Available | 3 |
| Towards An End-to-End Framework for Flow-Guided Video Inpainting | Apr 6, 2022 | HallucinationOptical Flow Estimation | CodeCode Available | 3 |
| MaxViT: Multi-Axis Vision Transformer | Apr 4, 2022 | image-classificationImage Classification | CodeCode Available | 3 |
| UNetFormer: A Unified Vision Transformer Model and Pre-Training Framework for 3D Medical Image Segmentation | Apr 1, 2022 | Brain Tumor SegmentationImage Segmentation | CodeCode Available | 3 |
| Learnable latent embeddings for joint behavioral and neural analysis | Apr 1, 2022 | | CodeCode Available | 3 |
| BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection | Mar 31, 2022 | 3D Object Detectionobject-detection | CodeCode Available | 3 |
| Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network for Surgical Tools Segmentation | Mar 29, 2022 | Contrastive LearningSegmentation | CodeCode Available | 3 |
| Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Mar 27, 2022 | CPUMulti-Object Tracking | CodeCode Available | 3 |
| EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation | Mar 24, 2022 | 3D Object Detection6D Pose Estimation using RGB | CodeCode Available | 3 |
| Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer | Mar 24, 2022 | Style TransferTransfer Learning | CodeCode Available | 3 |
| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Mar 23, 2022 | 4kAction Classification | CodeCode Available | 3 |
| NeuMan: Neural Human Radiance Field from a Single Video | Mar 23, 2022 | NeRF | CodeCode Available | 3 |
| Visual Prompt Tuning | Mar 23, 2022 | Image ClassificationLong-tail Learning | CodeCode Available | 3 |
| Inferring Articulated Rigid Body Dynamics from RGBD Video | Mar 20, 2022 | Contact mechanicsInverse Rendering | CodeCode Available | 3 |
| Half-Inverse Gradients for Physical Deep Learning | Mar 18, 2022 | Deep Learning | CodeCode Available | 3 |
| Pushing the limits of raw waveform speaker recognition | Mar 16, 2022 | Self-Supervised LearningSpeaker Recognition | CodeCode Available | 3 |
| Image Quality Assessment for Magnetic Resonance Imaging | Mar 15, 2022 | DenoisingImage Enhancement | CodeCode Available | 3 |
| Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models | Mar 14, 2022 | Text Classification | CodeCode Available | 3 |
| A Unified Framework for Rank-based Evaluation Metrics for Link Prediction in Knowledge Graphs | Mar 14, 2022 | BenchmarkingGraph Embedding | CodeCode Available | 3 |
| Don't fear the unlabelled: safe semi-supervised learning via simple debiasing | Mar 14, 2022 | Learning TheoryPseudo Label | CodeCode Available | 3 |
| CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification | Mar 13, 2022 | Audio ClassificationKnowledge Distillation | CodeCode Available | 3 |
| Automated Formulaic Alpha Generation for Quantitative Investing using Evolutionary Algorithms | Mar 13, 2022 | Evolutionary Algorithms | CodeCode Available | 3 |
| PETR: Position Embedding Transformation for Multi-View 3D Object Detection | Mar 10, 2022 | 3D Object DetectionObject | CodeCode Available | 3 |
| BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis | Mar 10, 2022 | Gesture GenerationGesture Recognition | CodeCode Available | 3 |
| Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer | Mar 7, 2022 | | CodeCode Available | 3 |
| Nuclei instance segmentation and classification in histopathology images with StarDist | Mar 3, 2022 | ClassificationInstance Segmentation | CodeCode Available | 3 |
| Autoregressive Image Generation using Residual Quantization | Mar 3, 2022 | Conditional Image GenerationImage Generation | CodeCode Available | 3 |
| Magnitude-aware Probabilistic Speaker Embeddings | Feb 28, 2022 | Out-of-Distribution DetectionSpeaker Verification | CodeCode Available | 3 |
| QOC: Quantum On-Chip Training with Parameter Shift and Gradient Pruning | Feb 26, 2022 | image-classificationImage Classification | CodeCode Available | 3 |
| A Systematic Evaluation of Large Language Models of Code | Feb 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation | Feb 23, 2022 | Speech Synthesis | CodeCode Available | 3 |
| Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet | Feb 22, 2022 | Speech Synthesis | CodeCode Available | 3 |
| ST-MoE: Designing Stable and Transferable Sparse Expert Models | Feb 17, 2022 | ARCCommon Sense Reasoning | CodeCode Available | 3 |
| Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series | Feb 16, 2022 | Anomaly DetectionDensity Estimation | CodeCode Available | 3 |
| TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery | Feb 16, 2022 | BIG-bench Machine LearningDrug Discovery | CodeCode Available | 3 |
| Block-NeRF: Scalable Large Scene Neural View Synthesis | Feb 10, 2022 | NeRF | CodeCode Available | 3 |
| Locating and Editing Factual Associations in GPT | Feb 10, 2022 | counterfactualModel Editing | CodeCode Available | 3 |
| DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models | Feb 8, 2022 | DiagnosticImage Captioning | CodeCode Available | 3 |
| MaskGIT: Masked Generative Image Transformer | Feb 8, 2022 | DecoderImage Generation | CodeCode Available | 3 |
| A new face swap method for image and video domains: a technical report | Feb 7, 2022 | Action Recognition In VideosFace Recognition | CodeCode Available | 3 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 3 |
| VRT: A Video Restoration Transformer | Jan 28, 2022 | DeblurringDenoising | CodeCode Available | 3 |
| Transformers in Medical Imaging: A Survey | Jan 24, 2022 | Image ClassificationImage Segmentation | CodeCode Available | 3 |
| Patches Are All You Need? | Jan 24, 2022 | AllImage Classification | CodeCode Available | 3 |
| Point-NeRF: Point-based Neural Radiance Fields | Jan 21, 2022 | 3D ReconstructionNeRF | CodeCode Available | 3 |