| Block-Recurrent Transformers | Mar 11, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval | Mar 11, 2022 | GPURetrieval | CodeCode Available | 2 |
| Masked Visual Pre-training for Motor Control | Mar 11, 2022 | Robot Manipulation GeneralizationState Estimation | CodeCode Available | 2 |
| QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization | Mar 11, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision | Mar 11, 2022 | | CodeCode Available | 2 |
| Embedding Earth: Self-supervised contrastive pre-training for dense land cover classification | Mar 11, 2022 | Earth ObservationLand Cover Classification | CodeCode Available | 2 |
| On Embeddings for Numerical Features in Tabular Deep Learning | Mar 10, 2022 | Deep Learning | CodeCode Available | 2 |
| Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time | Mar 10, 2022 | Domain Generalization | CodeCode Available | 2 |
| Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects | Mar 10, 2022 | 3D Object Tracking6D Pose Estimation | CodeCode Available | 2 |
| Restoring and attributing ancient texts using deep neural networks | Mar 9, 2022 | Ancient Text RestorationAttribute | CodeCode Available | 2 |
| A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection | Mar 9, 2022 | Co-Salient Object Detectionobject-detection | CodeCode Available | 2 |
| CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers | Mar 9, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Temporal Difference Learning for Model Predictive Control | Mar 9, 2022 | continuous-controlContinuous Control | CodeCode Available | 2 |
| UNeXt: MLP-based Rapid Medical Image Segmentation Network | Mar 9, 2022 | DecoderImage Segmentation | CodeCode Available | 2 |
| StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN | Mar 8, 2022 | Face GenerationFacial Editing | CodeCode Available | 2 |
| E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation | Mar 8, 2022 | GPUInstance Segmentation | CodeCode Available | 2 |
| Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels | Mar 8, 2022 | Semi-Supervised Semantic Segmentation | CodeCode Available | 2 |
| Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences | Mar 8, 2022 | TripletWeakly-supervised Learning | CodeCode Available | 2 |
| RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering | Mar 8, 2022 | Neural Rendering | CodeCode Available | 2 |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Mar 8, 2022 | Image Classificationobject-detection | CodeCode Available | 2 |
| Deep Rectangling for Image Stitching: A Learning Baseline | Mar 8, 2022 | DiversityImage Stitching | CodeCode Available | 2 |
| Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval | Mar 7, 2022 | Information RetrievalPassage Retrieval | CodeCode Available | 2 |
| L2CS-Net: Fine-Grained Gaze Estimation in Unconstrained Environments | Mar 7, 2022 | DiversityGaze Estimation | CodeCode Available | 2 |
| GeoDiff: a Geometric Diffusion Model for Molecular Conformation Generation | Mar 6, 2022 | Drug Discovery | CodeCode Available | 2 |
| Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers | Mar 5, 2022 | Semantic SegmentationWeakly supervised Semantic Segmentation | CodeCode Available | 2 |
| MetaFormer: A Unified Meta Framework for Fine-Grained Recognition | Mar 5, 2022 | AttributeFine-Grained Image Classification | CodeCode Available | 2 |
| Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Mar 5, 2022 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform | Mar 4, 2022 | Speech Synthesistext-to-speech | CodeCode Available | 2 |
| F2DNet: Fast Focal Detection Network for Pedestrian Detection | Mar 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Rethinking Efficient Lane Detection via Curve Modeling | Mar 4, 2022 | Lane Detection | CodeCode Available | 2 |
| Freeform Body Motion Generation from Speech | Mar 4, 2022 | DiversityMotion Generation | CodeCode Available | 2 |
| SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models | Mar 4, 2022 | Contrastive LearningGraph Embedding | CodeCode Available | 2 |
| Attention Concatenation Volume for Accurate and Efficient Stereo Matching | Mar 4, 2022 | Patch MatchingStereo Depth Estimation | CodeCode Available | 2 |
| LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models | Mar 4, 2022 | DecoderGPU | CodeCode Available | 2 |
| TCTrack: Temporal Contexts for Aerial Tracking | Mar 3, 2022 | | CodeCode Available | 2 |
| CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation | Mar 3, 2022 | 3D Reconstruction3D Shape Reconstruction | CodeCode Available | 2 |
| Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds | Mar 3, 2022 | 3D Single Object TrackingAutonomous Driving | CodeCode Available | 2 |
| Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows | Mar 3, 2022 | Speech Synthesistext-to-speech | CodeCode Available | 2 |
| Graph Neural Networks for Multimodal Single-Cell Data Integration | Mar 3, 2022 | Data IntegrationGraph Neural Network | CodeCode Available | 2 |
| BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning | Mar 3, 2022 | Compositional Zero-Shot LearningContrastive Learning | CodeCode Available | 2 |
| NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation | Mar 3, 2022 | DecoderDepth Estimation | CodeCode Available | 2 |
| SoftGroup for 3D Instance Segmentation on Point Clouds | Mar 3, 2022 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 2 |
| Recovering 3D Human Mesh from Monocular Images: A Survey | Mar 3, 2022 | 3D human pose and shape estimationHuman Mesh Recovery | CodeCode Available | 2 |
| Colar: Effective and Efficient Online Action Detection by Consulting Exemplars | Mar 2, 2022 | Action DetectionOnline Action Detection | CodeCode Available | 2 |
| MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video | Mar 2, 2022 | 3D Human Pose EstimationClassification | CodeCode Available | 2 |
| FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours | Mar 2, 2022 | Protein Structure PredictionTranslation | CodeCode Available | 2 |
| Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding | Mar 2, 2022 | Image Inpainting | CodeCode Available | 2 |
| CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding | Mar 1, 2022 | 3D Object Classification3D Point Cloud Linear Classification | CodeCode Available | 2 |
| OpenDR: An Open Toolkit for Enabling High Performance, Low Footprint Deep Learning for Robotics | Mar 1, 2022 | | CodeCode Available | 2 |
| StrongSORT: Make DeepSORT Great Again | Feb 28, 2022 | Multi-Object Trackingobject-detection | CodeCode Available | 2 |