| NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction | Mar 21, 2022 | 3D ReconstructionNeRF | CodeCode Available | 2 |
| Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds | Mar 21, 2022 | AllGPU | CodeCode Available | 2 |
| MixFormer: End-to-End Tracking with Iterative Mixed Attention | Mar 21, 2022 | Semi-Supervised Video Object SegmentationVideo Object Tracking | CodeCode Available | 2 |
| FUTR3D: A Unified Sensor Fusion Framework for 3D Detection | Mar 20, 2022 | Autonomous DrivingDecoder | CodeCode Available | 2 |
| g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin | Mar 20, 2022 | Part-Of-Speech TaggingPolyphone disambiguation | CodeCode Available | 2 |
| A 3D Generative Model for Structure-Based Drug Design | Mar 20, 2022 | Drug Designvalid | CodeCode Available | 2 |
| V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer | Mar 20, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| CLRNet: Cross Layer Refinement Network for Lane Detection | Mar 19, 2022 | Lane Detection | CodeCode Available | 2 |
| ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning | Mar 19, 2022 | Chart Question AnsweringLogical Reasoning | CodeCode Available | 2 |
| Efficient Neural Network Analysis with Sum-of-Infeasibilities | Mar 19, 2022 | Adversarial AttackEfficient Neural Network | CodeCode Available | 2 |
| Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds | Mar 19, 2022 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition | Mar 19, 2022 | Scene Text DetectionText Detection | CodeCode Available | 2 |
| ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers | Mar 18, 2022 | Camera Pose EstimationNeRF | CodeCode Available | 2 |
| REALY: Rethinking the Evaluation of 3D Face Reconstruction | Mar 18, 2022 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 2 |
| Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion | Mar 18, 2022 | 3D Object DetectionData Augmentation | CodeCode Available | 2 |
| AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation | Mar 17, 2022 | | CodeCode Available | 2 |
| HybridNets: End-to-End Perception Network | Mar 17, 2022 | Autonomous DrivingDrivable Area Detection | CodeCode Available | 2 |
| STPLS3D: A Large-Scale Synthetic and Real Aerial Photogrammetry 3D Point Cloud Dataset | Mar 17, 2022 | 3D Instance Segmentation3D Semantic Segmentation | CodeCode Available | 2 |
| Interacting Attention Graph for Single Image Two-Hand Reconstruction | Mar 17, 2022 | 3D Interacting Hand Pose EstimationVocal Bursts Valence Prediction | CodeCode Available | 2 |
| Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution | Mar 17, 2022 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection | Mar 17, 2022 | Hate Speech DetectionLanguage Modelling | CodeCode Available | 2 |
| EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training | Mar 17, 2022 | Chatbot | CodeCode Available | 2 |
| BrainGB: A Benchmark for Brain Network Analysis with Graph Neural Networks | Mar 17, 2022 | Functional Connectivity | CodeCode Available | 2 |
| Scribble-Supervised LiDAR Semantic Segmentation | Mar 16, 2022 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| Unsupervised Semantic Segmentation by Distilling Feature Correspondences | Mar 16, 2022 | FormSemantic Segmentation | CodeCode Available | 2 |
| Memorizing Transformers | Mar 16, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Dual Diffusion Implicit Bridges for Image-to-Image Translation | Mar 16, 2022 | Image-to-Image TranslationTranslation | CodeCode Available | 2 |
| Decoupled Knowledge Distillation | Mar 16, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| EDTER: Edge Detection with Transformer | Mar 16, 2022 | DecoderEdge Detection | CodeCode Available | 2 |
| Learning What Not to Segment: A New Perspective on Few-Shot Segmentation | Mar 15, 2022 | Few-Shot Semantic SegmentationMeta-Learning | CodeCode Available | 2 |
| Animatable Implicit Neural Representations for Creating Realistic Avatars from Videos | Mar 15, 2022 | | CodeCode Available | 2 |
| InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Mar 15, 2022 | Boundary DetectionHuman Parsing | CodeCode Available | 2 |
| ADATIME: A Benchmarking Suite for Domain Adaptation on Time Series Data | Mar 15, 2022 | BenchmarkingDomain Adaptation | CodeCode Available | 2 |
| OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction | Mar 15, 2022 | 3D ReconstructionGraph Neural Network | CodeCode Available | 2 |
| MotionCLIP: Exposing Human Motion Generation to CLIP Space | Mar 15, 2022 | DisentanglementMotion Generation | CodeCode Available | 2 |
| Real-time Neural-MPC: Deep Learning Model Predictive Control for Quadrotors and Agile Robotic Platforms | Mar 15, 2022 | Model Predictive Control | CodeCode Available | 2 |
| LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network | Mar 14, 2022 | 4D Panoptic SegmentationAutonomous Driving | CodeCode Available | 2 |
| Modelling Non-Smooth Signals with Complex Spectral Structure | Mar 14, 2022 | Variational Inference | CodeCode Available | 2 |
| All in One: Exploring Unified Video-Language Pre-training | Mar 14, 2022 | AllLanguage Modelling | CodeCode Available | 2 |
| PERT: Pre-training BERT with Permuted Language Model | Mar 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Accelerating DETR Convergence via Semantic-Aligned Matching | Mar 14, 2022 | Objectobject-detection | CodeCode Available | 2 |
| Respecting causality is all you need for training physics-informed neural networks | Mar 14, 2022 | AllAttribute | CodeCode Available | 2 |
| A Supervised Learning Approach to Rankability | Mar 14, 2022 | | CodeCode Available | 2 |
| Dawn of the transformer era in speech emotion recognition: closing the valence gap | Mar 14, 2022 | Cross-corpusEmotion Recognition | CodeCode Available | 2 |
| ScienceWorld: Is your Agent Smarter than a 5th Grader? | Mar 14, 2022 | Question Answering | CodeCode Available | 2 |
| Masked Autoencoders for Point Cloud Self-supervised Learning | Mar 13, 2022 | 3D Part Segmentation3D Point Cloud Classification | CodeCode Available | 2 |
| Efficient Long-Range Attention Network for Image Super-resolution | Mar 13, 2022 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| Depth-Aware Generative Adversarial Network for Talking Head Video Generation | Mar 13, 2022 | 3D geometryGenerative Adversarial Network | CodeCode Available | 2 |
| Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs | Mar 13, 2022 | Image Classification | CodeCode Available | 2 |
| Protein Representation Learning by Geometric Structure Pretraining | Mar 11, 2022 | Contrastive LearningPrediction | CodeCode Available | 2 |