| MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained Monocular Backbones | Jul 26, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Monocular 3D Object Detection with Depth from Motion | Jul 26, 2022 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 |
| Classifier-Free Diffusion Guidance | Jul 26, 2022 | Diversity | CodeCode Available | 2 |
| Exploring CLIP for Assessing the Look and Feel of Images | Jul 25, 2022 | Image Quality AssessmentNo-Reference Image Quality Assessment | CodeCode Available | 2 |
| AMLB: an AutoML Benchmark | Jul 25, 2022 | AutoML | CodeCode Available | 2 |
| CelebV-HQ: A Large-Scale Video Facial Attributes Dataset | Jul 25, 2022 | AttributeDiversity | CodeCode Available | 2 |
| Patchwork++: Fast and Robust Ground Segmentation Solving Partial Under-Segmentation Using 3D Point Cloud | Jul 25, 2022 | Object RecognitionSegmentation | CodeCode Available | 2 |
| SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks | Jul 24, 2022 | | CodeCode Available | 2 |
| Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis | Jul 24, 2022 | 3D geometryNeRF | CodeCode Available | 2 |
| Thermal half-lives of azobenzene derivatives: virtual screening based on intersystem crossing using a machine learning potential | Jul 23, 2022 | | CodeCode Available | 2 |
| When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition | Jul 23, 2022 | DecoderHandwritten Mathmatical Expression Recognition | CodeCode Available | 2 |
| Low-Complexity Acoustic Echo Cancellation with Neural Kalman Filtering | Jul 23, 2022 | Acoustic echo cancellation | CodeCode Available | 2 |
| Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos | Jul 22, 2022 | 3D Face Reconstruction3D Reconstruction | CodeCode Available | 2 |
| Panoptic Scene Graph Generation | Jul 22, 2022 | BenchmarkingPanoptic Scene Graph Generation | CodeCode Available | 2 |
| MeshLoc: Mesh-Based Visual Localization | Jul 21, 2022 | Camera Pose EstimationNeural Rendering | CodeCode Available | 2 |
| Pose for Everything: Towards Category-Agnostic Pose Estimation | Jul 21, 2022 | 2D Pose EstimationCategory-Agnostic Pose Estimation | CodeCode Available | 2 |
| AdaNeRF: Adaptive Sampling for Real-time Rendering of Neural Radiance Fields | Jul 21, 2022 | Novel View Synthesis | CodeCode Available | 2 |
| Generative Multiplane Images: Making a 2D GAN 3D-Aware | Jul 21, 2022 | | CodeCode Available | 2 |
| DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection | Jul 21, 2022 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |
| Language Model Cascades | Jul 21, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 2 |
| Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild | Jul 21, 2022 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |
| Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression | Jul 21, 2022 | HallucinationImage Enhancement | CodeCode Available | 2 |
| CodeT: Code Generation with Generated Tests | Jul 21, 2022 | Code GenerationHumanEval | CodeCode Available | 2 |
| DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network | Jul 21, 2022 | Image EnhancementImage Reconstruction | CodeCode Available | 2 |
| In Defense of Online Models for Video Instance Segmentation | Jul 21, 2022 | Contrastive LearningInstance Segmentation | CodeCode Available | 2 |
| EC-KitY: Evolutionary Computation Tool Kit in Python with Seamless Machine Learning Integration | Jul 21, 2022 | BIG-bench Machine Learning | CodeCode Available | 2 |
| 3D Clothed Human Reconstruction in the Wild | Jul 20, 2022 | Garment Reconstruction | CodeCode Available | 2 |
| Deep Learning Based Automatic Modulation Recognition: Models, Datasets, and Challenges | Jul 20, 2022 | Automatic Modulation RecognitionDeep Learning | CodeCode Available | 2 |
| Fully Sparse 3D Object Detection | Jul 20, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Pretraining a Neural Network before Knowing Its Architecture | Jul 20, 2022 | Diversity | CodeCode Available | 2 |
| Diffsound: Discrete Diffusion Model for Text-to-sound Generation | Jul 20, 2022 | Audio GenerationDecoder | CodeCode Available | 2 |
| Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing | Jul 20, 2022 | 4kImage Enhancement | CodeCode Available | 2 |
| Large Scale Radio Frequency Signal Classification | Jul 20, 2022 | DiversityGeneral Classification | CodeCode Available | 2 |
| Box-supervised Instance Segmentation with Level Set Evolution | Jul 19, 2022 | Box-supervised Instance SegmentationInstance Segmentation | CodeCode Available | 2 |
| ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild | Jul 19, 2022 | Camera Pose EstimationMotion Segmentation | CodeCode Available | 2 |
| Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification | Jul 19, 2022 | RetrievalTransfer Learning | CodeCode Available | 2 |
| Active-Learning-as-a-Service: An Automatic and Efficient MLOps System for Data-Centric AI | Jul 19, 2022 | Active LearningAutoML | CodeCode Available | 2 |
| Why do tree-based models still outperform deep learning on tabular data? | Jul 18, 2022 | Benchmarking | CodeCode Available | 2 |
| SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery | Jul 17, 2022 | Land Cover ClassificationSemantic Segmentation | CodeCode Available | 2 |
| Unsupervised Medical Image Translation with Adversarial Diffusion Models | Jul 17, 2022 | DiversityImage Generation | CodeCode Available | 2 |
| Towards Lightweight Super-Resolution with Dual Regression Learning | Jul 16, 2022 | Image Super-ResolutionModel Compression | CodeCode Available | 2 |
| [Reproducibility Report] Path Planning using Neural A* Search | Jul 16, 2022 | Motion Planning | CodeCode Available | 2 |
| SenseFi: A Library and Benchmark on Deep-Learning-Empowered WiFi Human Sensing | Jul 16, 2022 | Activity RecognitionDeep Learning | CodeCode Available | 2 |
| Registration based Few-Shot Anomaly Detection | Jul 15, 2022 | Anomaly Detection | CodeCode Available | 2 |
| ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning | Jul 15, 2022 | Autonomous DrivingBird's-Eye View Semantic Segmentation | CodeCode Available | 2 |
| Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments | Jul 15, 2022 | blind source separationSpeech Enhancement | CodeCode Available | 2 |
| Recurrent Memory Transformer | Jul 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Current Trends in Deep Learning for Earth Observation: An Open-source Benchmark Arena for Image Classification | Jul 14, 2022 | ClassificationEarth Observation | CodeCode Available | 2 |
| u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality | Jul 14, 2022 | Speaker Verificationspeech-recognition | CodeCode Available | 2 |
| Point-to-Box Network for Accurate Object Detection via Single Point Supervision | Jul 14, 2022 | AttributeMultiple Instance Learning | CodeCode Available | 2 |