| Deep Incubation: Training Large Models by Divide-and-Conquering | Dec 8, 2022 | Image Segmentationobject-detection | CodeCode Available | 2 |
| Editing Models with Task Arithmetic | Dec 8, 2022 | NegationTask Arithmetic | CodeCode Available | 2 |
| Learning Video Representations from Large Language Models | Dec 8, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation | Dec 8, 2022 | 3D Reconstruction3D Shape Generation | CodeCode Available | 2 |
| Generating Holistic 3D Human Motion from Speech | Dec 8, 2022 | 3D Face AnimationGesture Generation | CodeCode Available | 2 |
| ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation | Dec 7, 2022 | Semantic Segmentationzero-shot-classification | CodeCode Available | 2 |
| Spatio-Temporal Self-Supervised Learning for Traffic Flow Prediction | Dec 7, 2022 | AttributePrediction | CodeCode Available | 2 |
| Discovering Latent Knowledge in Language Models Without Supervision | Dec 7, 2022 | Imitation LearningLanguage Modelling | CodeCode Available | 2 |
| Deep Learning Methods for Partial Differential Equations and Related Parameter Identification Problems | Dec 6, 2022 | Deep Learning | CodeCode Available | 2 |
| GAUCHE: A Library for Gaussian Processes in Chemistry | Dec 6, 2022 | Bayesian OptimisationGaussian Processes | CodeCode Available | 2 |
| Perspective Fields for Single Image Camera Calibration | Dec 6, 2022 | Camera Calibration | CodeCode Available | 2 |
| Yggdrasil Decision Forests: A Fast and Extensible Decision Forests Library | Dec 6, 2022 | | CodeCode Available | 2 |
| Semantic-Conditional Diffusion Networks for Image Captioning | Dec 6, 2022 | Cross-Modal RetrievalDecoder | CodeCode Available | 2 |
| Fine-tuned CLIP Models are Efficient Video Learners | Dec 6, 2022 | | CodeCode Available | 2 |
| Learning Neural Parametric Head Models | Dec 6, 2022 | | CodeCode Available | 2 |
| Diffusion-SDF: Text-to-Shape via Voxelized Diffusion | Dec 6, 2022 | | CodeCode Available | 2 |
| DiffusionInst: Diffusion Model for Instance Segmentation | Dec 6, 2022 | DenoisingInstance Segmentation | CodeCode Available | 2 |
| SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields | Dec 5, 2022 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 2 |
| Learning Physically Realizable Skills for Online Packing of General 3D Shapes | Dec 5, 2022 | 3D geometryAction Generation | CodeCode Available | 2 |
| Democratizing Neural Machine Translation with OPUS-MT | Dec 4, 2022 | Machine TranslationTranslation | CodeCode Available | 2 |
| Melody transcription via generative pre-training | Dec 4, 2022 | Chord RecognitionInformation Retrieval | CodeCode Available | 2 |
| Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution | Dec 3, 2022 | Box-supervised Instance SegmentationDecoder | CodeCode Available | 2 |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Dec 2, 2022 | Domain Adaptationimage-classification | CodeCode Available | 2 |
| CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion | Dec 2, 2022 | 3D Object TrackingAutonomous Vehicles | CodeCode Available | 2 |
| GRiT: A Generative Region-to-text Transformer for Object Understanding | Dec 1, 2022 | DecoderDense Captioning | CodeCode Available | 2 |
| Scaling Language-Image Pre-training via Masking | Dec 1, 2022 | Diversity | CodeCode Available | 2 |
| Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation | Dec 1, 2022 | 3D GenerationText to 3D | CodeCode Available | 2 |
| MO-Gym: A Library of Multi-Objective Reinforcement Learning Environments | Nov 30, 2022 | Multi-Objective Reinforcement LearningOpenAI Gym | CodeCode Available | 2 |
| ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT | Nov 30, 2022 | Molecular System PredictionSentence Classification | CodeCode Available | 2 |
| Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library | Nov 29, 2022 | | CodeCode Available | 2 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Nov 29, 2022 | 3D Open-Vocabulary Instance SegmentationContrastive Learning | CodeCode Available | 2 |
| Compressing Volumetric Radiance Fields to 1 MB | Nov 29, 2022 | Model CompressionNeRF | CodeCode Available | 2 |
| Wavelet Diffusion Models are fast and scalable Image Generators | Nov 29, 2022 | BlockingImage Generation | CodeCode Available | 2 |
| NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views | Nov 29, 2022 | 3D ReconstructionImage to 3D | CodeCode Available | 2 |
| Fast-SNARF: A Fast Deformer for Articulated Neural Fields | Nov 28, 2022 | 3D ReconstructionComputational Efficiency | CodeCode Available | 2 |
| OpenScene: 3D Scene Understanding with Open Vocabularies | Nov 28, 2022 | 3D Open-Vocabulary Instance Segmentation3D Semantic Segmentation | CodeCode Available | 2 |
| Why do tree-based models still outperform deep learning on typical tabular data? | Nov 28, 2022 | Benchmarking | CodeCode Available | 2 |
| DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models | Nov 28, 2022 | DenoisingLanguage Modeling | CodeCode Available | 2 |
| High-fidelity 3D GAN Inversion by Pseudo-multi-view Optimization | Nov 28, 2022 | AttributeGenerative Adversarial Network | CodeCode Available | 2 |
| Semi-Supervised Confidence-Level-based Contrastive Discrimination for Class-Imbalanced Semantic Segmentation | Nov 28, 2022 | Contrastive LearningRoad Segmentation | CodeCode Available | 2 |
| FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Nov 28, 2022 | GPUVisual Localization | CodeCode Available | 2 |
| SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation | Nov 28, 2022 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding | Nov 28, 2022 | Time SeriesTime Series Analysis | CodeCode Available | 2 |
| Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries | Nov 28, 2022 | Structured PredictionVocal Bursts Valence Prediction | CodeCode Available | 2 |
| H3WB: Human3.6M 3D WholeBody Dataset and Benchmark | Nov 28, 2022 | 3D Facial Landmark Localization3D Hand Pose Estimation | CodeCode Available | 2 |
| Dense Text Retrieval based on Pretrained Language Models: A Survey | Nov 27, 2022 | RetrievalSurvey | CodeCode Available | 2 |
| Open-Source Ground-based Sky Image Datasets for Very Short-term Solar Forecasting, Cloud Analysis and Modeling: A Comprehensive Survey | Nov 27, 2022 | motion prediction | CodeCode Available | 2 |
| Medical Image Segmentation Review: The success of U-Net | Nov 27, 2022 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| AvatarGen: A 3D Generative Model for Animatable Human Avatars | Nov 26, 2022 | Human Animation | CodeCode Available | 2 |