| DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation | Nov 18, 2022 | Code GenerationMemorization | CodeCode Available | 2 |
| CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow | Nov 18, 2022 | Optical Flow EstimationPosition | CodeCode Available | 2 |
| RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation | Nov 17, 2022 | 3D Generation3D Reconstruction | CodeCode Available | 2 |
| Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models | Nov 17, 2022 | Gesture GenerationMotion Synthesis | CodeCode Available | 2 |
| EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones | Nov 17, 2022 | Data AugmentationSelf-Supervised Learning | CodeCode Available | 2 |
| UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer | Nov 17, 2022 | Video Understanding | CodeCode Available | 2 |
| DeepPrivacy2: Towards Realistic Full-Body Anonymization | Nov 17, 2022 | DiversityFace Anonymization | CodeCode Available | 2 |
| Towards Building Text-To-Speech Systems for the Next Billion Users | Nov 17, 2022 | DiversitySpeech Synthesis | CodeCode Available | 2 |
| MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors | Nov 17, 2022 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 2 |
| Ignore Previous Prompt: Attack Techniques For Language Models | Nov 17, 2022 | Adversarial AttackAdversarial Text | CodeCode Available | 2 |
| Where a Strong Backbone Meets Strong Features -- ActionFormer for Ego4D Moment Queries Challenge | Nov 16, 2022 | Action LocalizationMoment Queries | CodeCode Available | 2 |
| SMILEtrack: SiMIlarity LEarning for Occlusion-Aware Multiple Object Tracking | Nov 16, 2022 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 2 |
| RetroMAE v2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | Nov 16, 2022 | Dimensionality ReductionInformation Retrieval | CodeCode Available | 2 |
| Improving Feature-based Visual Localization by Geometry-Aided Matching | Nov 16, 2022 | 3D Feature MatchingPose Estimation | CodeCode Available | 2 |
| MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis | Nov 16, 2022 | Image GenerationRepresentation Learning | CodeCode Available | 2 |
| Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications | Nov 15, 2022 | Physics-informed machine learning | CodeCode Available | 2 |
| A mixed-categorical correlation kernel for Gaussian process | Nov 15, 2022 | | CodeCode Available | 2 |
| Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Nov 14, 2022 | 3D GenerationImage Generation | CodeCode Available | 2 |
| A Novel Sampling Scheme for Text- and Image-Conditional Image Synthesis in Quantized Latent Spaces | Nov 14, 2022 | Conditional Image GenerationDenoising | CodeCode Available | 2 |
| Towards A Unified Conformer Structure: from ASR to ASV Task | Nov 14, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| TorchOpt: An Efficient Library for Differentiable Optimization | Nov 13, 2022 | CPUGPU | CodeCode Available | 2 |
| Learning Heterogeneous Agent Cooperation via Multiagent League Training | Nov 13, 2022 | Diversityreinforcement-learning | CodeCode Available | 2 |
| SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation | Nov 13, 2022 | Earth ObservationMulti-Label Image Classification | CodeCode Available | 2 |
| Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding | Nov 13, 2022 | Brain Computer Interface | CodeCode Available | 2 |
| Deep Learning Generates Synthetic Cancer Histology for Explainability and Education | Nov 12, 2022 | Deep LearningGenerative Adversarial Network | CodeCode Available | 2 |
| A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges | Nov 12, 2022 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| MARLIN: Masked Autoencoder for facial video Representation LearnINg | Nov 12, 2022 | Action ClassificationAttribute | CodeCode Available | 2 |
| MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation | Nov 10, 2022 | Multimodal Intent RecognitionRetrieval | CodeCode Available | 2 |
| LERT: A Linguistically-motivated Pre-trained Language Model | Nov 10, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives | Nov 9, 2022 | DisentanglementVideo Generation | CodeCode Available | 2 |
| Efficient Large-scale Audio Tagging via Transformer-to-CNN Knowledge Distillation | Nov 9, 2022 | Audio ClassificationAudio Tagging | CodeCode Available | 2 |
| Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Generation | Nov 7, 2022 | Computed Tomography (CT)Denoising | CodeCode Available | 2 |
| SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic Scenes | Nov 7, 2022 | Depth EstimationIndoor Monocular Depth Estimation | CodeCode Available | 2 |
| MogaNet: Multi-order Gated Aggregation Network | Nov 7, 2022 | 3D Human Pose EstimationImage Classification | CodeCode Available | 2 |
| Body Part-Based Representation Learning for Occluded Person Re-Identification | Nov 7, 2022 | Human ParsingOccluded Person Re-Identification | CodeCode Available | 2 |
| A Survey of Deep Face Restoration: Denoise, Super-Resolution, Deblur, Artifact Removal | Nov 5, 2022 | Deep LearningImage Restoration | CodeCode Available | 2 |
| Robust Reflection Removal with Flash-only Cues in the Wild | Nov 5, 2022 | Reflection Removal | CodeCode Available | 2 |
| GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization | Nov 4, 2022 | Depression DetectionDomain Generalization | CodeCode Available | 2 |
| SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object Detection | Nov 4, 2022 | Domain AdaptationKnowledge Distillation | CodeCode Available | 2 |
| scikit-fda: A Python Package for Functional Data Analysis | Nov 4, 2022 | Model Selection | CodeCode Available | 2 |
| Large Scale Radio Frequency Wideband Signal Detection & Recognition | Nov 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Real-Time Target Sound Extraction | Nov 4, 2022 | DecoderStreaming Target Sound Extraction | CodeCode Available | 2 |
| Sky-image-based solar forecasting using deep learning with multi-location data: training models locally, globally or via transfer learning? | Nov 3, 2022 | Transfer Learning | CodeCode Available | 2 |
| Crosslingual Generalization through Multitask Finetuning | Nov 3, 2022 | Coreference ResolutionCross-Lingual Transfer | CodeCode Available | 2 |
| Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models | Nov 3, 2022 | GPU | CodeCode Available | 2 |
| WITT: A Wireless Image Transmission Transformer for Semantic Communications | Nov 2, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Pop2Piano : Pop Audio-based Piano Cover Generation | Nov 2, 2022 | | CodeCode Available | 2 |
| eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers | Nov 2, 2022 | Image GenerationText-to-Image Generation | CodeCode Available | 2 |
| Text-Only Training for Image Captioning using Noise-Injected CLIP | Nov 1, 2022 | DecoderImage Captioning | CodeCode Available | 2 |
| Dungeons and Data: A Large-Scale NetHack Dataset | Nov 1, 2022 | Decision MakingNetHack | CodeCode Available | 2 |