| DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems | May 30, 2022 | Diversity, reinforcement-learning | Code Available | 2 |
| You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction | May 30, 2022 | Exposure Correction, Image Enhancement | Code Available | 2 |
| Kernel Neural Optimal Transport | May 30, 2022 | Image-to-Image Translation, Translation | Code Available | 2 |
| Fast Dynamic Radiance Fields with Time-Aware Neural Voxels | May 30, 2022 | NeRF | Code Available | 2 |
| Multi-Agent Reinforcement Learning is a Sequence Modeling Problem | May 30, 2022 | Decision Making, MuJoCo | Code Available | 2 |
| Re-parameterizing Your Optimizers rather than Architectures | May 30, 2022 | Quantization | Code Available | 2 |
| StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis | May 30, 2022 | Data Augmentation, Self-Supervised Learning | Code Available | 2 |
| IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation | May 29, 2022 | Decoder, Optical Flow Estimation | Code Available | 2 |
| Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning | May 29, 2022 | Few-Shot Text Classification, Memorization | Code Available | 2 |
| CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | May 29, 2022 | Chinese Sentiment Analysis, Conversational Response Generation | Code Available | 2 |
| CoNT: Contrastive Neural Text Generation | May 29, 2022 | Code Comment Generation, Comment Generation | Code Available | 2 |
| MolScribe: Robust Molecular Structure Recognition with Image-To-Graph Generation | May 28, 2022 | Data Augmentation, Graph Generation | Code Available | 2 |
| Non-stationary Transformers: Exploring the Stationarity in Time Series Forecasting | May 28, 2022 | Time Series, Time Series Analysis | Code Available | 2 |
| Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training | May 28, 2022 | 3D Object Detection, 3D Point Cloud Classification | Code Available | 2 |
| GIT: A Generative Image-to-text Transformer for Vision and Language | May 27, 2022 | Decoder, Image Captioning | Code Available | 2 |
| Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation | May 27, 2022 | Contrastive Learning, image-classification | Code Available | 2 |
| BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework | May 27, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| DevFormer: A Symmetric Transformer for Context-Aware Device Placement | May 26, 2022 | Combinatorial Optimization, Meta-Learning | Code Available | 2 |
| A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning | May 26, 2022 | class-incremental learning, Class Incremental Learning | Code Available | 2 |
| Matryoshka Representation Learning | May 26, 2022 | 4k, Image Classification | Code Available | 2 |
| Towards Learning Universal Hyperparameter Optimizers with Transformers | May 26, 2022 | Hyperparameter Optimization, Meta-Learning | Code Available | 2 |
| Fine-grained Image Captioning with CLIP Reward | May 26, 2022 | Caption Generation, Descriptive | Code Available | 2 |
| AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition | May 26, 2022 | Action Recognition, Video Recognition | Code Available | 2 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | Benchmarking, Efficient ViTs | Code Available | 2 |
| Inception Transformer | May 25, 2022 | image-classification, Image Classification | Code Available | 2 |
| Recipe for a General, Powerful, Scalable Graph Transformer | May 25, 2022 | Graph Classification, Graph Property Prediction | Code Available | 2 |
| Pretraining is All You Need for Image-to-Image Translation | May 25, 2022 | All, Image-to-Image Translation | Code Available | 2 |
| RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning | May 25, 2022 | reinforcement-learning, Reinforcement Learning | Code Available | 2 |
| Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation | May 25, 2022 | Cross-Lingual Transfer, Machine Translation | Code Available | 2 |
| Non-rigid Point Cloud Registration with Neural Deformation Pyramid | May 25, 2022 | Point Cloud Registration | Code Available | 2 |
| Perturbation Augmentation for Fairer NLP | May 25, 2022 | Fairness | Code Available | 2 |
| QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs | May 25, 2022 | Answer Generation, Natural Questions | Code Available | 2 |
| Neural 3D Reconstruction in the Wild | May 25, 2022 | 3D Reconstruction, Surface Reconstruction | Code Available | 2 |
| OnePose: One-Shot Object Pose Estimation without CAD Models | May 24, 2022 | 6D Pose Estimation, Graph Attention | Code Available | 2 |
| Large Language Models are Zero-Shot Reasoners | May 24, 2022 | Arithmetic Reasoning, Common Sense Reasoning | Code Available | 2 |
| recommenderlab: An R Framework for Developing and Testing Recommendation Algorithms | May 24, 2022 | Recommendation Systems | Code Available | 2 |
| Unsupervised Misaligned Infrared and Visible Image Fusion via Cross-Modality Image Generation and Registration | May 24, 2022 | Image Generation, Infrared And Visible Image Fusion | Code Available | 2 |
| Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images | May 24, 2022 | 3D geometry, Depth Estimation | Code Available | 2 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | Decoder, Information Retrieval | Code Available | 2 |
| Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding | May 23, 2022 | | Code Available | 2 |
| Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation | May 23, 2022 | Pose Estimation, Pose Tracking | Code Available | 2 |
| BBTv2: Towards a Gradient-Free Future with Large Language Models | May 23, 2022 | Few-Shot Learning, Language Modelling | Code Available | 2 |
| GraphMAE: Self-Supervised Masked Graph Autoencoders | May 22, 2022 | Contrastive Learning, Graph Classification | Code Available | 2 |
| Vision-based Anti-UAV Detection and Tracking | May 22, 2022 | | Code Available | 2 |
| Structured Attention Composition for Temporal Action Localization | May 20, 2022 | Action Detection, Action Localization | Code Available | 2 |
| A Review of Safe Reinforcement Learning: Methods, Theory and Applications | May 20, 2022 | Autonomous Driving, Decision Making | Code Available | 2 |
| Towards Explanation for Unsupervised Graph-Level Representation Learning | May 20, 2022 | Decision Making, Graph Classification | Code Available | 2 |
| MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | May 19, 2022 | Denoising, Prediction | Code Available | 2 |
| BARS: Towards Open Benchmarking for Recommender Systems | May 19, 2022 | Benchmarking, Click-Through Rate Prediction | Code Available | 2 |
| BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving | May 19, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 |