| A Keypoint-based Global Association Network for Lane Detection | Apr 15, 2022 | Keypoint EstimationLane Detection | CodeCode Available | 2 |
| In-BoXBART: Get Instructions into Biomedical Multi-Task Learning | Apr 15, 2022 | Few-Shot LearningMulti-Task Learning | CodeCode Available | 2 |
| MVSTER: Epipolar Transformer for Efficient Multi-View Stereo | Apr 15, 2022 | | CodeCode Available | 2 |
| ResT V2: Simpler, Faster and Stronger | Apr 15, 2022 | Semantic Segmentation | CodeCode Available | 2 |
| Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation | Apr 15, 2022 | 3D Semantic SegmentationColorization | CodeCode Available | 2 |
| Neighborhood Attention Transformer | Apr 14, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Cross-Image Relational Knowledge Distillation for Semantic Segmentation | Apr 14, 2022 | Knowledge DistillationSegmentation | CodeCode Available | 2 |
| YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss | Apr 14, 2022 | Multi-Person Pose Estimationobject-detection | CodeCode Available | 2 |
| Accelerated Policy Learning with Parallel Differentiable Simulation | Apr 14, 2022 | Deep Reinforcement Learning | CodeCode Available | 2 |
| Masked Siamese Networks for Label-Efficient Learning | Apr 14, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Any-resolution Training for High-resolution Image Synthesis | Apr 14, 2022 | 2kImage Generation | CodeCode Available | 2 |
| Towards Metrical Reconstruction of Human Faces | Apr 13, 2022 | 2k3D Face Reconstruction | CodeCode Available | 2 |
| Neural Texture Extraction and Distribution for Controllable Person Image Synthesis | Apr 13, 2022 | Image Generation | CodeCode Available | 2 |
| Decomposed Meta-Learning for Few-Shot Named Entity Recognition | Apr 12, 2022 | Entity TypingFew-shot NER | CodeCode Available | 2 |
| DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection | Apr 12, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| AutoFi: Towards Automatic WiFi Human Sensing via Geometric Self-Supervised Learning | Apr 12, 2022 | Activity RecognitionDomain Adaptation | CodeCode Available | 2 |
| Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback | Apr 12, 2022 | Code GenerationOut of Distribution (OOD) Detection | CodeCode Available | 2 |
| Localization Distillation for Object Detection | Apr 12, 2022 | Knowledge DistillationObject | CodeCode Available | 2 |
| TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation | Apr 12, 2022 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| InCoder: A Generative Model for Code Infilling and Synthesis | Apr 12, 2022 | Code GenerationComment Generation | CodeCode Available | 2 |
| JORLDY: a fully customizable open source framework for reinforcement learning | Apr 11, 2022 | MuJoCoOpenAI Gym | CodeCode Available | 2 |
| On the Generalization of BasicVSR++ to Video Deblurring and Denoising | Apr 11, 2022 | DeblurringDenoising | CodeCode Available | 2 |
| Learning Local Equivariant Representations for Large-Scale Atomistic Dynamics | Apr 11, 2022 | Atomic Forces | CodeCode Available | 2 |
| DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning | Apr 10, 2022 | Continual Learning | CodeCode Available | 2 |
| Uncertainty-Informed Deep Learning Models Enable High-Confidence Predictions for Digital Histopathology | Apr 9, 2022 | AttributeUncertainty Quantification | CodeCode Available | 2 |
| Investigating Deep Learning Benchmarks for Electrocardiography Signal Processing | Apr 9, 2022 | Atrial Fibrillation DetectionDeep Learning | CodeCode Available | 2 |
| DeepLIIF: An Online Platform for Quantification of Clinical Pathology Slides | Apr 9, 2022 | GPU | CodeCode Available | 2 |
| Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories | Apr 8, 2022 | Motion EstimationObject Tracking | CodeCode Available | 2 |
| Learning Trajectory-Aware Transformer for Video Super-Resolution | Apr 8, 2022 | Super-ResolutionVideo deraining | CodeCode Available | 2 |
| ReservoirComputing.jl: An Efficient and Modular Library for Reservoir Computing Models | Apr 8, 2022 | | CodeCode Available | 2 |
| Sat2lod2: A Software For Automated Lod-2 Modeling From Satellite-Derived Orthophoto And Digital Surface Model | Apr 8, 2022 | Semantic Segmentation | CodeCode Available | 2 |
| Vision Transformers for Single Image Dehazing | Apr 8, 2022 | Image DehazingSingle Image Dehazing | CodeCode Available | 2 |
| Contrastive language and vision learning of general fashion concepts | Apr 8, 2022 | Contrastive LearningRetrieval | CodeCode Available | 2 |
| Deep Visual Geo-localization Benchmark | Apr 7, 2022 | BenchmarkingData Augmentation | CodeCode Available | 2 |
| SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation | Apr 7, 2022 | Autonomous DrivingDepth Estimation | CodeCode Available | 2 |
| Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer | Apr 7, 2022 | Video Generation | CodeCode Available | 2 |
| HIT-UAV: A high-altitude infrared thermal dataset for Unmanned Aerial Vehicle-based object detection | Apr 7, 2022 | Objectobject-detection | CodeCode Available | 2 |
| Video Diffusion Models | Apr 7, 2022 | Unconditional Video GenerationVideo Generation | CodeCode Available | 2 |
| Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results | Apr 7, 2022 | Image ClassificationKnowledge Distillation | CodeCode Available | 2 |
| DaViT: Dual Attention Vision Transformers | Apr 7, 2022 | Computational EfficiencyImage Classification | CodeCode Available | 2 |
| Unified Contrastive Learning in Image-Text-Label Space | Apr 7, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| DAD-3DHeads: A Large-scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image | Apr 7, 2022 | 3D ReconstructionDiversity | CodeCode Available | 2 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene ClassificationBuilding change detection for remote sensing images | CodeCode Available | 2 |
| Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction | Apr 6, 2022 | PredictionStock Prediction | CodeCode Available | 2 |
| ByT5 model for massively multilingual grapheme-to-phoneme conversion | Apr 6, 2022 | Grapheme-to-Phoneme Conversion | CodeCode Available | 2 |
| Aesthetic Text Logo Synthesis via Content-aware Layout Inferring | Apr 6, 2022 | Layout DesignLayout Generation | CodeCode Available | 2 |
| FocalClick: Towards Practical Interactive Image Segmentation | Apr 6, 2022 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| Fusing finetuned models for better pretraining | Apr 6, 2022 | | CodeCode Available | 2 |
| Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection | Apr 6, 2022 | Instance SegmentationObject | CodeCode Available | 2 |
| PaLM: Scaling Language Modeling with Pathways | Apr 5, 2022 | Auto DebuggingCode Generation | CodeCode Available | 2 |