| Reference-based Video Super-Resolution Using Multi-Camera Video Triplets | Mar 28, 2022 | Reference-based Video Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| Video Polyp Segmentation: A Deep Learning Perspective | Mar 27, 2022 | AttributeDeep Learning | CodeCode Available | 2 |
| MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering | Mar 27, 2022 | DiversityMultiple-choice | CodeCode Available | 2 |
| Deep Hierarchical Semantic Segmentation | Mar 27, 2022 | Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION | CodeCode Available | 2 |
| DeepDPM: Deep Clustering With an Unknown Number of Clusters | Mar 27, 2022 | ClusteringDeep Clustering | CodeCode Available | 2 |
| Implementation of an Automated Learning System for Non-experts | Mar 26, 2022 | BIG-bench Machine LearningManagement | CodeCode Available | 2 |
| FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset | Mar 26, 2022 | 2k3D Face Reconstruction | CodeCode Available | 2 |
| Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion | Mar 25, 2022 | DiversityPedestrian Trajectory Prediction | CodeCode Available | 2 |
| JAX-FLUIDS: A fully-differentiable high-order computational fluid dynamics solver for compressible two-phase flows | Mar 25, 2022 | | CodeCode Available | 2 |
| Frame-level Prediction of Facial Expressions, Valence, Arousal and Action Units for Mobile Devices | Mar 25, 2022 | Arousal EstimationEmotion Recognition | CodeCode Available | 2 |
| BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis | Mar 25, 2022 | Image GenerationSpeech Synthesis | CodeCode Available | 2 |
| Risk-Aware Off-Road Navigation via a Learned Speed Distribution Map | Mar 25, 2022 | Motion PlanningUnity | CodeCode Available | 2 |
| Continual Test-Time Domain Adaptation | Mar 25, 2022 | Domain AdaptationTest-time Adaptation | CodeCode Available | 2 |
| Rank-based Non-dominated Sorting | Mar 25, 2022 | Evolutionary Algorithms | CodeCode Available | 2 |
| Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Mar 25, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| Global Tracking Transformers | Mar 24, 2022 | Multi-Object TrackingObject | CodeCode Available | 2 |
| Domino: Discovering Systematic Errors with Cross-Modal Embeddings | Mar 24, 2022 | Representation LearningSlice Discovery | CodeCode Available | 2 |
| Recommendation as Language Processing (RLP): A Unified Pretrain, Personalized Prompt & Predict Paradigm (P5) | Mar 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection | Mar 24, 2022 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |
| CLIP-Mesh: Generating textured meshes from text using pretrained image-text models | Mar 24, 2022 | | CodeCode Available | 2 |
| Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory | Mar 24, 2022 | Motion Synthesis | CodeCode Available | 2 |
| Sparse Instance Activation for Real-Time Instance Segmentation | Mar 24, 2022 | Instance SegmentationObject | CodeCode Available | 2 |
| Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors | Mar 24, 2022 | Image GenerationSemantic Segmentation | CodeCode Available | 2 |
| pyABC: Efficient and robust easy-to-use approximate Bayesian computation | Mar 24, 2022 | | CodeCode Available | 2 |
| Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis | Mar 24, 2022 | DenoisingImage Denoising | CodeCode Available | 2 |
| BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training | Mar 24, 2022 | Objectobject-detection | CodeCode Available | 2 |
| Neural Neighbor Style Transfer | Mar 24, 2022 | Style Transfer | CodeCode Available | 2 |
| Learning to generate line drawings that convey geometry and semantics | Mar 23, 2022 | Image-to-Image TranslationTranslation | CodeCode Available | 2 |
| FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement | Mar 23, 2022 | Speech Enhancement | CodeCode Available | 2 |
| Real-time Object Detection for Streaming Perception | Mar 23, 2022 | Autonomous DrivingObject | CodeCode Available | 2 |
| UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection | Mar 23, 2022 | DecoderHighlight Detection | CodeCode Available | 2 |
| Unified Structure Generation for Universal Information Extraction | Mar 23, 2022 | Aspect-Based Sentiment Analysis (ABSA)UIE | CodeCode Available | 2 |
| MONAI Label: A framework for AI-assisted Interactive Labeling of 3D Medical Images | Mar 23, 2022 | Active Learning | CodeCode Available | 2 |
| M-SENA: An Integrated Platform for Multimodal Sentiment Analysis | Mar 23, 2022 | ManagementMultimodal Sentiment Analysis | CodeCode Available | 2 |
| ThingTalk: An Extensible, Executable Representation Language for Task-Oriented Dialogues | Mar 23, 2022 | Semantic Parsing | CodeCode Available | 2 |
| R3M: A Universal Visual Representation for Robot Manipulation | Mar 23, 2022 | Contrastive LearningRobot Manipulation | CodeCode Available | 2 |
| Dataset Distillation by Matching Training Trajectories | Mar 22, 2022 | Dataset DistillationDataset Distillation - 1IPC | CodeCode Available | 2 |
| Scalable Video Object Segmentation with Identification Mechanism | Mar 22, 2022 | ObjectSegmentation | CodeCode Available | 2 |
| Practical tradeoffs between memory, compute, and performance in learned optimizers | Mar 22, 2022 | | CodeCode Available | 2 |
| Learning from All Vehicles | Mar 22, 2022 | AllAutonomous Driving | CodeCode Available | 2 |
| Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Mar 22, 2022 | Object TrackingRelation | CodeCode Available | 2 |
| TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers | Mar 22, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions | Mar 22, 2022 | Vision and Language Navigation | CodeCode Available | 2 |
| Focal Modulation Networks | Mar 22, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Safety of Sampled-Data Systems with Control Barrier Functions via Approximate Discrete Time Models | Mar 22, 2022 | | CodeCode Available | 2 |
| Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation | Mar 22, 2022 | Stereo Matching | CodeCode Available | 2 |
| CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training | Mar 22, 2022 | DecoderImage Inpainting | CodeCode Available | 2 |
| Open-Vocabulary DETR with Conditional Matching | Mar 22, 2022 | Language Modellingobject-detection | CodeCode Available | 2 |
| Hyperbolic Vision Transformers: Combining Improvements in Metric Learning | Mar 21, 2022 | Metric Learning | CodeCode Available | 2 |
| PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark | Mar 21, 2022 | 3D Lane DetectionAutonomous Driving | CodeCode Available | 2 |