| Turning a CLIP Model into a Scene Text Detector | Feb 28, 2023 | Domain AdaptationScene Text Detection | CodeCode Available | 2 |
| HugNLP: A Unified and Comprehensive Library for Natural Language Processing | Feb 28, 2023 | | CodeCode Available | 2 |
| CHGNet: Pretrained universal neural network potential for charge-informed atomistic modeling | Feb 28, 2023 | Atomic ForcesGraph Neural Network | CodeCode Available | 2 |
| BakedSDF: Meshing Neural SDFs for Real-Time View Synthesis | Feb 28, 2023 | Novel View Synthesis | CodeCode Available | 2 |
| RoPAWS: Robust Semi-supervised Representation Learning from Uncurated Data | Feb 28, 2023 | Density Estimationimage-classification | CodeCode Available | 2 |
| A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images | Feb 28, 2023 | 3D Face ReconstructionDisentanglement | CodeCode Available | 2 |
| PyReason: Software for Open World Temporal Logic | Feb 27, 2023 | Knowledge Graphs | CodeCode Available | 2 |
| OccDepth: A Depth-Aware Method for 3D Semantic Scene Completion | Feb 27, 2023 | 3D geometry3D Semantic Scene Completion | CodeCode Available | 2 |
| SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks | Feb 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning | Feb 27, 2023 | Dense Video CaptioningLanguage Modeling | CodeCode Available | 2 |
| ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation | Feb 27, 2023 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| FedCLIP: Fast Generalization and Personalization for CLIP in Federated Learning | Feb 27, 2023 | Federated LearningPrivacy Preserving | CodeCode Available | 2 |
| Reward Design with Language Models | Feb 27, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Inseq: An Interpretability Toolkit for Sequence Generation Models | Feb 27, 2023 | DecoderFeature Importance | CodeCode Available | 2 |
| Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution | Feb 27, 2023 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| Pillar R-CNN for Point Cloud 3D Object Detection | Feb 26, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting | Feb 25, 2023 | Motion Planning | CodeCode Available | 2 |
| PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS | Feb 24, 2023 | Decodertext-to-speech | CodeCode Available | 2 |
| Decoupling Human and Camera Motion from Videos in the Wild | Feb 24, 2023 | | CodeCode Available | 2 |
| Language-Driven Representation Learning for Robotics | Feb 24, 2023 | Contrastive LearningImitation Learning | CodeCode Available | 2 |
| Towards Stable Test-Time Adaptation in Dynamic Wild World | Feb 24, 2023 | Test-time Adaptation | CodeCode Available | 2 |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Feb 23, 2023 | Language ModellingOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Active Prompting with Chain-of-Thought for Large Language Models | Feb 23, 2023 | Active LearningZero-Shot Learning | CodeCode Available | 2 |
| Learning stiff chemical kinetics using extended deep neural operators | Feb 23, 2023 | Unity | CodeCode Available | 2 |
| One Fits All:Power General Time Series Analysis by Pretrained LM | Feb 23, 2023 | Anomaly DetectionFew-Shot Learning | CodeCode Available | 2 |
| Language Model Crossover: Variation through Few-Shot Prompting | Feb 23, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models | Feb 23, 2023 | DenoisingNeRF | CodeCode Available | 2 |
| Fusing Visual Appearance and Geometry for Multi-modality 6DoF Object Tracking | Feb 22, 2023 | 3D Object Tracking6D Pose Estimation | CodeCode Available | 2 |
| Learning to Generalize Provably in Learning to Optimize | Feb 22, 2023 | | CodeCode Available | 2 |
| Assessment of Reinforcement Learning for Macro Placement | Feb 21, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| Multi-Modal Self-Supervised Learning for Recommendation | Feb 21, 2023 | Contrastive LearningData Augmentation | CodeCode Available | 2 |
| SF2Former: Amyotrophic Lateral Sclerosis Identification From Multi-center MRI Data Using Spatial and Frequency Fusion Transformer | Feb 21, 2023 | Deep LearningMedical Image Analysis | CodeCode Available | 2 |
| PC^2: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction | Feb 21, 2023 | 3D ReconstructionDenoising | CodeCode Available | 2 |
| EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images | Feb 21, 2023 | | CodeCode Available | 2 |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Feb 21, 2023 | 2k8k | CodeCode Available | 2 |
| Towards Universal Fake Image Detectors that Generalize Across Generative Models | Feb 20, 2023 | ClassificationLanguage Modeling | CodeCode Available | 2 |
| Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot | Feb 20, 2023 | Efficient Explorationreinforcement-learning | CodeCode Available | 2 |
| Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey | Feb 20, 2023 | Survey | CodeCode Available | 2 |
| A Large Scale Homography Benchmark | Feb 20, 2023 | Homography EstimationSurface Normal Estimation | CodeCode Available | 2 |
| ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPT | Feb 20, 2023 | Event Extractionnamed-entity-recognition | CodeCode Available | 2 |
| Social4Rec: Distilling User Preference from Social Graph for Video Recommendation in Tencent | Feb 20, 2023 | Knowledge DistillationRecommendation Systems | CodeCode Available | 2 |
| Growing Steerable Neural Cellular Automata | Feb 19, 2023 | | CodeCode Available | 2 |
| MedViT: A Robust Vision Transformer for Generalized Medical Image Classification | Feb 19, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| RETVec: Resilient and Efficient Text Vectorizer | Feb 18, 2023 | Adversarial TextMetric Learning | CodeCode Available | 2 |
| BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark | Feb 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models | Feb 17, 2023 | DiagnosticTime Series | CodeCode Available | 2 |
| Trieste: Efficiently Exploring The Depths of Black-box Functions with TensorFlow | Feb 16, 2023 | Active LearningBayesian Optimization | CodeCode Available | 2 |
| LightGCL: Simple Yet Effective Graph Contrastive Learning for Recommendation | Feb 16, 2023 | Contrastive LearningData Augmentation | CodeCode Available | 2 |
| Parallax-Tolerant Unsupervised Deep Image Stitching | Feb 16, 2023 | Image RegistrationImage Stitching | CodeCode Available | 2 |
| DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization | Feb 16, 2023 | Combinatorial OptimizationDenoising | CodeCode Available | 2 |