| Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels | Feb 21, 2023 | Classification | Code Available | 3 |
| Composer: Creative and Controllable Image Synthesis with Composable Conditions | Feb 20, 2023 | Image Colorization, Image Generation | Code Available | 3 |
| MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation | Feb 16, 2023 | Image Generation, Text to Image Generation | Code Available | 3 |
| Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction | Feb 15, 2023 | 3D Semantic Scene Completion, Autonomous Driving | Code Available | 3 |
| Deep Neural Networks for Encrypted Inference with TFHE | Feb 13, 2023 | Privacy Preserving | Code Available | 3 |
| Privacy-Preserving Tree-Based Inference with TFHE | Feb 13, 2023 | Privacy Preserving | Code Available | 3 |
| MarioGPT: Open-Ended Text2Level Generation through Large Language Models | Feb 12, 2023 | | Code Available | 3 |
| Zero-shot Image-to-Image Translation | Feb 6, 2023 | Image-to-Image Translation, Text-based Image Editing | Code Available | 3 |
| The Flan Collection: Designing Data and Methods for Effective Instruction Tuning | Jan 31, 2023 | | Code Available | 3 |
| REPLUG: Retrieval-Augmented Black-Box Language Models | Jan 30, 2023 | Language Modeling, Language Modelling | Code Available | 3 |
| ThoughtSource: A central hub for large language model reasoning data | Jan 27, 2023 | Language Modeling, Language Modelling | Code Available | 3 |
| Cut and Learn for Unsupervised Object Detection and Instance Segmentation | Jan 26, 2023 | Instance Segmentation, object-detection | Code Available | 3 |
| Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning | Jan 26, 2023 | Benchmarking, Deep Reinforcement Learning | Code Available | 3 |
| SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning | Jan 26, 2023 | imbalanced classification | Code Available | 3 |
| StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis | Jan 23, 2023 | Image Generation, Text-to-Image Generation | Code Available | 3 |
| Champion Solution for the WSDM2023 Toloka VQA Challenge | Jan 22, 2023 | Question Answering, Visual Grounding | Code Available | 3 |
| MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer | Jan 19, 2023 | Image Generation, Image Segmentation | Code Available | 3 |
| Data-centric AI: Perspectives and Challenges | Jan 12, 2023 | | Code Available | 3 |
| Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | Jan 9, 2023 | 2D Object Detection, Contrastive Learning | Code Available | 3 |
| Cross Modal Transformer: Towards Fast and Robust 3D Object Detection | Jan 3, 2023 | 3D Object Detection, object-detection | Code Available | 3 |
| ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders | Jan 2, 2023 | Object Detection, Representation Learning | Code Available | 3 |
| PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Jan 1, 2023 | Image Generation, Image Segmentation | Code Available | 3 |
| High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Jan 1, 2023 | Denoising, Image Reconstruction | Code Available | 3 |
| Cramming: Training a Language Model on a Single GPU in One Day | Dec 28, 2022 | GPU, Language Modeling | Code Available | 3 |
| The Forward-Forward Algorithm: Some Preliminary Investigations | Dec 27, 2022 | | Code Available | 3 |
| AER: Auto-Encoder with Regression for Time Series Anomaly Detection | Dec 27, 2022 | Anomaly Detection, Benchmarking | Code Available | 3 |
| TextBox 2.0: A Text Generation Library with Pre-trained Language Models | Dec 26, 2022 | Abstractive Text Summarization, Data-to-Text Generation | Code Available | 3 |
| DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders | Dec 22, 2022 | Colorization, Decoder | Code Available | 3 |
| Generalized Decoding for Pixel, Image, and Language | Dec 21, 2022 | Decoder, Image Segmentation | Code Available | 3 |
| Reasoning with Language Model Prompting: A Survey | Dec 19, 2022 | Arithmetic Reasoning, Common Sense Reasoning | Code Available | 3 |
| Discovering Language Model Behaviors with Model-Written Evaluations | Dec 19, 2022 | Language Modeling, Language Modelling | Code Available | 3 |
| Rethinking Vision Transformers for MobileNet Size and Speed | Dec 15, 2022 | | Code Available | 3 |
| ECON: Explicit Clothed humans Optimized via Normal integration | Dec 14, 2022 | 3D Human Reconstruction, Surface Reconstruction | Code Available | 3 |
| RT-1: Robotics Transformer for Real-World Control at Scale | Dec 13, 2022 | Diversity, Robot Manipulation | Code Available | 3 |
| DifFace: Blind Face Restoration with Diffused Error Contraction | Dec 13, 2022 | Blind Face Restoration | Code Available | 3 |
| A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic, and Multimodal | Dec 12, 2022 | General Knowledge, Graph Embedding | Code Available | 3 |
| Prompting Is Programming: A Query Language for Large Language Models | Dec 12, 2022 | Code Generation, Language Modeling | Code Available | 3 |
| Multi-Concept Customization of Text-to-Image Diffusion | Dec 8, 2022 | Diffusion Personalization | Code Available | 3 |
| ViTPose++: Vision Transformer for Generic Body Pose Estimation | Dec 7, 2022 | 2D Human Pose Estimation, Animal Pose Estimation | Code Available | 3 |
| Unifying Vision, Text, and Layout for Universal Document Processing | Dec 5, 2022 | Document AI, document understanding | Code Available | 3 |
| BEVPoolv2: A Cutting-edge Implementation of BEVDet Toward Deployment | Nov 30, 2022 | | Code Available | 3 |
| MegaBlocks: Efficient Sparse Training with Mixture-of-Experts | Nov 29, 2022 | GPU, Mixture-of-Experts | Code Available | 3 |
| Open-Source Skull Reconstruction with MONAI | Nov 25, 2022 | C++ code, Deep Learning | Code Available | 3 |
| Paint by Example: Exemplar-based Image Editing with Diffusion Models | Nov 23, 2022 | Image Generation, Image Manipulation | Code Available | 3 |
| DETRs with Collaborative Hybrid Assignments Training | Nov 22, 2022 | Decoder, Instance Segmentation | Code Available | 3 |
| Human-level play in the game of Diplomacy by combining language models with strategic reasoning | Nov 22, 2022 | AI Agent, Language Modeling | Code Available | 3 |
| Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks | Nov 22, 2022 | Math | Code Available | 3 |
| imitation: Clean Imitation Learning Implementations | Nov 22, 2022 | Imitation Learning, reinforcement-learning | Code Available | 3 |
| Adversarial Cheap Talk | Nov 20, 2022 | Meta-Learning, Reinforcement Learning (RL) | Code Available | 3 |
| PAL: Program-aided Language Models | Nov 18, 2022 | Arithmetic Reasoning, GSM8K | Code Available | 3 |