| PyPOTS: A Python Toolbox for Data Mining on Partially-Observed Time Series | May 30, 2023 | Anomaly DetectionClassification | CodeCode Available | 2 |
| StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation | May 30, 2023 | 3D GenerationAttribute | CodeCode Available | 2 |
| Cones 2: Customizable Image Synthesis with Multiple Subjects | May 30, 2023 | Image Generation | CodeCode Available | 2 |
| Functional-Group-Based Diffusion for Pocket-Specific Molecule Generation and Elaboration | May 30, 2023 | Drug Design | CodeCode Available | 2 |
| Blockwise Parallel Transformer for Large Context Models | May 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving | May 30, 2023 | 3D Object Detection3D Scene Reconstruction | CodeCode Available | 2 |
| HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance | May 30, 2023 | 3D Generation3D geometry | CodeCode Available | 2 |
| Conditional Diffusion Models for Semantic 3D Brain MRI Synthesis | May 29, 2023 | Data AugmentationImage Generation | CodeCode Available | 2 |
| Explicit Visual Prompting for Universal Foreground Segmentations | May 29, 2023 | Camouflaged Object SegmentationDefocus Blur Detection | CodeCode Available | 2 |
| TaleCrafter: Interactive Story Visualization with Multiple Characters | May 29, 2023 | Image GenerationLayout Generation | CodeCode Available | 2 |
| GlyphControl: Glyph Conditional Control for Visual Text Generation | May 29, 2023 | Optical Character Recognition (OCR)Text Generation | CodeCode Available | 2 |
| Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising | May 29, 2023 | DenoisingImage Generation | CodeCode Available | 2 |
| 3DTeethSeg'22: 3D Teeth Scan Segmentation and Labeling Challenge | May 29, 2023 | AnatomySegmentation | CodeCode Available | 2 |
| BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages | May 29, 2023 | Machine TranslationTranslation | CodeCode Available | 2 |
| VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset | May 29, 2023 | Audio captioningAudio-Visual Captioning | CodeCode Available | 2 |
| Contextual Object Detection with Multimodal Large Language Models | May 29, 2023 | Cloze TestDecoder | CodeCode Available | 2 |
| Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models | May 29, 2023 | Attribute | CodeCode Available | 2 |
| Multiscale Positive-Unlabeled Detection of AI-Generated Texts | May 29, 2023 | Language Modellingtext-classification | CodeCode Available | 2 |
| 4DRadarSLAM: A 4D Imaging Radar SLAM System for Large-scale Environments based on Pose Graph Optimization | May 29, 2023 | | CodeCode Available | 2 |
| Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors | May 29, 2023 | Contrastive LearningImage Reconstruction | CodeCode Available | 2 |
| KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application | May 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine Collaboration | May 28, 2023 | Response Generation | CodeCode Available | 2 |
| Dink-Net: Neural Clustering on Large Graphs | May 28, 2023 | ClusteringGraph Clustering | CodeCode Available | 2 |
| NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images | May 27, 2023 | Neural RenderingObject | CodeCode Available | 2 |
| Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference | May 27, 2023 | GPUImage Generation | CodeCode Available | 2 |
| APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD | May 27, 2023 | Anomaly ClassificationAnomaly Detection | CodeCode Available | 2 |
| SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks | May 27, 2023 | Decoder | CodeCode Available | 2 |
| NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models | May 26, 2023 | Instruction FollowingVision and Language Navigation | CodeCode Available | 2 |
| SSSegmenation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch | May 26, 2023 | Image SegmentationSegmentation | CodeCode Available | 2 |
| Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs) | May 26, 2023 | BenchmarkingBrain Tumor Segmentation | CodeCode Available | 2 |
| ControlVideo: Conditional Control for One-shot Text-driven Video Editing and Beyond | May 26, 2023 | Text-to-Video EditingVideo Editing | CodeCode Available | 2 |
| On Evaluating Adversarial Robustness of Large Vision-Language Models | May 26, 2023 | Adversarial Robustnessmultimodal generation | CodeCode Available | 2 |
| BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks | May 26, 2023 | Image CaptioningMedical Visual Question Answering | CodeCode Available | 2 |
| Generating Images with Multimodal Language Models | May 26, 2023 | DecoderImage Generation | CodeCode Available | 2 |
| Training Socially Aligned Language Models on Simulated Social Interactions | May 26, 2023 | | CodeCode Available | 2 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPUGPU | CodeCode Available | 2 |
| DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion | May 25, 2023 | DenoisingStyle Transfer | CodeCode Available | 2 |
| Scaling Data-Constrained Language Models | May 25, 2023 | | CodeCode Available | 2 |
| IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages | May 25, 2023 | AllMachine Translation | CodeCode Available | 2 |
| Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory | May 25, 2023 | Common Sense ReasoningCPU | CodeCode Available | 2 |
| BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion | May 25, 2023 | DreamBooth Personalized GenerationImage-to-Image Translation | CodeCode Available | 2 |
| Break-A-Scene: Extracting Multiple Concepts from a Single Image | May 25, 2023 | Complex Scene Breaking and Synthesis | CodeCode Available | 2 |
| On the Planning Abilities of Large Language Models : A Critical Investigation | May 25, 2023 | | CodeCode Available | 2 |
| MixFormerV2: Efficient Fully Transformer Tracking | May 25, 2023 | CPUGPU | CodeCode Available | 2 |
| An AI-Ready Multiplex Staining Dataset for Reproducible and Accurate Characterization of Tumor Immune Microenvironment | May 25, 2023 | Style Transfer | CodeCode Available | 2 |
| Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models | May 25, 2023 | All | CodeCode Available | 2 |
| Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models | May 25, 2023 | Conditional Text-to-Image SynthesisImage Generation | CodeCode Available | 2 |
| PandaGPT: One Model To Instruction-Follow Them All | May 25, 2023 | AllImage Description | CodeCode Available | 2 |
| Anomaly Detection with Conditioned Denoising Diffusion Models | May 25, 2023 | Anomaly DetectionDenoising | CodeCode Available | 2 |