| Exploiting Diffusion Prior for Real-World Image Super-Resolution | May 11, 2023 | Blind Super-Resolution, Image Super-Resolution | Code Available | 4 |
| VideoChat: Chat-Centric Video Understanding | May 10, 2023 | Question Answering, Video-based Generative Performance Benchmarking | Code Available | 4 |
| InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language | May 9, 2023 | Language Modelling | Code Available | 4 |
| Otter: A Multi-Modal Model with In-Context Instruction Tuning | May 5, 2023 | GPU, In-Context Learning | Code Available | 4 |
| Contextual Multilingual Spellchecker for User Queries | May 1, 2023 | | Code Available | 4 |
| The Ideal Continual Learner: An Agent That Never Forgets | Apr 29, 2023 | Continual Learning, Generalization Bounds | Code Available | 4 |
| Towards Automated Circuit Discovery for Mechanistic Interpretability | Apr 28, 2023 | | Code Available | 4 |
| mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality | Apr 27, 2023 | Visual Question Answering (VQA), Zero-Shot Video Question Answer | Code Available | 4 |
| Segment Anything in Medical Images | Apr 24, 2023 | Diagnostic, Image Segmentation | Code Available | 4 |
| Phoenix: Democratizing ChatGPT across Languages | Apr 20, 2023 | Language Modeling, Language Modelling | Code Available | 4 |
| Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert | Apr 18, 2023 | Audio Generation, Expressive Speech Synthesis | Code Available | 4 |
| pgmpy: A Python Toolkit for Bayesian Networks | Apr 17, 2023 | Causal Discovery, Causal Identification | Code Available | 4 |
| HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge | Apr 14, 2023 | model | Code Available | 4 |
| OpenAGI: When LLM Meets Domain Experts | Apr 10, 2023 | Benchmarking, Natural Language Queries | Code Available | 4 |
| Instruction Tuning with GPT-4 | Apr 6, 2023 | Instruction Following | Code Available | 4 |
| SegGPT: Segmenting Everything In Context | Apr 6, 2023 | Few-Shot Semantic Segmentation, In-Context Learning | Code Available | 4 |
| Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster | Apr 6, 2023 | | Code Available | 4 |
| Vision-Language Models for Vision Tasks: A Survey | Apr 3, 2023 | Benchmarking, Knowledge Distillation | Code Available | 4 |
| Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data | Apr 3, 2023 | Chatbot, Language Modeling | Code Available | 4 |
| Token Merging for Fast Stable Diffusion | Mar 30, 2023 | Image Generation | Code Available | 4 |
| AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators | Mar 29, 2023 | Information Retrieval, Retrieval | Code Available | 4 |
| InceptionNeXt: When Inception Meets ConvNeXt | Mar 29, 2023 | Image Classification, Semantic Segmentation | Code Available | 4 |
| ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks | Mar 27, 2023 | text annotation, Text Classification | Code Available | 4 |
| ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Mar 24, 2023 | Information Retrieval, Language Modeling | Code Available | 4 |
| Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators | Mar 23, 2023 | Image Generation, Text-to-Video Generation | Code Available | 4 |
| Real-time volumetric rendering of dynamic humans | Mar 21, 2023 | 3D Reconstruction, GPU | Code Available | 4 |
| FedML-HE: An Efficient Homomorphic-Encryption-Based Privacy-Preserving Federated Learning System | Mar 20, 2023 | Federated Learning, Privacy Preserving | Code Available | 4 |
| Reflexion: Language Agents with Verbal Reinforcement Learning | Mar 20, 2023 | Decision Making, HumanEval | Code Available | 4 |
| Zero-1-to-3: Zero-shot One Image to 3D Object | Mar 20, 2023 | 3D Reconstruction, Image to 3D | Code Available | 4 |
| Data-centric Artificial Intelligence: A Survey | Mar 17, 2023 | Survey | Code Available | 4 |
| VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation | Mar 15, 2023 | Code Generation, Denoising | Code Available | 4 |
| Eliciting Latent Predictions from Transformers with the Tuned Lens | Mar 14, 2023 | Language Modelling | Code Available | 4 |
| A Convergent Single-Loop Algorithm for Relaxation of Gromov-Wasserstein in Graph Data | Mar 12, 2023 | Computational Efficiency | Code Available | 4 |
| Tag2Text: Guiding Vision-Language Model via Image Tagging | Mar 10, 2023 | Language Modeling, Language Modelling | Code Available | 4 |
| Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference | Mar 8, 2023 | Hyperparameter Optimization, Language Modeling | Code Available | 4 |
| FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical Training | Mar 3, 2023 | Federated Learning, GPU | Code Available | 4 |
| Aligning benchmark datasets for table structure recognition | Mar 1, 2023 | Table Detection, Table Recognition | Code Available | 4 |
| Structured Pruning for Deep Convolutional Neural Networks: A survey | Mar 1, 2023 | Network Pruning, Neural Architecture Search | Code Available | 4 |
| Memory-aided Contrastive Consensus Learning for Co-salient Object Detection | Feb 28, 2023 | Co-Salient Object Detection, object-detection | Code Available | 4 |
| Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection | Feb 23, 2023 | Code Completion, Computer Security | Code Available | 4 |
| AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving | Feb 22, 2023 | Deep Learning | Code Available | 4 |
| ChatGPT for Robotics: Design Principles and Model Abilities | Feb 20, 2023 | Mathematical Reasoning, Prompt Engineering | Code Available | 4 |
| Improving Training Stability for Multitask Ranking Models in Recommender Systems | Feb 17, 2023 | Recommendation Systems | Code Available | 4 |
| T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models | Feb 16, 2023 | Image Generation, Style Transfer | Code Available | 4 |
| 3D-aware Conditional Image Synthesis | Feb 16, 2023 | Image Generation | Code Available | 4 |
| SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes | Feb 13, 2023 | Information Retrieval, Retrieval | Code Available | 4 |
| An Extended Sequence Tagging Vocabulary for Grammatical Error Correction | Feb 12, 2023 | Grammatical Error Correction, Morphological Inflection | Code Available | 4 |
| Multimodal Chain-of-Thought Reasoning in Language Models | Feb 2, 2023 | Hallucination, Language Modelling | Code Available | 4 |
| Improving and generalizing flow-based generative models with minibatch optimal transport | Feb 1, 2023 | | Code Available | 4 |
| mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video | Feb 1, 2023 | Action Classification, Image Classification | Code Available | 4 |