| FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU | Mar 13, 2023 | CPUGPU | CodeCode Available | 5 |
| Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement | Mar 12, 2023 | Image EnhancementLow-light Image Deblurring and Enhancement | CodeCode Available | 5 |
| Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | Mar 9, 2023 | DecoderObject Detection | CodeCode Available | 5 |
| Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling | Mar 7, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 5 |
| Improved Differentially Private Regression via Gradient Boosting | Mar 6, 2023 | regression | CodeCode Available | 5 |
| Consistency Models | Mar 2, 2023 | ColorizationImage Generation | CodeCode Available | 5 |
| ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth | Feb 23, 2023 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 5 |
| RealFusion: 360° Reconstruction of Any Object from a Single Image | Feb 21, 2023 | 3D ReconstructionObject | CodeCode Available | 5 |
| EfficientRep:An Efficient Repvgg-style ConvNets with Hardware-aware Neural Network Design | Feb 1, 2023 | GPUobject-detection | CodeCode Available | 5 |
| Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions | Jan 20, 2023 | text-to-speechText to Speech | CodeCode Available | 5 |
| YOLOv6 v3.0: A Full-Scale Reloading | Jan 13, 2023 | GPUObject Detection | CodeCode Available | 5 |
| Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments | Jan 10, 2023 | GPUImitation Learning | CodeCode Available | 5 |
| SantaCoder: don't reach for the stars! | Jan 9, 2023 | Code GenerationPII Redaction | CodeCode Available | 5 |
| GraphCast: Learning skillful medium-range global weather forecasting | Dec 24, 2022 | Decision MakingWeather Forecasting | CodeCode Available | 5 |
| Self-Instruct: Aligning Language Models with Self-Generated Instructions | Dec 20, 2022 | Instruction FollowingLanguage Modelling | CodeCode Available | 5 |
| Scalable Diffusion Models with Transformers | Dec 19, 2022 | Image Generation | CodeCode Available | 5 |
| Point-E: A System for Generating 3D Point Clouds from Complex Prompts | Dec 16, 2022 | Generating 3D Point CloudsGPU | CodeCode Available | 5 |
| Fast Inference from Transformers via Speculative Decoding | Nov 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild | Nov 27, 2022 | Video EditingVideo Generation | CodeCode Available | 5 |
| A Time Series is Worth 64 Words: Long-term Forecasting with Transformers | Nov 27, 2022 | Multivariate Time Series ForecastingRepresentation Learning | CodeCode Available | 5 |
| A Brief Overview of AI Governance for Responsible Machine Learning Systems | Nov 21, 2022 | | CodeCode Available | 5 |
| InstructPix2Pix: Learning to Follow Image Editing Instructions | Nov 17, 2022 | Image Editing | CodeCode Available | 5 |
| Hybrid Transformers for Music Source Separation | Nov 15, 2022 | Music Source SeparationSpeech Enhancement | CodeCode Available | 5 |
| AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time | Nov 7, 2022 | Knowledge DistillationMulti-Person Pose Estimation | CodeCode Available | 5 |
| MONAI: An open-source framework for deep learning in healthcare | Nov 4, 2022 | Deep LearningMedical Image Classification | CodeCode Available | 5 |
| DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models | Nov 2, 2022 | Image GenerationText to Image Generation | CodeCode Available | 5 |
| Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Nov 2, 2022 | Contrastive Learningimage-classification | CodeCode Available | 5 |
| TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty | Nov 1, 2022 | | CodeCode Available | 5 |
| EBEN: Extreme bandwidth extension network applied to speech signals captured with noise-resilient body-conduction microphones | Oct 25, 2022 | Bandwidth ExtensionGenerative Adversarial Network | CodeCode Available | 5 |
| DreamFusion: Text-to-3D using 2D Diffusion | Sep 29, 2022 | DenoisingImage Generation | CodeCode Available | 5 |
| Deep Lake: a Lakehouse for Deep Learning | Sep 22, 2022 | Decision MakingDeep Learning | CodeCode Available | 5 |
| MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Sep 19, 2022 | DecoderImage Generation | CodeCode Available | 5 |
| Monolith: Real Time Recommendation System With Collisionless Embedding Table | Sep 16, 2022 | | CodeCode Available | 5 |
| YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications | Sep 7, 2022 | GPUObject Detection | CodeCode Available | 5 |
| DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | Aug 25, 2022 | Diffusion PersonalizationImage Generation | CodeCode Available | 5 |
| LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale | Aug 15, 2022 | GPULanguage Modelling | CodeCode Available | 5 |
| An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion | Aug 2, 2022 | Image GenerationPersonalized Image Generation | CodeCode Available | 5 |
| GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond | Jul 29, 2022 | ColorizationDecoder | CodeCode Available | 5 |
| DeepPhase: Periodic Autoencoders for Learning Motion Phase Manifolds | Jul 22, 2022 | Motion Synthesis | CodeCode Available | 5 |
| Robust Multivariate Time-Series Forecasting: Adversarial Attacks and Defense Mechanisms | Jul 19, 2022 | Adversarial AttackMultivariate Time Series Forecasting | CodeCode Available | 5 |
| TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second | Jul 5, 2022 | AutoMLBayesian Inference | CodeCode Available | 5 |
| Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values | Jun 30, 2022 | Additive modelsBIG-bench Machine Learning | CodeCode Available | 5 |
| Feature Refinement to Improve High Resolution Image Inpainting | Jun 27, 2022 | Image InpaintingVocal Bursts Intensity Prediction | CodeCode Available | 5 |
| EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine | Jun 21, 2022 | MuJoCoreinforcement-learning | CodeCode Available | 5 |
| DoWhy-GCM: An extension of DoWhy for causal inference in graphical causal models | Jun 14, 2022 | Causal Inference | CodeCode Available | 5 |
| On the reusability of samples in active learning | Jun 13, 2022 | Active Learning | CodeCode Available | 5 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Jun 9, 2022 | MisconceptionsSentence | CodeCode Available | 5 |
| MOSPAT: AutoML based Model Selection and Parameter Tuning for Time Series Anomaly Detection | May 24, 2022 | Anomaly DetectionAutoML | CodeCode Available | 5 |
| Can Foundation Models Wrangle Your Data? | May 20, 2022 | Entity ResolutionImputation | CodeCode Available | 5 |
| Vectorized and performance-portable Quicksort | May 12, 2022 | CPU | CodeCode Available | 5 |