| Generative Agents: Interactive Simulacra of Human Behavior | Apr 7, 2023 | Language Modelling, Large Language Model | Code Available | 6 | 5 |
| SqueezeLLM: Dense-and-Sparse Quantization | Jun 13, 2023 | GPU, Quantization | Code Available | 6 | 5 |
| Versatile Diffusion: Text, Images and Variations All in One Diffusion Model | Nov 15, 2022 | All, Disentanglement | Code Available | 6 | 5 |
| Dynamic Datasets and Market Environments for Financial Reinforcement Learning | Apr 25, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 6 | 5 |
| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Nov 22, 2022 | Image Animation, Talking Head Generation | Code Available | 6 | 5 |
| Efficient Memory Management for Large Language Model Serving with PagedAttention | Sep 12, 2023 | Language Modeling, Language Modelling | Code Available | 6 | 5 |
| A Method for Animating Children's Drawings of the Human Figure | Mar 7, 2023 | Image to Video Generation | Code Available | 6 | 5 |
| CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society | Mar 31, 2023 | Instruction Following, Language Modeling | Code Available | 6 | 5 |
| ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Dec 13, 2022 | Code Summarization, Language Modeling | Code Available | 6 | 5 |
| Pseudo Numerical Methods for Diffusion Models on Manifolds | Feb 20, 2022 | Denoising, Image Generation | Code Available | 6 | 5 |
| Efficient Guided Generation for Large Language Models | Jul 19, 2023 | Language Modelling, Text Generation | Code Available | 6 | 5 |
| Semi-Parametric Neural Image Synthesis | Apr 25, 2022 | Image Generation, Retrieval | Code Available | 6 | 5 |
| Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond | Apr 26, 2023 | Language Modelling, Natural Language Understanding | Code Available | 6 | 5 |
| Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Aug 23, 2023 | Image Generation, Image to text | Code Available | 6 | 5 |
| Qwen Technical Report | Sep 28, 2023 | Language Modeling, Language Modelling | Code Available | 6 | 5 |
| SGLang: Efficient Execution of Structured Language Model Programs | Dec 12, 2023 | Few-Shot Learning, Language Modeling | Code Available | 6 | 5 |
| StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation | Dec 19, 2023 | Denoising, Image Generation | Code Available | 6 | 5 |
| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | May 27, 2022 | 16k, 4k | Code Available | 6 | 5 |
| Quantized Training of Gradient Boosting Decision Trees | Jul 20, 2022 | Quantization | Code Available | 6 | 5 |
| Automatic Chain of Thought Prompting in Large Language Models | Oct 7, 2022 | Diversity, Question Answering | Code Available | 6 | 5 |
| A Survey of Large Language Models | Mar 31, 2023 | Language Modeling, Language Modelling | Code Available | 6 | 5 |
| Petals: Collaborative Inference and Fine-tuning of Large Models | Sep 2, 2022 | Collaborative Inference | Code Available | 6 | 5 |
| Continual Pre-Training of Large Language Models: How to (re)warm your model? | Aug 8, 2023 | Language Modelling | Code Available | 6 | 5 |
| Synthetic Dataset Generation for Adversarial Machine Learning Research | Jul 21, 2022 | BIG-bench Machine Learning, Dataset Generation | Code Available | 6 | 5 |
| ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech | Nov 7, 2022 | Representation Learning, Speech Representation Learning | Code Available | 6 | 5 |
| AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head | Apr 25, 2023 | | Code Available | 6 | 5 |
| 3D Gaussian Splatting for Real-Time Radiance Field Rendering | Aug 8, 2023 | Camera Calibration, Novel View Synthesis | Code Available | 6 | 5 |
| Code Llama: Open Foundation Models for Code | Aug 24, 2023 | 16k, Code Generation | Code Available | 6 | 5 |
| Adversarial Diffusion Distillation | Nov 28, 2023 | Image Generation | Code Available | 6 | 5 |
| Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Mar 22, 2023 | Arithmetic Reasoning, Mathematical Reasoning | Code Available | 6 | 5 |
| Shap-E: Generating Conditional 3D Implicit Functions | May 3, 2023 | | Code Available | 6 | 5 |
| ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models | Sep 2, 2023 | | Code Available | 6 | 5 |
| The Dormant Neuron Phenomenon in Deep Reinforcement Learning | Feb 24, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 6 | 5 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Dec 1, 2023 | 2D Pose Estimation, Common Sense Reasoning | Code Available | 6 | 5 |
| What's Behind the Mask: Understanding Masked Graph Modeling for Graph Autoencoders | May 20, 2022 | Contrastive Learning, Link Prediction | Code Available | 6 | 5 |
| RET-LLM: Towards a General Read-Write Memory for Large Language Models | May 23, 2023 | Question Answering | Code Available | 6 | 5 |
| TensorIR: An Abstraction for Automatic Tensorized Program Optimization | Jul 9, 2022 | BIG-bench Machine Learning, Deep Learning | Code Available | 6 | 5 |
| Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models | Jul 26, 2022 | Image Generation, Prompt Engineering | Code Available | 6 | 5 |
| LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models | Sep 21, 2023 | 4k, GPU | Code Available | 6 | 5 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability prediction, Arithmetic Reasoning | Code Available | 6 | 5 |
| DINOv2: Learning Robust Visual Features without Supervision | Apr 14, 2023 | Depth Estimation, Domain Generalization | Code Available | 6 | 5 |
| GLM-130B: An Open Bilingual Pre-trained Model | Oct 5, 2022 | Language Modeling, Language Modelling | Code Available | 6 | 5 |
| Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Jan 28, 2022 | Common Sense Reasoning, GSM8K | Code Available | 6 | 5 |
| h2oGPT: Democratizing Large Language Models | Jun 13, 2023 | Chatbot, Fairness | Code Available | 6 | 5 |
| NEFTune: Noisy Embeddings Improve Instruction Finetuning | Oct 9, 2023 | Language Modeling, Language Modelling | Code Available | 6 | 5 |
| Improved Baselines with Visual Instruction Tuning | Oct 5, 2023 | Factual Inconsistency Detection in Chart Captioning, Image Classification | Code Available | 6 | 5 |
| An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models | Sep 18, 2023 | Visual Question Answering | Code Available | 6 | 5 |
| AutoGluon-TimeSeries: AutoML for Probabilistic Time Series Forecasting | Aug 10, 2023 | AutoML, Philosophy | Code Available | 6 | 5 |
| QLoRA: Efficient Finetuning of Quantized LLMs | May 23, 2023 | Chatbot, GPU | Code Available | 6 | 5 |
| FinGPT: Open-Source Financial Large Language Models | Jun 9, 2023 | Algorithmic Trading, Language Modeling | Code Available | 6 | 5 |