| Dynamic Datasets and Market Environments for Financial Reinforcement Learning | Apr 25, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 6 |
| Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca | Apr 17, 2023 | | CodeCode Available | 6 |
| Visual Instruction Tuning | Apr 17, 2023 | 1 Image, 2*2 Stitching3D Question Answering (3D-QA) | CodeCode Available | 6 |
| DINOv2: Learning Robust Visual Features without Supervision | Apr 14, 2023 | Depth EstimationDomain Generalization | CodeCode Available | 6 |
| Generative Agents: Interactive Simulacra of Human Behavior | Apr 7, 2023 | Language ModellingLarge Language Model | CodeCode Available | 6 |
| Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling | Apr 3, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 6 |
| A Survey of Large Language Models | Mar 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society | Mar 31, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 6 |
| HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face | Mar 30, 2023 | Automatic Machine Learning Model SelectionModel Selection | CodeCode Available | 6 |
| Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Mar 22, 2023 | Arithmetic ReasoningMathematical Reasoning | CodeCode Available | 6 |
| ART: Automatic multi-step reasoning and tool-use for large language models | Mar 16, 2023 | MMLU | CodeCode Available | 6 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| A Method for Animating Children's Drawings of the Human Figure | Mar 7, 2023 | Image to Video Generation | CodeCode Available | 6 |
| The Dormant Neuron Phenomenon in Deep Reinforcement Learning | Feb 24, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 6 |
| Nerfstudio: A Modular Framework for Neural Radiance Field Development | Feb 8, 2023 | NeRFNovel View Synthesis | CodeCode Available | 6 |
| MusicLM: Generating Music From Text | Jan 26, 2023 | Music GenerationText-to-Music Generation | CodeCode Available | 6 |
| A Watermark for Large Language Models | Jan 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Dec 13, 2022 | Code SummarizationLanguage Modeling | CodeCode Available | 6 |
| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Nov 22, 2022 | Image AnimationTalking Head Generation | CodeCode Available | 6 |
| SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models | Nov 18, 2022 | Quantization | CodeCode Available | 6 |
| Versatile Diffusion: Text, Images and Variations All in One Diffusion Model | Nov 15, 2022 | AllDisentanglement | CodeCode Available | 6 |
| ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech | Nov 7, 2022 | Representation LearningSpeech Representation Learning | CodeCode Available | 6 |
| FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning | Nov 6, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 6 |
| Automatic Chain of Thought Prompting in Large Language Models | Oct 7, 2022 | DiversityQuestion Answering | CodeCode Available | 6 |
| GLM-130B: An Open Bilingual Pre-trained Model | Oct 5, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis | Oct 5, 2022 | Action RecognitionAnomaly Detection | CodeCode Available | 6 |
| AudioGen: Textually Guided Audio Generation | Sep 30, 2022 | Audio GenerationDescriptive | CodeCode Available | 6 |
| Petals: Collaborative Inference and Fine-tuning of Large Models | Sep 2, 2022 | Collaborative Inference | CodeCode Available | 6 |
| Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models | Jul 26, 2022 | Image GenerationPrompt Engineering | CodeCode Available | 6 |
| Synthetic Dataset Generation for Adversarial Machine Learning Research | Jul 21, 2022 | BIG-bench Machine LearningDataset Generation | CodeCode Available | 6 |
| Quantized Training of Gradient Boosting Decision Trees | Jul 20, 2022 | Quantization | CodeCode Available | 6 |
| TensorIR: An Abstraction for Automatic Tensorized Program Optimization | Jul 9, 2022 | BIG-bench Machine LearningDeep Learning | CodeCode Available | 6 |
| Towards Robust Blind Face Restoration with Codebook Lookup Transformer | Jun 22, 2022 | Blind Face RestorationPrediction | CodeCode Available | 6 |
| CVNets: High Performance Library for Computer Vision | Jun 4, 2022 | Video UnderstandingVocal Bursts Intensity Prediction | CodeCode Available | 6 |
| CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers | May 29, 2022 | Text-to-Video GenerationVideo Generation | CodeCode Available | 6 |
| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | May 27, 2022 | 16k4k | CodeCode Available | 6 |
| What's Behind the Mask: Understanding Masked Graph Modeling for Graph Autoencoders | May 20, 2022 | Contrastive LearningLink Prediction | CodeCode Available | 6 |
| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | May 20, 2022 | AllAutomatic Speech Recognition (ASR) | CodeCode Available | 6 |
| Semi-Parametric Neural Image Synthesis | Apr 25, 2022 | Image GenerationRetrieval | CodeCode Available | 6 |
| Training Compute-Optimal Large Language Models | Mar 29, 2022 | AnachronismsAnalogical Similarity | CodeCode Available | 6 |
| CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis | Mar 25, 2022 | Code GenerationHumanEval | CodeCode Available | 6 |
| Long Document Summarization with Top-down and Bottom-up Inference | Mar 15, 2022 | Text Summarization | CodeCode Available | 6 |
| Training language models to follow instructions with human feedback | Mar 4, 2022 | Question Answering | CodeCode Available | 6 |
| Pseudo Numerical Methods for Diffusion Models on Manifolds | Feb 20, 2022 | DenoisingImage Generation | CodeCode Available | 6 |
| Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Jan 28, 2022 | Common Sense ReasoningGSM8K | CodeCode Available | 6 |
| Instant Neural Graphics Primitives with a Multiresolution Hash Encoding | Jan 16, 2022 | 3D Reconstruction3D Shape Reconstruction | CodeCode Available | 6 |
| LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer | Mar 19, 2026 | | —Unverified | 5 |
| Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length | Mar 16, 2026 | | —Unverified | 5 |
| DeepEyesV2: Toward Agentic Multimodal Model | Mar 11, 2026 | | —Unverified | 5 |
| EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery | Mar 9, 2026 | | —Unverified | 5 |