| Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models | Jul 26, 2022 | Image Generation, Prompt Engineering | Code Available | 6 |
| LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models | Sep 21, 2023 | 4k, GPU | Code Available | 6 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability prediction, Arithmetic Reasoning | Code Available | 6 |
| DINOv2: Learning Robust Visual Features without Supervision | Apr 14, 2023 | Depth Estimation, Domain Generalization | Code Available | 6 |
| GLM-130B: An Open Bilingual Pre-trained Model | Oct 5, 2022 | Language Modeling, Language Modelling | Code Available | 6 |
| Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Jan 28, 2022 | Common Sense Reasoning, GSM8K | Code Available | 6 |
| h2oGPT: Democratizing Large Language Models | Jun 13, 2023 | Chatbot, Fairness | Code Available | 6 |
| NEFTune: Noisy Embeddings Improve Instruction Finetuning | Oct 9, 2023 | Language Modeling, Language Modelling | Code Available | 6 |
| Improved Baselines with Visual Instruction Tuning | Oct 5, 2023 | Factual Inconsistency Detection in Chart Captioning, Image Classification | Code Available | 6 |
| An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models | Sep 18, 2023 | Visual Question Answering | Code Available | 6 |
| AutoGluon-TimeSeries: AutoML for Probabilistic Time Series Forecasting | Aug 10, 2023 | AutoML, Philosophy | Code Available | 6 |
| QLoRA: Efficient Finetuning of Quantized LLMs | May 23, 2023 | Chatbot, GPU | Code Available | 6 |
| FinGPT: Open-Source Financial Large Language Models | Jun 9, 2023 | Algorithmic Trading, Language Modeling | Code Available | 6 |
| PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding | Dec 7, 2023 | Diffusion Personalization, Diffusion Personalization Tuning Free | Code Available | 6 |
| Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models | Oct 6, 2023 | Decision Making, Retrieval | Code Available | 6 |
| CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis | Mar 25, 2022 | Code Generation, HumanEval | Code Available | 6 |
| TaskBench: Benchmarking Large Language Models for Task Automation | Nov 30, 2023 | Benchmarking, Parameter Prediction | Code Available | 6 |
| MemGPT: Towards LLMs as Operating Systems | Oct 12, 2023 | Management | Code Available | 6 |
| CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers | May 29, 2022 | Text-to-Video Generation, Video Generation | Code Available | 6 |
| TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis | Oct 5, 2022 | Action Recognition, Anomaly Detection | Code Available | 6 |
| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | May 20, 2022 | All, Automatic Speech Recognition (ASR) | Code Available | 6 |
| TabRepo: A Large Scale Repository of Tabular Model Evaluations and its AutoML Applications | Nov 6, 2023 | AutoML, Hyperparameter Optimization | Code Available | 6 |
| AudioGen: Textually Guided Audio Generation | Sep 30, 2022 | Audio Generation, Descriptive | Code Available | 6 |
| Data Formulator: AI-powered Concept-driven Visualization Authoring | Sep 18, 2023 | AI Agent | Code Available | 6 |
| SoundStorm: Efficient Parallel Audio Generation | May 16, 2023 | Audio Generation | Code Available | 6 |
| ART: Automatic multi-step reasoning and tool-use for large language models | Mar 16, 2023 | MMLU | Code Available | 6 |
| Distributed Inference and Fine-tuning of Large Language Models Over The Internet | Dec 13, 2023 | | Code Available | 6 |
| Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution | Jul 12, 2023 | Fairness, Image Classification | Code Available | 6 |
| Simple and Controllable Music Generation | Jun 8, 2023 | Language Modeling, Language Modelling | Code Available | 6 |
| RAGAS: Automated Evaluation of Retrieval Augmented Generation | Sep 26, 2023 | RAG, Retrieval | Code Available | 6 |
| MusicLM: Generating Music From Text | Jan 26, 2023 | Music Generation, Text-to-Music Generation | Code Available | 6 |
| Long Document Summarization with Top-down and Bottom-up Inference | Mar 15, 2022 | Text Summarization | Code Available | 6 |
| Training Compute-Optimal Large Language Models | Mar 29, 2022 | Anachronisms, Analogical Similarity | Code Available | 6 |
| Nerfstudio: A Modular Framework for Neural Radiance Field Development | Feb 8, 2023 | NeRF, Novel View Synthesis | Code Available | 6 |
| Extending Context Window of Large Language Models via Positional Interpolation | Jun 27, 2023 | Document Summarization, Language Modeling | Code Available | 6 |
| Seamless: Multilingual Expressive and Streaming Speech Translation | Dec 8, 2023 | automatic-speech-translation, Machine Translation | Code Available | 6 |
| SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models | Nov 28, 2023 | Video Generation | Code Available | 6 |
| SegRNN: Segment Recurrent Neural Network for Long-Term Time Series Forecasting | Aug 22, 2023 | Time Series, Time Series Forecasting | Code Available | 6 |
| Gorilla: Large Language Model Connected with Massive APIs | May 24, 2023 | Hallucination, Language Modeling | Code Available | 6 |
| HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face | Mar 30, 2023 | Automatic Machine Learning Model Selection, Model Selection | Code Available | 6 |
| U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation | Nov 29, 2023 | Computational Efficiency, Decoder | Code Available | 6 |
| FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning | Nov 6, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 6 |
| AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Jun 1, 2023 | Autonomous Driving, Cloud Computing | Code Available | 6 |
| OxfordVGG Submission to the EGO4D AV Transcription Challenge | Jul 18, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 6 |
| Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca | Apr 17, 2023 | | Code Available | 6 |
| Training language models to follow instructions with human feedback | Mar 4, 2022 | Question Answering | Code Available | 6 |
| LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer | Mar 19, 2026 | | Unverified | 5 |
| DeepEyesV2: Toward Agentic Multimodal Model | Mar 11, 2026 | | Unverified | 5 |
| Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters | Feb 23, 2026 | | Unverified | 5 |
| OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data | Feb 14, 2026 | | Unverified | 5 |