| TaskBench: Benchmarking Large Language Models for Task Automation | Nov 30, 2023 | BenchmarkingParameter Prediction | CodeCode Available | 6 |
| U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation | Nov 29, 2023 | Computational EfficiencyDecoder | CodeCode Available | 6 |
| SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models | Nov 28, 2023 | Video Generation | CodeCode Available | 6 |
| Adversarial Diffusion Distillation | Nov 28, 2023 | Image Generation | CodeCode Available | 6 |
| TabRepo: A Large Scale Repository of Tabular Model Evaluations and its AutoML Applications | Nov 6, 2023 | AutoMLHyperparameter Optimization | CodeCode Available | 6 |
| Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone | Oct 30, 2023 | Disentanglement | CodeCode Available | 6 |
| H2O Open Ecosystem for State-of-the-art Large Language Models | Oct 17, 2023 | | CodeCode Available | 6 |
| A decoder-only foundation model for time-series forecasting | Oct 14, 2023 | DecoderTime Series | CodeCode Available | 6 |
| MemGPT: Towards LLMs as Operating Systems | Oct 12, 2023 | Management | CodeCode Available | 6 |
| Mistral 7B | Oct 10, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| iTransformer: Inverted Transformers Are Effective for Time Series Forecasting | Oct 10, 2023 | Time SeriesTime Series Forecasting | CodeCode Available | 6 |
| NEFTune: Noisy Embeddings Improve Instruction Finetuning | Oct 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models | Oct 6, 2023 | Decision MakingRetrieval | CodeCode Available | 6 |
| Improved Baselines with Visual Instruction Tuning | Oct 5, 2023 | Factual Inconsistency Detection in Chart CaptioningImage Classification | CodeCode Available | 6 |
| Qwen Technical Report | Sep 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Vision Transformers Need Registers | Sep 28, 2023 | Object DiscoverySelf-Supervised Image Classification | CodeCode Available | 6 |
| RAGAS: Automated Evaluation of Retrieval Augmented Generation | Sep 26, 2023 | RAGRetrieval | CodeCode Available | 6 |
| LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models | Sep 21, 2023 | 4kGPU | CodeCode Available | 6 |
| Data Formulator: AI-powered Concept-driven Visualization Authoring | Sep 18, 2023 | AI Agent | CodeCode Available | 6 |
| An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models | Sep 18, 2023 | Visual Question Answering | CodeCode Available | 6 |
| Efficient Memory Management for Large Language Model Serving with PagedAttention | Sep 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models | Sep 2, 2023 | | CodeCode Available | 6 |
| YaRN: Efficient Context Window Extension of Large Language Models | Aug 31, 2023 | Position | CodeCode Available | 6 |
| Code Llama: Open Foundation Models for Code | Aug 24, 2023 | 16kCode Generation | CodeCode Available | 6 |
| Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Aug 23, 2023 | Image GenerationImage to text | CodeCode Available | 6 |
| SegRNN: Segment Recurrent Neural Network for Long-Term Time Series Forecasting | Aug 22, 2023 | Time SeriesTime Series Forecasting | CodeCode Available | 6 |
| AutoGluon-TimeSeries: AutoML for Probabilistic Time Series Forecasting | Aug 10, 2023 | AutoMLPhilosophy | CodeCode Available | 6 |
| Continual Pre-Training of Large Language Models: How to (re)warm your model? | Aug 8, 2023 | Language Modelling | CodeCode Available | 6 |
| 3D Gaussian Splatting for Real-Time Radiance Field Rendering | Aug 8, 2023 | Camera CalibrationNovel View Synthesis | CodeCode Available | 6 |
| L-Eval: Instituting Standardized Evaluation for Long Context Language Models | Jul 20, 2023 | Instruction Following | CodeCode Available | 6 |
| Efficient Guided Generation for Large Language Models | Jul 19, 2023 | Language ModellingText Generation | CodeCode Available | 6 |
| OxfordVGG Submission to the EGO4D AV Transcription Challenge | Jul 18, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 6 |
| FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning | Jul 17, 2023 | GPULanguage Modeling | CodeCode Available | 6 |
| Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution | Jul 12, 2023 | FairnessImage Classification | CodeCode Available | 6 |
| Extending Context Window of Large Language Models via Positional Interpolation | Jun 27, 2023 | Document SummarizationLanguage Modeling | CodeCode Available | 6 |
| SqueezeLLM: Dense-and-Sparse Quantization | Jun 13, 2023 | GPUQuantization | CodeCode Available | 6 |
| h2oGPT: Democratizing Large Language Models | Jun 13, 2023 | ChatbotFairness | CodeCode Available | 6 |
| FinGPT: Open-Source Financial Large Language Models | Jun 9, 2023 | Algorithmic TradingLanguage Modeling | CodeCode Available | 6 |
| Simple and Controllable Music Generation | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Jun 1, 2023 | Autonomous DrivingCloud Computing | CodeCode Available | 6 |
| Direct Preference Optimization: Your Language Model is Secretly a Reward Model | May 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Gorilla: Large Language Model Connected with Massive APIs | May 24, 2023 | HallucinationLanguage Modeling | CodeCode Available | 6 |
| RET-LLM: Towards a General Read-Write Memory for Large Language Models | May 23, 2023 | Question Answering | CodeCode Available | 6 |
| QLoRA: Efficient Finetuning of Quantized LLMs | May 23, 2023 | ChatbotGPU | CodeCode Available | 6 |
| RWKV: Reinventing RNNs for the Transformer Era | May 22, 2023 | Computational EfficiencyNatural Language Inference | CodeCode Available | 6 |
| SoundStorm: Efficient Parallel Audio Generation | May 16, 2023 | Audio Generation | CodeCode Available | 6 |
| Better speech synthesis through scaling | May 12, 2023 | Image GenerationSpeech Synthesis | CodeCode Available | 6 |
| Shap-E: Generating Conditional 3D Implicit Functions | May 3, 2023 | | CodeCode Available | 6 |
| Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond | Apr 26, 2023 | Language ModellingNatural Language Understanding | CodeCode Available | 6 |
| AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head | Apr 25, 2023 | | CodeCode Available | 6 |