| Instant Neural Graphics Primitives with a Multiresolution Hash Encoding | Jan 16, 2022 | 3D Reconstruction3D Shape Reconstruction | CodeCode Available | 6 |
| SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models | Nov 18, 2022 | Quantization | CodeCode Available | 6 |
| Mistral 7B | Oct 10, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| Visual Instruction Tuning | Apr 17, 2023 | 1 Image, 2*2 Stitching3D Question Answering (3D-QA) | CodeCode Available | 6 |
| A decoder-only foundation model for time-series forecasting | Oct 14, 2023 | DecoderTime Series | CodeCode Available | 6 |
| RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback | Dec 1, 2023 | HallucinationImage Captioning | CodeCode Available | 6 |
| CVNets: High Performance Library for Computer Vision | Jun 4, 2022 | Video UnderstandingVocal Bursts Intensity Prediction | CodeCode Available | 6 |
| Better speech synthesis through scaling | May 12, 2023 | Image GenerationSpeech Synthesis | CodeCode Available | 6 |
| YaRN: Efficient Context Window Extension of Large Language Models | Aug 31, 2023 | Position | CodeCode Available | 6 |
| H2O Open Ecosystem for State-of-the-art Large Language Models | Oct 17, 2023 | | CodeCode Available | 6 |
| Towards Robust Blind Face Restoration with Codebook Lookup Transformer | Jun 22, 2022 | Blind Face RestorationPrediction | CodeCode Available | 6 |
| Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling | Apr 3, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 6 |
| FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning | Jul 17, 2023 | GPULanguage Modeling | CodeCode Available | 6 |
| Generative Agents: Interactive Simulacra of Human Behavior | Apr 7, 2023 | Language ModellingLarge Language Model | CodeCode Available | 6 |
| SqueezeLLM: Dense-and-Sparse Quantization | Jun 13, 2023 | GPUQuantization | CodeCode Available | 6 |
| Versatile Diffusion: Text, Images and Variations All in One Diffusion Model | Nov 15, 2022 | AllDisentanglement | CodeCode Available | 6 |
| Dynamic Datasets and Market Environments for Financial Reinforcement Learning | Apr 25, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 6 |
| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Nov 22, 2022 | Image AnimationTalking Head Generation | CodeCode Available | 6 |
| Efficient Memory Management for Large Language Model Serving with PagedAttention | Sep 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| A Method for Animating Children's Drawings of the Human Figure | Mar 7, 2023 | Image to Video Generation | CodeCode Available | 6 |
| CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society | Mar 31, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 6 |
| ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Dec 13, 2022 | Code SummarizationLanguage Modeling | CodeCode Available | 6 |
| Pseudo Numerical Methods for Diffusion Models on Manifolds | Feb 20, 2022 | DenoisingImage Generation | CodeCode Available | 6 |
| Efficient Guided Generation for Large Language Models | Jul 19, 2023 | Language ModellingText Generation | CodeCode Available | 6 |
| Semi-Parametric Neural Image Synthesis | Apr 25, 2022 | Image GenerationRetrieval | CodeCode Available | 6 |
| Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond | Apr 26, 2023 | Language ModellingNatural Language Understanding | CodeCode Available | 6 |
| Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Aug 23, 2023 | Image GenerationImage to text | CodeCode Available | 6 |
| Qwen Technical Report | Sep 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| SGLang: Efficient Execution of Structured Language Model Programs | Dec 12, 2023 | Few-Shot LearningLanguage Modeling | CodeCode Available | 6 |
| StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation | Dec 19, 2023 | DenoisingImage Generation | CodeCode Available | 6 |
| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | May 27, 2022 | 16k4k | CodeCode Available | 6 |
| Quantized Training of Gradient Boosting Decision Trees | Jul 20, 2022 | Quantization | CodeCode Available | 6 |
| Automatic Chain of Thought Prompting in Large Language Models | Oct 7, 2022 | DiversityQuestion Answering | CodeCode Available | 6 |
| A Survey of Large Language Models | Mar 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Petals: Collaborative Inference and Fine-tuning of Large Models | Sep 2, 2022 | Collaborative Inference | CodeCode Available | 6 |
| Continual Pre-Training of Large Language Models: How to (re)warm your model? | Aug 8, 2023 | Language Modelling | CodeCode Available | 6 |
| Synthetic Dataset Generation for Adversarial Machine Learning Research | Jul 21, 2022 | BIG-bench Machine LearningDataset Generation | CodeCode Available | 6 |
| ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech | Nov 7, 2022 | Representation LearningSpeech Representation Learning | CodeCode Available | 6 |
| AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head | Apr 25, 2023 | | CodeCode Available | 6 |
| 3D Gaussian Splatting for Real-Time Radiance Field Rendering | Aug 8, 2023 | Camera CalibrationNovel View Synthesis | CodeCode Available | 6 |
| Code Llama: Open Foundation Models for Code | Aug 24, 2023 | 16kCode Generation | CodeCode Available | 6 |
| Adversarial Diffusion Distillation | Nov 28, 2023 | Image Generation | CodeCode Available | 6 |
| Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Mar 22, 2023 | Arithmetic ReasoningMathematical Reasoning | CodeCode Available | 6 |
| Shap-E: Generating Conditional 3D Implicit Functions | May 3, 2023 | | CodeCode Available | 6 |
| ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models | Sep 2, 2023 | | CodeCode Available | 6 |
| The Dormant Neuron Phenomenon in Deep Reinforcement Learning | Feb 24, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 6 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Dec 1, 2023 | 2D Pose EstimationCommon Sense Reasoning | CodeCode Available | 6 |
| What's Behind the Mask: Understanding Masked Graph Modeling for Graph Autoencoders | May 20, 2022 | Contrastive LearningLink Prediction | CodeCode Available | 6 |
| RET-LLM: Towards a General Read-Write Memory for Large Language Models | May 23, 2023 | Question Answering | CodeCode Available | 6 |
| TensorIR: An Abstraction for Automatic Tensorized Program Optimization | Jul 9, 2022 | BIG-bench Machine LearningDeep Learning | CodeCode Available | 6 |