| GIT: A Generative Image-to-text Transformer for Vision and Language | May 27, 2022 | DecoderImage Captioning | CodeCode Available | 2 | 5 |
| Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Feb 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 | 5 |
| Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | Mar 18, 2025 | GPULanguage Modeling | CodeCode Available | 2 | 5 |
| TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Dec 4, 2023 | Dense CaptioningHighlight Detection | CodeCode Available | 2 | 5 |
| Generate rather than Retrieve: Large Language Models are Strong Context Generators | Sep 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Generalized Interpolating Discrete Diffusion | Mar 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| AntiFold: Improved antibody structure-based design using inverse folding | May 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| A Generalist Agent | May 12, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Mar 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Generating Benchmarks for Factuality Evaluation of Language Models | Jul 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |