| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Faster Causal Attention Over Large Sequences Through Sparse Flash Attention | Jun 1, 2023 | 16k8k | CodeCode Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| Learning Passage Impacts for Inverted Indexes | Apr 24, 2021 | Information RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| PaLI-X: On Scaling up a Multilingual Vision and Language Model | May 29, 2023 | Chart Question Answeringdocument understanding | CodeCode Available | 1 | 5 |
| Cascaded Head-colliding Attention | May 31, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |