| Learning To Retrieve Prompts for In-Context Learning | Dec 16, 2021 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| Legilimens: Practical and Unified Content Moderation for Large Language Model Services | Aug 28, 2024 | Data Augmentation, Language Modeling | Code Available | 1 | 5 |
| Few-shot Reranking for Multi-hop QA via Language Model Prompting | May 25, 2022 | Language Modeling, Open-Domain Question Answering | Code Available | 1 | 5 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation Recommendation, Coreference Resolution | Code Available | 1 | 5 |
| Learning How to Ask: Querying LMs with Mixtures of Soft Prompts | Apr 14, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data Visualization, Language Modeling | Code Available | 1 | 5 |
| Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations | Feb 19, 2024 | Chatbot, Language Modeling | Code Available | 1 | 5 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning | Apr 17, 2020 | CPU, Language Modeling | Code Available | 1 | 5 |
| Learning Hierarchical Structures with Differentiable Nondeterministic Stacks | Sep 5, 2021 | Inductive Bias, Language Modeling | Code Available | 1 | 5 |
| Learning Passage Impacts for Inverted Indexes | Apr 24, 2021 | Information Retrieval, Language Modeling | Code Available | 1 | 5 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary Classification, Language Modeling | Code Available | 1 | 5 |
| Fast-R2D2: A Pretrained Recursive Neural Network based on Pruned CKY for Grammar Induction and Text Representation | Mar 1, 2022 | Constituency Grammar Induction, Language Modeling | Code Available | 1 | 5 |
| ArcGPT: A Large Language Model Tailored for Real-world Archival Applications | Jul 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPU, Language Modeling | Code Available | 1 | 5 |
| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image Captioning, Image Description | Code Available | 1 | 5 |
| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Aug 24, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Learning Sparse Prototypes for Text Generation | Jun 29, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Fast Vocabulary Transfer for Language Model Compression | Feb 15, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | Cross-Modal Alignment, Knowledge Distillation | Code Available | 1 | 5 |
| Fauno: The Italian Large Language Model that will leave you senza parole! | Jun 26, 2023 | GPU, Language Modeling | Code Available | 1 | 5 |
| Cascaded Head-colliding Attention | May 31, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |