| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Realistic Threat Model for Large Language Model Jailbreaks | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CAT-LM: Training Language Models on Aligned Code And Tests | Oct 2, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| 4-bit Shampoo for Memory-Efficient Network Training | May 28, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling | Dec 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations | Feb 19, 2024 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 |
| EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROAD | May 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs | Oct 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AgroGPT: Efficient Agricultural Vision-Language Model with Expert Tuning | Oct 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 |
| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | ClusteringHallucination | CodeCode Available | 1 |
| DziriBERT: a Pre-trained Language Model for the Algerian Dialect | Sep 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs | May 1, 2022 | Constituency Grammar InductionLanguage Modeling | CodeCode Available | 1 |
| Listen, Attend and Spell | Aug 5, 2015 | DecoderLanguage Modeling | CodeCode Available | 1 |
| LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses | Oct 30, 2023 | FormLanguage Modeling | CodeCode Available | 1 |
| A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 |
| ArcGPT: A Large Language Model Tailored for Real-world Archival Applications | Jul 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering | May 2, 2020 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignmentKnowledge Distillation | CodeCode Available | 1 |
| Copy Is All You Need | Jul 13, 2023 | AllDomain Adaptation | CodeCode Available | 1 |
| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |