| Multi-task Pre-training Language Model for Semantic Network Completion | Jan 13, 2022 | Contrastive Learning, Data Augmentation | Code Available | 0 | 5 |
| LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring | Apr 6, 2021 | ARC, Automatic Speech Recognition | Code Available | 0 | 5 |
| An Invariant Learning Characterization of Controlled Text Generation | May 31, 2023 | Attribute, Language Modeling | Code Available | 0 | 5 |
| Low-rank passthrough neural networks | Mar 10, 2016 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Low-Rank Constraints for Fast Inference in Structured Models | Jan 8, 2022 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Low Rank Factorizations are Indirect Encodings for Deep Neuroevolution | Apr 3, 2025 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Low-Rank RNN Adaptation for Context-Aware Language Modeling | Oct 6, 2017 | General Classification, Language Modeling | Code Available | 0 | 5 |
| Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | May 21, 2025 | Benchmarking, Language Modeling | Code Available | 0 | 5 |
| A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement | Oct 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| BIRCO: A Benchmark of Information Retrieval Tasks with Complex Objectives | Feb 21, 2024 | Information Retrieval, Language Modeling | Code Available | 0 | 5 |
| Lower Perplexity is Not Always Human-Like | Jun 2, 2021 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment | Jun 18, 2024 | All, Language Modeling | Code Available | 0 | 5 |
| BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition | Aug 16, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Looking for a Handsome Carpenter! Debiasing GPT-3 Job Advertisements | May 23, 2022 | Diversity, Language Modeling | Code Available | 0 | 5 |
| Retrieval-Pretrained Transformer: Long-range Language Modeling with Self-retrieval | Jun 23, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Long Range Language Modeling via Gated State Spaces | Jun 27, 2022 | Articles, Language Modeling | Code Available | 0 | 5 |
| Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition | Feb 5, 2014 | Handwriting Recognition, Language Modeling | Code Available | 0 | 5 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity Linking, Language Modeling | Code Available | 0 | 5 |
| Long Short-Term Memory-Networks for Machine Reading | Jan 25, 2016 | Decoder, Language Modeling | Code Available | 0 | 5 |
| Biomedical Event Extraction as Multi-turn Question Answering | Nov 1, 2020 | Event Extraction, Knowledge Base Population | Code Available | 0 | 5 |
| An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP) | Feb 23, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| A Few-shot Approach to Resume Information Extraction via Prompts | Sep 20, 2022 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences | Aug 31, 2020 | Diversity, Language Modeling | Code Available | 0 | 5 |
| Logical Implications for Visual Question Answering Consistency | Mar 16, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| A Feasible Framework for Arbitrary-Shaped Scene Text Recognition | Dec 10, 2019 | Instance Segmentation, Language Modeling | Code Available | 0 | 5 |