| Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification Tasks | Oct 1, 2022 | ClassificationLanguage Modeling | CodeCode Available | 0 | 5 |
| Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing | Apr 21, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| An LSTM Adaptation Study of (Un)grammaticality | Aug 1, 2019 | CoLALanguage Modeling | CodeCode Available | 0 | 5 |
| Making Language Model a Hierarchical Classifier and Generator | Jul 17, 2025 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Private Memorization Editing: Turning Memorization into a Defense to Strengthen Data Privacy in Large Language Models | Jun 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Jun 25, 2024 | DenoisingLanguage Modeling | CodeCode Available | 0 | 5 |
| Blockwise Self-Attention for Long Document Understanding | Nov 7, 2019 | document understandingLanguage Modeling | CodeCode Available | 0 | 5 |
| Block-wise Dynamic Sparseness | Jan 14, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| MADLAD-400: A Multilingual And Document-Level Large Audited Dataset | Sep 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Machine-generated text detection prevents language model collapse | Feb 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Machine-in-the-Loop Rewriting for Creative Image Captioning | Nov 7, 2021 | DescriptiveImage Captioning | CodeCode Available | 0 | 5 |
| Ankh: Optimized Protein Language Model Unlocks General-Purpose Modelling | Jan 16, 2023 | DiversityLanguage Modeling | CodeCode Available | 0 | 5 |
| Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language | May 1, 2020 | Language Modelingspeech-recognition | CodeCode Available | 0 | 5 |
| M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets | Apr 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| A Framework for Adapting Human-Robot Interaction to Diverse User Groups | Oct 15, 2024 | Action DetectionActivity Detection | CodeCode Available | 0 | 5 |
| LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Mar 6, 2025 | BenchmarkingCommon Sense Reasoning | CodeCode Available | 0 | 5 |
| LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models | Apr 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| BLCU-ICALL at SemEval-2022 Task 1: Cross-Attention Multitasking Framework for Definition Modeling | Apr 16, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Blank Collapse: Compressing CTC emission for the faster decoding | Oct 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large Vocabularies | Nov 21, 2015 | CPULanguage Modeling | CodeCode Available | 0 | 5 |
| An Investigation of Noise in Morphological Inflection | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| LSTM based Conversation Models | Mar 31, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Black-box language model explanation by context length probing | Dec 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Low-Resource Sequence Labeling via Unsupervised Multilingual Contextualized Representations | Oct 24, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Multi-task Pre-training Language Model for Semantic Network Completion | Jan 13, 2022 | Contrastive LearningData Augmentation | CodeCode Available | 0 | 5 |
| LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring | Apr 6, 2021 | ARCAutomatic Speech Recognition | CodeCode Available | 0 | 5 |
| An Invariant Learning Characterization of Controlled Text Generation | May 31, 2023 | AttributeLanguage Modeling | CodeCode Available | 0 | 5 |
| Low-rank passthrough neural networks | Mar 10, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Low-Rank Constraints for Fast Inference in Structured Models | Jan 8, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Low Rank Factorizations are Indirect Encodings for Deep Neuroevolution | Apr 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Low-Rank RNN Adaptation for Context-Aware Language Modeling | Oct 6, 2017 | General ClassificationLanguage Modeling | CodeCode Available | 0 | 5 |
| Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | May 21, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 0 | 5 |
| A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| BIRCO: A Benchmark of Information Retrieval Tasks with Complex Objectives | Feb 21, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 | 5 |
| Lower Perplexity is Not Always Human-Like | Jun 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment | Jun 18, 2024 | AllLanguage Modeling | CodeCode Available | 0 | 5 |
| BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition | Aug 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Looking for a Handsome Carpenter! Debiasing GPT-3 Job Advertisements | May 23, 2022 | DiversityLanguage Modeling | CodeCode Available | 0 | 5 |
| Retrieval-Pretrained Transformer: Long-range Language Modeling with Self-retrieval | Jun 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Long Range Language Modeling via Gated State Spaces | Jun 27, 2022 | ArticlesLanguage Modeling | CodeCode Available | 0 | 5 |
| Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition | Feb 5, 2014 | Handwriting RecognitionLanguage Modeling | CodeCode Available | 0 | 5 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity LinkingLanguage Modeling | CodeCode Available | 0 | 5 |
| Long Short-Term Memory-Networks for Machine Reading | Jan 25, 2016 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Biomedical Event Extraction as Multi-turn Question Answering | Nov 1, 2020 | Event ExtractionKnowledge Base Population | CodeCode Available | 0 | 5 |
| An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP) | Feb 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| A Few-shot Approach to Resume Information Extraction via Prompts | Sep 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences | Aug 31, 2020 | DiversityLanguage Modeling | CodeCode Available | 0 | 5 |
| Logical Implications for Visual Question Answering Consistency | Mar 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| A Feasible Framework for Arbitrary-Shaped Scene Text Recognition | Dec 10, 2019 | Instance SegmentationLanguage Modeling | CodeCode Available | 0 | 5 |