| HistBERT: A Pre-trained Language Model for Diachronic Lexical Semantic Analysis | Feb 8, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Evolving Subnetwork Training for Large Language Models | Jun 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Improving Complex Knowledge Base Question Answering via Question-to-Action and Question-to-Question Alignment | Dec 26, 2022 | Knowledge Base Question AnsweringLanguage Modeling | CodeCode Available | 0 | 5 |
| Circuit Stability Characterizes Language Model Generalization | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Improving Context Aware Language Models | Apr 21, 2017 | General ClassificationLanguage Modeling | CodeCode Available | 0 | 5 |
| Large Memory Layers with Product Keys | Jul 10, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model | Mar 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| NegatER: Unsupervised Discovery of Negatives in Commonsense Knowledge Bases | Nov 15, 2020 | Data AugmentationLanguage Modeling | CodeCode Available | 0 | 5 |
| Contrastive Language Prompting to Ease False Positives in Medical Anomaly Detection | Nov 12, 2024 | Anomaly DetectionLanguage Modeling | CodeCode Available | 0 | 5 |
| Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Jul 1, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 | 5 |
| Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation | Oct 25, 2023 | Conversational RecommendationData Augmentation | CodeCode Available | 0 | 5 |
| Examining Language Modeling Assumptions Using an Annotated Literary Dialect Corpus | Oct 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Large Product Key Memory for Pretrained Language Models | Oct 8, 2020 | Causal Language ModelingLanguage Modeling | CodeCode Available | 0 | 5 |
| Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition | Jul 9, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| From Markov to Laplace: How Mamba In-Context Learns Markov Chains | Feb 14, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| ChartFormer: A Large Vision Language Model for Converting Chart Images into Tactile Accessible SVGs | May 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Dissecting vocabulary biases datasets through statistical testing and automated data augmentation for artifact mitigation in Natural Language Inference | Dec 14, 2023 | Data AugmentationLanguage Modeling | CodeCode Available | 0 | 5 |
| Decomposed Prompting to Answer Questions on a Course Discussion Board | Jul 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text | Jul 14, 2021 | Data AugmentationLanguage Modeling | CodeCode Available | 0 | 5 |
| High-risk learning: acquiring new word vectors from tiny data | Jul 20, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection | Jun 17, 2024 | Fake News DetectionLanguage Modeling | CodeCode Available | 0 | 5 |
| Contrastive learning of T cell receptor representations | Jun 10, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Auto-tagging of Short Conversational Sentences using Natural Language Processing Methods | Jun 9, 2021 | ChatbotLanguage Modeling | CodeCode Available | 0 | 5 |
| Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities | Jun 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Cross-lingual Information Retrieval with BERT | Apr 24, 2020 | Cross-Lingual Information RetrievalDocument Ranking | CodeCode Available | 0 | 5 |
| Disentangling and Integrating Relational and Sensory Information in Transformer Architectures | May 26, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 | 5 |
| Drop Dropout on Single-Epoch Language Model Pretraining | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| DropMicroFluidAgents (DMFAs): Autonomous Droplet Microfluidic Research Framework Through Large Language Model Agents | Dec 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| When Low Resource NLP Meets Unsupervised Language Model: Meta-pretraining Then Meta-learning for Few-shot Text Classification | Aug 22, 2019 | Few-Shot LearningFew-Shot Text Classification | CodeCode Available | 0 | 5 |
| exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models | Oct 11, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Improving Generalization Performance by Switching from Adam to SGD | Dec 20, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Improving Grammatical Error Correction with Machine Translation Pairs | Nov 7, 2019 | Grammatical Error CorrectionLanguage Modeling | CodeCode Available | 0 | 5 |
| Applying a Pre-trained Language Model to Spanish Twitter Humor Prediction | Jul 6, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization | Nov 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Improving In-Context Learning with Small Language Model Ensembles | Oct 29, 2024 | Domain LabellingIn-Context Learning | CodeCode Available | 0 | 5 |
| Low-rank passthrough neural networks | Mar 10, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Improving Information Extraction on Business Documents with Specific Pre-Training Tasks | Sep 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Claim Optimization in Computational Argumentation | Dec 17, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation | May 10, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 | 5 |
| A Neural Language Model for Dynamically Representing the Meanings of Unknown Words and Entities in a Discourse | Sep 6, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Improving Language Generation with Sentence Coherence Objective | Sep 7, 2020 | DiversityLanguage Modeling | CodeCode Available | 0 | 5 |
| DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization | Feb 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| A Content-Based Novelty Measure for Scholarly Publications: A Proof of Concept | Jan 8, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 0 | 5 |
| Character-Level Language Modeling with Deeper Self-Attention | Aug 9, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Discriminative Policy Optimization for Token-Level Reward Models | May 29, 2025 | GSM8KLanguage Modeling | CodeCode Available | 0 | 5 |
| DrugTar Improves Druggability Prediction by Integrating Large Language Models and Gene Ontologies | Sep 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| DSC IIT-ISM at SemEval-2020 Task 6: Boosting BERT with Dependencies for Definition Extraction | Sep 17, 2020 | Definition ExtractionLanguage Modeling | CodeCode Available | 0 | 5 |
| Hierarchical Quantized Representations for Script Generation | Aug 28, 2018 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering | Nov 23, 2016 | DescriptiveLanguage Modeling | CodeCode Available | 0 | 5 |