| Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing | Mar 25, 2019 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| DynaBERT: Dynamic BERT with Adaptive Width and Depth | Apr 8, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning to Plan for Language Modeling from Unlabeled Data | Mar 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset | Oct 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning to Generate Compositional Color Descriptions | Jun 13, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| CXP949 at WNUT-2020 Task 2: Extracting Informative COVID-19 Tweets -- RoBERTa Ensembles and The Continued Relevance of Handcrafted Features | Oct 15, 2020 | ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| Analyzing constrained LLM through PDFA-learning | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation | Jul 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning to Maximize Mutual Information for Chain-of-Thought Distillation | Mar 5, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 0 | 5 |
| AX-MABSA: A Framework for Extremely Weakly Supervised Multi-label Aspect Based Sentiment Analysis | Nov 7, 2022 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 0 | 5 |
| Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically | Apr 25, 2024 | Inductive BiasLanguage Modeling | CodeCode Available | 0 | 5 |
| Dynamic Entity Representations in Neural Language Models | Aug 2, 2017 | Coreference ResolutionLanguage Modeling | CodeCode Available | 0 | 5 |
| Dynamic Evaluation of Transformer Language Models | Apr 17, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning to Describe for Predicting Zero-shot Drug-Drug Interactions | Mar 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables | Nov 7, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning Python Code Suggestion with a Sparse Pointer Network | Nov 24, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning Recurrent Binary/Ternary Weights | Sep 28, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions | Nov 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning Natural Language Generation with Truncated Reinforcement Learning | Jul 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning of Generalizable and Interpretable Knowledge in Grid-Based Reinforcement Learning Environments | Sep 7, 2023 | Atari GamesDecision Making | CodeCode Available | 0 | 5 |
| Customising General Large Language Models for Specialised Emotion Recognition Tasks | Apr 14, 2024 | Emotion RecognitionLanguage Modeling | CodeCode Available | 0 | 5 |
| Learning Intrinsic Sparse Structures within Long Short-Term Memory | Sep 15, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Learning Longer Memory in Recurrent Neural Networks | Dec 24, 2014 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |