| Iterative Pseudo-Labeling for Speech Recognition | May 19, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models | Feb 27, 2019 | General ClassificationLanguage Modeling | CodeCode Available | 0 |
| Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset | Oct 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Language Model-driven Meta-structure Discovery in Heterogeneous Information Network | Feb 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairs | Jun 7, 2023 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Can ChatGPT's Responses Boost Traditional Natural Language Processing? | Jul 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LLMs as Educational Analysts: Transforming Multimodal Data Traces into Actionable Reading Assessment Reports | Mar 3, 2025 | FairnessLanguage Modeling | CodeCode Available | 0 |
| Hierarchical Character Embeddings: Learning Phonological and Semantic Representations in Languages of Logographic Origin using Recursive Neural Networks | Dec 20, 2019 | DiagnosticLanguage Modeling | CodeCode Available | 0 |
| AlcLaM: Arabic Dialectal Language Model | Jul 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Latent Variable Recurrent Neural Network for Discourse Relation Language Models | Mar 7, 2016 | ClassificationDialog Act Classification | CodeCode Available | 0 |
| tcrLM: a lightweight protein language model for predicting T cell receptor and epitope binding specificity | Jun 24, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| "I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities | Dec 26, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 |
| Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing | Mar 25, 2019 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Can a Large Language Model Learn Matrix Functions In Context? | Nov 24, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| A Tool for Generating Exceptional Behavior Tests With Large Language Models | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LoRec: Large Language Model for Robust Sequential Recommendation against Poisoning Attacks | Jan 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Tool for Facilitating OCR Postediting in Historical Documents | Apr 23, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LLMSat: A Large Language Model-Based Goal-Oriented Agent for Autonomous Space Exploration | Apr 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification | Sep 5, 2024 | Data AugmentationDiversity | CodeCode Available | 0 |
| A large language model-assisted education tool to provide feedback on open-ended responses | Jul 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Systematic Comparison of Architectures for Document-Level Sentiment Classification | Feb 19, 2020 | ClassificationDocument Classification | CodeCode Available | 0 |
| Can a large language model be a gaslighter? | Oct 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance | Apr 1, 2024 | image-classificationImage Classification | CodeCode Available | 0 |
| Disentangling Language and Knowledge in Task-Oriented Dialogs | May 3, 2018 | DecoderDisentanglement | CodeCode Available | 0 |
| CXP949 at WNUT-2020 Task 2: Extracting Informative COVID-19 Tweets -- RoBERTa Ensembles and The Continued Relevance of Handcrafted Features | Oct 15, 2020 | ClassificationGeneral Classification | CodeCode Available | 0 |
| Hierarchical Quantized Representations for Script Generation | Aug 28, 2018 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Can AI Relate: Testing Large Language Model Response for Mental Health Support | May 20, 2024 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| Customising General Large Language Models for Specialised Emotion Recognition Tasks | Apr 14, 2024 | Emotion RecognitionLanguage Modeling | CodeCode Available | 0 |
| A Language Model of Java Methods with Train/Test Deduplication | May 15, 2023 | DescriptiveLanguage Modeling | CodeCode Available | 0 |
| Can (A)I Change Your Mind? | Mar 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Language Model for Spell Checking of Educational Texts in Kurdish (Sorani) | Jun 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Jamba: A Hybrid Transformer-Mamba Language Model | Mar 28, 2024 | GPULanguage Modeling | CodeCode Available | 0 |
| cushLEPOR: customising hLEPOR metric using Optuna for higher agreement with human judgments or pre-trained language model LaBSE | Aug 21, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CamemBERT: a Tasty French Language Model | Nov 10, 2019 | Dependency ParsingLanguage Modeling | CodeCode Available | 0 |
| AKI-BERT: a Pre-trained Clinical Language Model for Early Prediction of Acute Kidney Injury | May 7, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Calibrating LLM-Based Evaluator | Sep 23, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| A Japanese Masked Language Model for Academic Domain | Oct 1, 2022 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| Activations and Gradients Compression for Model-Parallel Training | Jan 15, 2024 | image-classificationImage Classification | CodeCode Available | 0 |
| Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions | Nov 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| High-risk learning: acquiring new word vectors from tiny data | Jul 20, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Jasper: An End-to-End Convolutional Neural Acoustic Model | Apr 5, 2019 | DecoderLanguage Modeling | CodeCode Available | 0 |
| JavaBERT: Training a transformer-based model for the Java programming language | Oct 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Curriculum learning for language modeling | Aug 4, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Language Model Enhanced Machine Learning Estimators for Classification | May 8, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 0 |
| Anchor Points: Benchmarking Models with Much Fewer Examples | Sep 14, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| A Conversation is Worth A Thousand Recommendations: A Survey of Holistic Conversational Recommender Systems | Sep 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Targeted Assessment of Incremental Processing in Neural LanguageModels and Humans | Jun 6, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP | May 22, 2025 | Continual PretrainingDiagnostic | CodeCode Available | 0 |
| HistBERT: A Pre-trained Language Model for Diachronic Lexical Semantic Analysis | Feb 8, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Learning to Verify Summary Facts with Fine-Grained LLM Feedback | Dec 14, 2024 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |