| Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model | Jun 11, 2024 | Decoder, Language Modeling | Code Available | 0 |
| PAYADOR: A Minimalist Approach to Grounding Language Models on Structured Data for Interactive Storytelling and Role-playing Games | Apr 9, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions | Jan 3, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Jun 29, 2024 | Diversity, Image Generation | Code Available | 0 |
| Patterns versus Characters in Subword-aware Neural Language Modeling | Sep 2, 2017 | Language Modeling, Language Modelling | Code Available | 0 |
| Recurrent Additive Networks | May 21, 2017 | Language Modeling, Language Modelling | Code Available | 0 |
| Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Jul 2, 2024 | Few-Shot Learning, Language Modeling | Code Available | 0 |
| Variational Autoencoders for Collaborative Filtering | Feb 16, 2018 | Bayesian Inference, Collaborative Filtering | Code Available | 0 |
| Recoding latent sentence representations -- Dynamic gradient-based activation modification in RNNs | Jan 3, 2021 | Language Modeling, Language Modelling | Code Available | 0 |
| Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language Model | Mar 2, 2025 | Anatomy, Denoising | Code Available | 0 |
| Multi-Grained Patch Training for Efficient LLM-based Recommendation | Jan 25, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Partially Shuffling the Training Data to Improve Language Models | Mar 11, 2019 | Language Modeling, Language Modelling | Code Available | 0 |
| TwinBooster: Synergising Large Language Models with Barlow Twins and Gradient Boosting for Enhanced Molecular Property Prediction | Jan 9, 2024 | Drug Discovery, Language Modeling | Code Available | 0 |
| YellowFin and the Art of Momentum Tuning | Jun 12, 2017 | Constituency Parsing, Language Modeling | Code Available | 0 |
| Neural Text Generation from Structured Data with Application to the Biography Domain | Mar 24, 2016 | Concept-To-Text Generation, Language Modeling | Code Available | 0 |
| MolXPT: Wrapping Molecules with Text for Generative Pre-training | May 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| ReCAM@IITK at SemEval-2021 Task 4: BERT and ALBERT based Ensemble for Abstract Word Prediction | Apr 4, 2021 | Language Modeling, Language Modelling | Code Available | 0 |
| Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features | May 17, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Reasoning-Grounded Natural Language Explanations for Language Models | Mar 14, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8k, Language Modeling | Code Available | 0 |
| Multimodal data matters: language model pre-training over structured and unstructured electronic health records | Jan 25, 2022 | Decision Making, Language Modeling | Code Available | 0 |
| Neural spell-checker: Beyond words with synthetic data generation | Oct 30, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Reanalyzing L2 Preposition Learning with Bayesian Mixed Effects and a Pretrained Language Model | Feb 16, 2023 | Diversity, Language Modeling | Code Available | 0 |
| The Hidden Space of Transformer Language Adapters | Feb 20, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Women Are Beautiful, Men Are Leaders: Gender Stereotypes in Machine Translation and Language Modeling | Nov 30, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives | Jul 22, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method | Feb 15, 2014 | Language Modeling, Language Modelling | Code Available | 0 |
| Neural Sign Language Translation | Jun 1, 2018 | Gesture Recognition, Language Modeling | Code Available | 0 |
| Vaxformer: Antigenicity-controlled Transformer for Vaccine Design Against SARS-CoV-2 | May 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Parsing as Language Modeling | Nov 1, 2016 | Constituency Parsing, Dependency Parsing | Code Available | 0 |
| The Impact of Element Ordering on LM Agent Performance | Sep 18, 2024 | Dimensionality Reduction, Language Modeling | Code Available | 0 |
| TypedThinker: Typed Thinking Improves Large Language Model Reasoning | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models | Apr 17, 2024 | Form, Language Model Evaluation | Code Available | 0 |
| Parameter-Efficient Language Model Tuning with Active Learning in Low-Resource Settings | May 23, 2023 | Active Learning, Language Modeling | Code Available | 0 |
| The impact of responding to patient messages with large language model assistance | Oct 26, 2023 | Chatbot, Decision Making | Code Available | 0 |
| The implementation of a Deep Recurrent Neural Network Language Model on a Xilinx FPGA | Oct 26, 2017 | CPU, Language Modeling | Code Available | 0 |
| The Importance of Being Recurrent for Modeling Hierarchical Structure | Mar 9, 2018 | Language Modeling, Language Modelling | Code Available | 0 |
| Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Jun 2, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Jun 25, 2024 | Denoising, Language Modeling | Code Available | 0 |
| Neural Shuffle-Exchange Networks - Sequence Processing in O(n log n) Time | Dec 1, 2019 | LAMBADA, Language Modeling | Code Available | 0 |
| UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation | Jun 24, 2023 | Image Generation, Image Segmentation | Code Available | 0 |
| Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation | Jan 21, 2025 | Contrastive Learning, Headline Generation | Code Available | 0 |
| We're Calling an Intervention: Exploring Fundamental Hurdles in Adapting Language Models to Nonstandard Text | Apr 10, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| The Influence of Context on Sentence Acceptability Judgements | Jul 1, 2018 | Language Modeling, Language Modelling | Code Available | 0 |
| Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination | Oct 21, 2022 | Image Generation, Language Modeling | Code Available | 0 |
| RealHarm: A Collection of Real-World Language Model Application Failures | Apr 14, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| UBERT: A Novel Language Model for Synonymy Prediction at Scale in the UMLS Metathesaurus | Apr 27, 2022 | Language Modeling, Language Modelling | Code Available | 0 |
| Neural Shuffle-Exchange Networks -- Sequence Processing in O(n log n) Time | Jul 18, 2019 | LAMBADA, Language Modeling | Code Available | 0 |
| Neural Scaling Laws Rooted in the Data Distribution | Dec 10, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| MAMUT: A Novel Framework for Modifying Mathematical Formulas for the Generation of Specialized Datasets for Language Model Training | Feb 28, 2025 | Language Modeling, Language Modelling | Code Available | 0 |