Temperature-scaling surprisal estimates improve fit to human reading times -- but does it do so for the "right reasons"? Nov 15, 2023 Language Modelling Large Language Model
Code Code Available 0How much complexity does an RNN architecture need to learn syntax-sensitive dependencies? May 17, 2020 Language Modeling Language Modelling
Code Code Available 0Generation with Dynamic Vocabulary Oct 11, 2024 Language Modeling Language Modelling
Code Code Available 0Every Answer Matters: Evaluating Commonsense with Probabilistic Measures Jun 6, 2024 Common Sense Reasoning Language Modeling
Code Code Available 0Controlling the Amount of Verbatim Copying in Abstractive Summarization Nov 23, 2019 Abstractive Text Summarization Language Modeling
Code Code Available 0Evidence-backed Fact Checking using RAG and Few-Shot In-Context Learning with LLMs Aug 22, 2024 Fact Checking In-Context Learning
Code Code Available 0AlgebraNets Jun 12, 2020 Computational Efficiency image-classification
Code Code Available 0ANGOFA: Leveraging OFA Embedding Initialization and Synthetic Data for Angolan Language Model Apr 3, 2024 Language Modeling Language Modelling
Code Code Available 0Evidence Is All You Need: Ordering Imaging Studies via Language Model Alignment with the ACR Appropriateness Criteria Sep 27, 2024 All Diagnostic
Code Code Available 0Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation Feb 20, 2025 Generative Adversarial Network Language Modeling
Code Code Available 0Controlling Large Language Model with Latent Actions Mar 27, 2025 CoLA Language Modeling
Code Code Available 0Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor Nov 13, 2023 Language Modeling Language Modelling
Code Code Available 0Decoupled Sequence and Structure Generation for Realistic Antibody Design Feb 8, 2024 Language Modelling Protein Language Model
Code Code Available 0How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language Models Jul 16, 2024 Decision Making Language Modeling
Code Code Available 0Bidirectional Transformer Reranker for Grammatical Error Correction May 22, 2023 Decoder Grammatical Error Correction
Code Code Available 0Controllable Neural Story Plot Generation via Reward Shaping Sep 27, 2018 Language Modeling Language Modelling
Code Code Available 0How Phonotactics Affect Multilingual and Zero-shot ASR Performance Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Controllable Citation Sentence Generation with Language Models Nov 14, 2022 Attribute Language Modeling
Code Code Available 0How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench May 24, 2023 Diversity Language Modeling
Code Code Available 0Improving Generalization Performance by Switching from Adam to SGD Dec 20, 2017 Language Modeling Language Modelling
Code Code Available 0How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities Mar 20, 2025 General Knowledge Language Modeling
Code Code Available 0Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks Oct 16, 2018 Evolutionary Algorithms Language Modeling
Code Code Available 0Evolutionary Verbalizer Search for Prompt-based Few Shot Text Classification Jun 18, 2023 Few-Shot Text Classification Language Modeling
Code Code Available 0Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature Aug 23, 2023 Language Modeling Language Modelling
Code Code Available 0Improving Grammatical Error Correction with Machine Translation Pairs Nov 7, 2019 Grammatical Error Correction Language Modeling
Code Code Available 0A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement Oct 17, 2024 Language Modeling Language Modelling
Code Code Available 0A deep language model for software code Aug 9, 2016 Deep Learning Language Modeling
Code Code Available 0Evolving Assembly Code in an Adversarial Environment Mar 28, 2024 Language Modelling Large Language Model
Code Code Available 0A Deep Generative Model for Fragment-Based Molecule Generation Feb 28, 2020 Drug Design Language Modeling
Code Code Available 0Improving In-Context Learning with Small Language Model Ensembles Oct 29, 2024 Domain Labelling In-Context Learning
Code Code Available 0Contrastive learning of T cell receptor representations Jun 10, 2024 Contrastive Learning Language Modeling
Code Code Available 0AlcLaM: Arabic Dialectal Language Model Jul 18, 2024 Language Modeling Language Modelling
Code Code Available 0Generative Prompt Internalization Nov 24, 2024 Language Modeling Language Modelling
Code Code Available 0Evolving Subnetwork Training for Large Language Models Jun 11, 2024 Language Modeling Language Modelling
Code Code Available 0Improving Information Extraction on Business Documents with Specific Pre-Training Tasks Sep 11, 2023 Language Modeling Language Modelling
Code Code Available 0Is Training Data Quality or Quantity More Impactful to Small Language Model Performance? Nov 24, 2024 Language Modeling Language Modelling
Code Code Available 0Bidirectional Attention as a Mixture of Continuous Word Experts Jul 8, 2023 Language Modelling Mixture-of-Experts
Code Code Available 0How to Determine the Most Powerful Pre-trained Language Model without Brute Force Fine-tuning? An Empirical Survey Dec 8, 2023 Language Modeling Language Modelling
Code Code Available 0BiasKG: Adversarial Knowledge Graphs to Induce Bias in Large Language Models May 8, 2024 Knowledge Graphs Language Modeling
Code Code Available 0How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model? Sep 3, 2024 In-Context Learning Language Modeling
Code Code Available 0How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics Aug 24, 2020 Language Modeling Language Modelling
Code Code Available 0Examining Language Modeling Assumptions Using an Annotated Literary Dialect Corpus Oct 3, 2024 Language Modeling Language Modelling
Code Code Available 0An Eye on Clinical BERT: Investigating Language Model Generalization for Diabetic Eye Disease Phenotyping Nov 15, 2023 Language Modeling Language Modelling
Code Code Available 0An Exploratory Study on Automatic Identification of Assumptions in the Development of Deep Learning Frameworks Jan 8, 2024 Language Modelling Large Language Model
Code Code Available 0How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective Oct 14, 2024 Density Ratio Estimation GSM8K
Code Code Available 0How to Leverage Personal Textual Knowledge for Personalized Conversational Information Retrieval Jul 23, 2024 Information Retrieval Language Modeling
Code Code Available 0Contrastive Language Prompting to Ease False Positives in Medical Anomaly Detection Nov 12, 2024 Anomaly Detection Language Modeling
Code Code Available 0exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models Oct 11, 2019 Language Modeling Language Modelling
Code Code Available 0Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding May 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Continuous Speech Tokenizer in Text To Speech Oct 22, 2024 Language Modeling Language Modelling
Code Code Available 0