| Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads | Nov 17, 2023 | DecoderFairness | —Unverified | 0 |
| Causal Graph in Language Model Rediscovers Cortical Hierarchy in Human Narrative Processing | Nov 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Energy and Carbon Considerations of Fine-Tuning BERT | Nov 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Testing Language Model Agents Safely in the Wild | Nov 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PEFT-MedAware: Large Language Model for Medical Awareness | Nov 17, 2023 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| On Retrieval Augmentation and the Limitations of Language Model Training | Nov 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Step Dialogue Workflow Action Prediction | Nov 16, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Chemist-X: Large Language Model-empowered Agent for Reaction Condition Recommendation in Chemical Synthesis | Nov 16, 2023 | AI AgentContrastive Learning | —Unverified | 0 |
| The Ups and Downs of Large Language Model Inference with Vocabulary Trimming by Language Heuristics | Nov 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving the Generation Quality of Watermarked Large Language Models via Word Importance Scoring | Nov 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores | Nov 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Speed Odyssey for Deployable Quantization of LLMs | Nov 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Characterizing Tradeoffs in Language Model Decoding with Informational Interpretations | Nov 16, 2023 | DecoderDiversity | —Unverified | 0 |
| Leveraging LLMs in Scholarly Knowledge Graph Question Answering | Nov 16, 2023 | Graph Question AnsweringLanguage Modeling | CodeCode Available | 0 |
| Can Language Model Moderators Improve the Health of Online Discourse? | Nov 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Effective Large Language Model Adaptation for Improved Grounding and Citation Generation | Nov 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Crafting In-context Examples according to LMs' Parametric Knowledge | Nov 16, 2023 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? | Nov 15, 2023 | Knowledge Graph CompletionKnowledge Graphs | —Unverified | 0 |
| An Eye on Clinical BERT: Investigating Language Model Generalization for Diabetic Eye Disease Phenotyping | Nov 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CLIMB: Curriculum Learning for Infant-inspired Model Building | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Autonomous Large Language Model Agents Enabling Intent-Driven Mobile GUI Testing | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Comparing Generalization in Learning with Limited Numbers of Exemplars: Transformer vs. RNN in Attractor Dynamics | Nov 15, 2023 | Dynamic Time WarpingLanguage Modeling | —Unverified | 0 |
| German FinBERT: A German Pre-trained Language Model | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Data Similarity is Not Enough to Explain Language Model Performance | Nov 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| GENEVA: GENErating and Visualizing branching narratives using LLMs | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HeLM: Highlighted Evidence augmented Language Model for Enhanced Table-to-Text Generation | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Grounding Gaps in Language Model Generations | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Aligning Neural Machine Translation Models: Human Feedback in Training and Inference | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| User Persona Identification and New Service Adaptation Recommendation | Nov 15, 2023 | Collaborative FilteringLanguage Modeling | —Unverified | 0 |
| Improving Deep Learning Optimization through Constrained Parameter Regularization | Nov 15, 2023 | Deep LearningImage Classification | CodeCode Available | 0 |
| Toucan: Token-Aware Character Level Language Modeling | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SiRA: Sparse Mixture of Low Rank Adaptation | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation | Nov 15, 2023 | Constituency ParsingKnowledge Distillation | CodeCode Available | 0 |
| Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder | Nov 15, 2023 | DecoderImage Captioning | —Unverified | 0 |
| When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages | Nov 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MAP's not dead yet: Uncovering true language model modes by conditioning away degeneracy | Nov 15, 2023 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Pearl: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Memory-efficient Stochastic methods for Memory-based Transformers | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model Reasoning | Nov 14, 2023 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Text Retrieval with Multi-Stage Re-Ranking Models | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Anti-LM Decoding for Zero-shot In-context Machine Translation | Nov 14, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Summarization-Based Document IDs for Generative Retrieval with Language Models | Nov 14, 2023 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| Large Language Model-Driven Classroom Flipping: Empowering Student-Centric Peer Questioning with Flipped Interaction | Nov 14, 2023 | ChatbotLanguage Modeling | —Unverified | 0 |
| Language Model-In-The-Loop: Data Optimal Approach to Learn-To-Recommend Actions in Text Games | Nov 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning | Nov 13, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor | Nov 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| AuthentiGPT: Detecting Machine-Generated Text via Black-Box Language Models Denoising | Nov 13, 2023 | DenoisingLanguage Modeling | —Unverified | 0 |
| Activity Sparsity Complements Weight Sparsity for Efficient RNN Inference | Nov 13, 2023 | Deep LearningLanguage Modeling | —Unverified | 0 |
| Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions | Nov 13, 2023 | ClassificationLanguage Modeling | —Unverified | 0 |