| Energy and Carbon Considerations of Fine-Tuning BERT | Nov 17, 2023 | Language Modeling | Unverified | 0 |
| Testing Language Model Agents Safely in the Wild | Nov 17, 2023 | Language Modeling | Unverified | 0 |
| PEFT-MedAware: Large Language Model for Medical Awareness | Nov 17, 2023 | Computational Efficiency, Language Modeling | Unverified | 0 |
| Chemist-X: Large Language Model-empowered Agent for Reaction Condition Recommendation in Chemical Synthesis | Nov 16, 2023 | AI Agent, Contrastive Learning | Unverified | 0 |
| Can Language Model Moderators Improve the Health of Online Discourse? | Nov 16, 2023 | Language Modeling | Unverified | 0 |
| The Ups and Downs of Large Language Model Inference with Vocabulary Trimming by Language Heuristics | Nov 16, 2023 | Language Modeling | Unverified | 0 |
| Effective Large Language Model Adaptation for Improved Grounding and Citation Generation | Nov 16, 2023 | Language Modeling | Unverified | 0 |
| Video-LLaVA: Learning United Visual Representation by Alignment Before Projection | Nov 16, 2023 | Language Modeling | Code Available | 4 |
| Multi-Step Dialogue Workflow Action Prediction | Nov 16, 2023 | In-Context Learning, Language Modeling | Unverified | 0 |
| Improving the Generation Quality of Watermarked Large Language Models via Word Importance Scoring | Nov 16, 2023 | Language Modeling | Unverified | 0 |
| Crafting In-context Examples according to LMs' Parametric Knowledge | Nov 16, 2023 | Hallucination, In-Context Learning | Code Available | 0 |
| HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs | Nov 16, 2023 | Domain Adaptation, Language Modeling | Code Available | 2 |
| On Retrieval Augmentation and the Limitations of Language Model Training | Nov 16, 2023 | Language Modeling | Unverified | 0 |
| LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores | Nov 16, 2023 | Language Modeling | Unverified | 0 |
| A Speed Odyssey for Deployable Quantization of LLMs | Nov 16, 2023 | Language Modeling | Unverified | 0 |
| Characterizing Tradeoffs in Language Model Decoding with Informational Interpretations | Nov 16, 2023 | Decoder, Diversity | Unverified | 0 |
| Leveraging LLMs in Scholarly Knowledge Graph Question Answering | Nov 16, 2023 | Graph Question Answering, Language Modeling | Code Available | 0 |
| Comparing Generalization in Learning with Limited Numbers of Exemplars: Transformer vs. RNN in Attractor Dynamics | Nov 15, 2023 | Dynamic Time Warping, Language Modeling | Unverified | 0 |
| User Persona Identification and New Service Adaptation Recommendation | Nov 15, 2023 | Collaborative Filtering, Language Modeling | Unverified | 0 |
| VideoCon: Robust Video-Language Alignment via Contrast Captions | Nov 15, 2023 | Language Modeling | Code Available | 1 |
| Contrastive Chain-of-Thought Prompting | Nov 15, 2023 | Language Modeling | Code Available | 1 |
| Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? | Nov 15, 2023 | Knowledge Graph Completion, Knowledge Graphs | Unverified | 0 |
| An Eye on Clinical BERT: Investigating Language Model Generalization for Diabetic Eye Disease Phenotyping | Nov 15, 2023 | Language Modeling | Code Available | 0 |
| Toucan: Token-Aware Character Level Language Modeling | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages | Nov 15, 2023 | Language Modeling | Code Available | 0 |
| Data Similarity is Not Enough to Explain Language Model Performance | Nov 15, 2023 | Language Modeling | Code Available | 0 |
| SiRA: Sparse Mixture of Low Rank Adaptation | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| Accelerating Toeplitz Neural Network with Constant-time Inference Complexity | Nov 15, 2023 | Language Modeling | Code Available | 1 |
| Autonomous Large Language Model Agents Enabling Intent-Driven Mobile GUI Testing | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| Improving Deep Learning Optimization through Constrained Parameter Regularization | Nov 15, 2023 | Deep Learning, Image Classification | Code Available | 0 |
| HeLM: Highlighted Evidence augmented Language Model for Enhanced Table-to-Text Generation | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| Grounding Gaps in Language Model Generations | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| GENEVA: GENErating and Visualizing branching narratives using LLMs | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| German FinBERT: A German Pre-trained Language Model | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| Aligning Neural Machine Translation Models: Human Feedback in Training and Inference | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| MAP's not dead yet: Uncovering true language model modes by conditioning away degeneracy | Nov 15, 2023 | Instruction Following, Language Modeling | Unverified | 0 |
| Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation | Nov 15, 2023 | Constituency Parsing, Knowledge Distillation | Code Available | 0 |
| Pearl: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| CLIMB: Curriculum Learning for Infant-inspired Model Building | Nov 15, 2023 | Language Modeling | Unverified | 0 |
| Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder | Nov 15, 2023 | Decoder, Image Captioning | Unverified | 0 |
| Large Language Model-Driven Classroom Flipping: Empowering Student-Centric Peer Questioning with Flipped Interaction | Nov 14, 2023 | Chatbot, Language Modeling | Unverified | 0 |
| Summarization-Based Document IDs for Generative Retrieval with Language Models | Nov 14, 2023 | Articles, Language Modeling | Code Available | 0 |
| Text Retrieval with Multi-Stage Re-Ranking Models | Nov 14, 2023 | Language Modeling | Code Available | 0 |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Nov 14, 2023 | Benchmarking, Language Modeling | Code Available | 1 |
| Semi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model Reasoning | Nov 14, 2023 | Knowledge Graphs, Language Modeling | Unverified | 0 |
| Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model | Nov 14, 2023 | Language Modeling | Code Available | 0 |
| Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding | Nov 14, 2023 | Image-based Generative Performance Benchmarking, Language Modeling | Code Available | 2 |
| Memory-efficient Stochastic methods for Memory-based Transformers | Nov 14, 2023 | Language Modeling | Code Available | 0 |
| Zero-shot audio captioning with audio-language model guidance and audio context keywords | Nov 14, 2023 | Audio captioning, Descriptive | Code Available | 1 |
| Anti-LM Decoding for Zero-shot In-context Machine Translation | Nov 14, 2023 | In-Context Learning, Language Modeling | Code Available | 0 |