Data Augmentations for Improved (Large) Language Model Generalization Oct 19, 2023 Attribute counterfactual
— Unverified 0Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model Oct 19, 2023 Causal Discovery Language Modeling
Code Code Available 0Character-level Chinese Backpack Language Models Oct 19, 2023 Language Modeling Language Modelling
Code Code Available 1GestureGPT: Toward Zero-Shot Free-Form Hand Gesture Understanding with Large Language Model Agents Oct 19, 2023 Common Sense Reasoning Form
Code Code Available 0Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks Oct 19, 2023 Few-Shot Learning In-Context Learning
— Unverified 0CLAIR: Evaluating Image Captions with Large Language Models Oct 19, 2023 Diversity Image Captioning
— Unverified 0Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer Oct 19, 2023 8k Computational Efficiency
— Unverified 0A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems Oct 19, 2023 Language Modeling Language Modelling
— Unverified 0ICU: Conquering Language Barriers in Vision-and-Language Modeling by Dividing the Tasks into Image Captioning and Language Understanding Oct 19, 2023 Image Captioning Language Modeling
Code Code Available 0Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture Oct 18, 2023 4k image-classification
Code Code Available 2Position Interpolation Improves ALiBi Extrapolation Oct 18, 2023 Language Modelling Position
Code Code Available 2Solving the multiplication problem of a large language model system using a graph-based method Oct 18, 2023 Chatbot Language Modeling
— Unverified 0Solving Hard Analogy Questions with Relation Embedding Chains Oct 18, 2023 Knowledge Graphs Language Modeling
Code Code Available 0Preference Optimization for Molecular Language Models Oct 18, 2023 Language Modeling Language Modelling
Code Code Available 0Pseudointelligence: A Unifying Framework for Language Model Evaluation Oct 18, 2023 Language Model Evaluation Language Modeling
— Unverified 0Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long Sequences Oct 18, 2023 Language Modeling Language Modelling
Code Code Available 0Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers Oct 18, 2023 Language Modeling Language Modelling
Code Code Available 0Document-Level Language Models for Machine Translation Oct 18, 2023 Language Modeling Language Modelling
— Unverified 0Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model Oct 18, 2023 Language Modeling Language Modelling
Code Code Available 1ChatGPT-guided Semantics for Zero-shot Learning Oct 18, 2023 Attribute Language Modelling
Code Code Available 0Generative error correction for code-switching speech recognition using large language models Oct 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Model Prediction Capabilities: Evidence from a Real-World Forecasting Tournament Oct 17, 2023 Language Modeling Language Modelling
— Unverified 0Multi-stage Large Language Model Correction for Speech Recognition Oct 17, 2023 Language Modeling Language Modelling
— Unverified 0Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition Oct 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging Oct 17, 2023 Language Modeling Language Modelling
Code Code Available 1Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges Oct 17, 2023 Language Modeling Language Modelling
— Unverified 0Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting Oct 17, 2023 Language Modelling Sensitivity
Code Code Available 1Learn Your Tokens: Word-Pooled Tokenization for Language Modeling Oct 17, 2023 Language Modeling Language Modelling
Code Code Available 0Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle Oct 17, 2023 Anomaly Detection Decision Making
— Unverified 0EvalCrafter: Benchmarking and Evaluating Large Video Generation Models Oct 17, 2023 Benchmarking Language Modelling
Code Code Available 1BitNet: Scaling 1-bit Transformers for Large Language Models Oct 17, 2023 Language Modeling Language Modelling
Code Code Available 2EXMODD: An EXplanatory Multimodal Open-Domain Dialogue dataset Oct 17, 2023 Language Modelling
Code Code Available 0Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models Oct 17, 2023 Decision Making Language Modeling
— Unverified 0Correction Focused Language Model Training for Speech Recognition Oct 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation Oct 17, 2023 Data Augmentation Language Modeling
— Unverified 0Watermarking LLMs with Weight Quantization Oct 17, 2023 Language Modeling Language Modelling
Code Code Available 1Utilising a Large Language Model to Annotate Subject Metadata: A Case Study in an Australian National Research Data Catalogue Oct 17, 2023 In-Context Learning Language Modeling
— Unverified 0ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing Oct 17, 2023 Language Modeling Language Modelling
Code Code Available 0DavIR: Data Selection via Implicit Reward for Large Language Models Oct 16, 2023 Causal Language Modeling GSM8K
— Unverified 0SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT Oct 16, 2023 Language Modeling Language Modelling
Code Code Available 1Swap and Predict -- Predicting the Semantic Changes in Words across Corpora by Context Swapping Oct 16, 2023 Change Detection Language Modelling
Code Code Available 0EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities Oct 16, 2023 Decision Making Language Modeling
Code Code Available 1Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset Oct 16, 2023 Language Modeling Language Modelling
Code Code Available 0MechGPT, a language-based strategy for mechanics and materials modeling that connects knowledge across scales, disciplines and modalities Oct 16, 2023 Knowledge Graphs Language Modelling
— Unverified 0RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling Oct 16, 2023 Hallucination Language Modeling
Code Code Available 1Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning Oct 16, 2023 Language Modelling Navigate
— Unverified 0Llemma: An Open Language Model For Mathematics Oct 16, 2023 Arithmetic Reasoning Automated Theorem Proving
Code Code Available 3Untying the Reversal Curse via Bidirectional Language Model Editing Oct 16, 2023 knowledge editing Language Modeling
Code Code Available 1Use of probabilistic phrases in a coordination game: human versus GPT-4 Oct 16, 2023 Language Modeling Language Modelling
— Unverified 0Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance Oct 16, 2023 Language Modeling Language Modelling
— Unverified 0