| Empirical Sufficiency Lower Bounds for Language Modeling with Locally-Bootstrapped Semantic Structures | May 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuning | May 30, 2023 | Binary ClassificationData Integration | CodeCode Available | 0 |
| LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding | May 30, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Preserving Pre-trained Features Helps Calibrate Fine-tuned Language Models | May 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Direct Preference Optimization: Your Language Model is Secretly a Reward Model | May 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Adapting Learned Sparse Retrieval for Long Documents | May 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Short Answer Grading Using One-shot Prompting and Text Similarity Scoring Model | May 29, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Test-Time Training on Nearest Neighbors for Large Language Models | May 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy | May 29, 2023 | Answer GenerationArticles | —Unverified | 0 |
| PaLI-X: On Scaling up a Multilingual Vision and Language Model | May 29, 2023 | Chart Question Answeringdocument understanding | CodeCode Available | 1 |
| Writing user personas with Large Language Models: Testing phase 6 of a Thematic Analysis of semi-structured interviews | May 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Do Language Models Know When They're Hallucinating References? | May 29, 2023 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections | May 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Quantitative Review on Language Model Efficiency Research | May 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Feature-Learning Networks Are Consistent Across Widths At Realistic Scales | May 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR | May 28, 2023 | DecoderForm | —Unverified | 0 |
| Rethinking Masked Language Modeling for Chinese Spelling Correction | May 28, 2023 | DiversityDomain Generalization | CodeCode Available | 1 |
| KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application | May 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Augmenting Large Language Model Translators via Translation Memories | May 27, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training | May 27, 2023 | intent-classificationIntent Classification | —Unverified | 0 |
| Matrix Information Theory for Self-Supervised Learning | May 27, 2023 | Contrastive LearningGSM8K | CodeCode Available | 1 |
| Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques | May 27, 2023 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| Query-Efficient Black-Box Red Teaming via Bayesian Optimization | May 27, 2023 | Bayesian OptimizationLanguage Modeling | CodeCode Available | 1 |
| Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Distinguishing Human Generated Text From ChatGPT Generated Text Using Machine Learning | May 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large language models improve Alzheimer's disease diagnosis using multi-modality data | May 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SQL-PaLM: Improved Large Language Model Adaptation for Text-to-SQL (extended) | May 26, 2023 | Data AugmentationIn-Context Learning | —Unverified | 0 |
| DataChat: Prototyping a Conversational Agent for Dataset Search and Visualization | May 26, 2023 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| CONA: A novel CONtext-Aware instruction paradigm for communication using large language model | May 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| External Language Model Integration for Factorized Neural Transducers | May 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models | May 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tokenization Impacts Multilingual Language Modeling: Assessing Vocabulary Allocation and Overlap Across Languages | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Zero-shot Visual Question Answering with Language Model Feedback | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Slide, Constrain, Parse, Repeat: Synchronous SlidingWindows for Document AMR Parsing | May 26, 2023 | Abstract Meaning RepresentationAMR Parsing | —Unverified | 0 |
| An Empirical Comparison of LM-based Question and Answer Generation Methods | May 26, 2023 | Answer GenerationData Augmentation | —Unverified | 0 |
| An Investigation of Noise in Morphological Inflection | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Leveraging Domain Knowledge for Inclusive and Bias-aware Humanitarian Response Entry Classification | May 26, 2023 | counterfactualData Augmentation | CodeCode Available | 0 |
| Backpack Language Models | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Emergent Agentic Transformer from Chain of Hindsight Experience | May 26, 2023 | D4RLImitation Learning | —Unverified | 0 |
| Green Runner: A tool for efficient model selection from model repositories | May 26, 2023 | Deep LearningImage Captioning | —Unverified | 0 |
| Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model | May 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Models Implement Simple Word2Vec-style Vector Arithmetic | May 25, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| BookGPT: A General Framework for Book Recommendation Empowered by Large Language Model | May 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst | May 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting | May 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation | May 25, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Lexinvariant Language Models | May 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |