| Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning | Jan 27, 2023 | Few-Shot LearningGSM8K | CodeCode Available | 1 |
| Case-Based Reasoning with Language Models for Classification of Logical Fallacies | Jan 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Context Matters: A Strategy to Pre-train Language Model for Science Education | Jan 27, 2023 | ArticlesLanguage Modeling | —Unverified | 0 |
| Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus | Jan 27, 2023 | Language AcquisitionLanguage Modeling | CodeCode Available | 1 |
| ThoughtSource: A central hub for large language model reasoning data | Jan 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Probing Out-of-Distribution Robustness of Language Models with Parameter-Efficient Transfer Learning | Jan 27, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Semi-Parametric Video-Grounded Text Generation | Jan 27, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Prompt-Based Editing for Text Style Transfer | Jan 27, 2023 | ClassificationLanguage Modeling | CodeCode Available | 1 |
| SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient | Jan 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GPU-based Private Information Retrieval for On-Device Machine Learning Inference | Jan 26, 2023 | CPUGPU | CodeCode Available | 1 |
| Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning | Jan 26, 2023 | Dialogue ManagementDialogue State Tracking | —Unverified | 0 |
| Domain-Agnostic Molecular Generation with Chemical Feedback | Jan 26, 2023 | Drug DesignLanguage Modeling | CodeCode Available | 1 |
| Explaining Large Language Model-Based Neural Semantic Parsers (Student Abstract) | Jan 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ExaRanker: Explanation-Augmented Neural Ranker | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FewShotTextGCN: K-hop neighborhood regularization for few-shot learning on graphs | Jan 25, 2023 | Document ClassificationFew-Shot Learning | —Unverified | 0 |
| Language Model Detoxification in Dialogue with Contextualized Stance Control | Jan 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Editing Language Model-based Knowledge Graph Embeddings | Jan 25, 2023 | EDIT Taskknowledge editing | CodeCode Available | 2 |
| XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ViDeBERTa: A powerful pre-trained language model for Vietnamese | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Semi-Automated Construction of Food Composition Knowledge Base | Jan 24, 2023 | Active LearningLanguage Modeling | CodeCode Available | 0 |
| Large language models can segment narrative events similarly to humans | Jan 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Watermark for Large Language Models | Jan 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Lexi: Self-Supervised Learning of the UI Language | Jan 23, 2023 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning | Jan 23, 2023 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 0 |
| DiffSDS: A language diffusion model for protein backbone inpainting under geometric conditions and constraints | Jan 22, 2023 | DecoderDenoising | CodeCode Available | 1 |
| An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models | Jan 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers | Jan 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Unifying Structure Reasoning and Language Model Pre-training for Complex Reasoning | Jan 21, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion Detection | Jan 21, 2023 | 4kLanguage Modeling | CodeCode Available | 0 |
| Adapting a Language Model While Preserving its General Knowledge | Jan 21, 2023 | Continual LearningGeneral Knowledge | CodeCode Available | 2 |
| Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien | Jan 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Batch Prompting: Efficient Inference with Large Language Model APIs | Jan 19, 2023 | Arithmetic ReasoningIn-Context Learning | CodeCode Available | 1 |
| CLIPTER: Looking at the Bigger Picture in Scene Text Recognition | Jan 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation | Jan 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Syllable Subword Tokens for Open Vocabulary Speech Recognition in Malayalam | Jan 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Prompting Large Language Model for Machine Translation: A Case Study | Jan 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Ankh: Optimized Protein Language Model Unlocks General-Purpose Modelling | Jan 16, 2023 | DiversityLanguage Modeling | CodeCode Available | 0 |
| A Case Study in Engineering a Conversational Programming Assistant's Persona | Jan 13, 2023 | ChatbotLanguage Modeling | —Unverified | 0 |
| CLIP the Gap: A Single Domain Generalization Approach for Object Detection | Jan 13, 2023 | Domain Generalizationimage-classification | CodeCode Available | 1 |
| In BLOOM: Creativity and Affinity in Artificial Lyrics and Art | Jan 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Cohesive Distillation Architecture for Neural Language Models | Jan 12, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| KAER: A Knowledge Augmented Pre-Trained Language Model for Entity Resolution | Jan 12, 2023 | Entity ResolutionGeneral Knowledge | —Unverified | 0 |
| NarrowBERT: Accelerating Masked Language Model Pretraining and Inference | Jan 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Topics in Contextualised Attention Embeddings | Jan 11, 2023 | ClusteringLanguage Modeling | —Unverified | 0 |
| Memory Augmented Large Language Models are Computationally Universal | Jan 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chatbots in a Honeypot World | Jan 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dynamic Grained Encoder for Vision Transformers | Jan 10, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching | Jan 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generative Antibody Design for Complementary Chain Pairing Sequences through Encoder-Decoder Language Model | Jan 6, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| You Truly Understand What I Need: Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona | Jan 6, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |