| Rethinking Masked Language Modeling for Chinese Spelling Correction | May 28, 2023 | DiversityDomain Generalization | CodeCode Available | 1 |
| Query-Efficient Black-Box Red Teaming via Bayesian Optimization | May 27, 2023 | Bayesian OptimizationLanguage Modeling | CodeCode Available | 1 |
| Matrix Information Theory for Self-Supervised Learning | May 27, 2023 | Contrastive LearningGSM8K | CodeCode Available | 1 |
| Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques | May 27, 2023 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| Backpack Language Models | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Models Implement Simple Word2Vec-style Vector Arithmetic | May 25, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst | May 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PIVOINE: Instruction Tuning for Open-world Information Extraction | May 24, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Meta-Learning Online Adaptation of Language Models | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| An Efficient Multilingual Language Model Compression through Vocabulary Trimming | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Text-Augmented Open Knowledge Graph Completion via Pre-Trained Language Models | May 24, 2023 | Knowledge Graph CompletionLanguage Modeling | CodeCode Available | 1 |
| Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings | May 23, 2023 | Community DetectionContrastive Learning | CodeCode Available | 1 |
| VisorGPT: Learning Visual Prior via Generative Pre-Training | May 23, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Automatic Model Selection with Large Language Models for Reasoning | May 23, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 |
| Aligning Large Language Models through Synthetic Feedback | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Frustratingly Simple Decoding Method for Neural Text Generation | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Word Embeddings Are Steers for Language Models | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Making Language Models Better Tool Learners with Execution Feedback | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints | May 22, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| MvP: Multi-view Prompting Improves Aspect Sentiment Tuple Prediction | May 22, 2023 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| How Language Model Hallucinations Can Snowball | May 22, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| A Study of Generative Large Language Model for Medical Research and Healthcare | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs | May 21, 2023 | Data AugmentationGraph Generation | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining | May 20, 2023 | Extractive SummarizationKnowledge Distillation | CodeCode Available | 1 |
| Decouple knowledge from parameters for plug-and-play language modeling | May 19, 2023 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering | May 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding | May 19, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model | May 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression | May 17, 2023 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning | May 17, 2023 | ClusteringLanguage Modeling | CodeCode Available | 1 |
| CoEdIT: Text Editing by Task-Specific Instruction Tuning | May 17, 2023 | Formality Style TransferGrammatical Error Correction | CodeCode Available | 1 |
| A Better Way to Do Masked Language Model Scoring | May 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SatLM: Satisfiability-Aided Language Models Using Declarative Prompting | May 16, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| Pre-Training to Learn in Context | May 16, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Dual-Alignment Pre-training for Cross-lingual Sentence Embedding | May 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MPI-rical: Data-Driven MPI Distributed Parallelism Assistance with Transformers | May 16, 2023 | Code CompletionCode Generation | CodeCode Available | 1 |
| Knowledge Rumination for Pre-trained Language Models | May 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving End-to-End SLU performance with Prosodic Attention and Distillation | May 14, 2023 | intent-classificationIntent Classification | CodeCode Available | 1 |
| Pre-trained Language Model with Prompts for Temporal Knowledge Graph Completion | May 13, 2023 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 1 |
| LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development | May 12, 2023 | Knowledge ProbingLanguage Modeling | CodeCode Available | 1 |
| Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation | May 12, 2023 | FairnessLanguage Modeling | CodeCode Available | 1 |
| Self-Chained Image-Language Model for Video Localization and Question Answering | May 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bot or Human? Detecting ChatGPT Imposters with A Single Question | May 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |