| Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 0 |
| Acquiring Frame Element Knowledge with Deep Metric Learning for Semantic Frame Induction | May 23, 2023 | ClusteringLanguage Modeling | —Unverified | 0 |
| Error Detection for Text-to-SQL Semantic Parsing | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings | May 23, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models | May 23, 2023 | AllFairness | —Unverified | 0 |
| AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| R2H: Building Multimodal Navigation Helpers that Respond to Help Requests | May 23, 2023 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia | May 23, 2023 | ChatbotHallucination | CodeCode Available | 3 |
| VisorGPT: Learning Visual Prior via Generative Pre-Training | May 23, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Query Rewriting for Retrieval-Augmented Large Language Models | May 23, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language Models | May 23, 2023 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model | May 23, 2023 | AvgLanguage Modeling | —Unverified | 0 |
| When the Music Stops: Tip-of-the-Tongue Retrieval for Music | May 23, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization | May 23, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Robust Prompt Optimization for Large Language Models Against Distribution Shifts | May 23, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automatic Model Selection with Large Language Models for Reasoning | May 23, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 |
| Hierarchical Prompting Assists Large Language Model on Web Navigation | May 23, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Improving Factuality and Reasoning in Language Models through Multiagent Debate | May 23, 2023 | Few-Shot LearningLanguage Modeling | CodeCode Available | 2 |
| Dr.ICL: Demonstration-Retrieved In-context Learning | May 23, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes | May 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Study of Generative Large Language Model for Medical Research and Healthcare | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can LLMs facilitate interpretation of pre-trained language models? | May 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Language Model Hallucinations Can Snowball | May 22, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| ConQueR: Contextualized Query Reduction using Search Logs | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints | May 22, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Frustratingly Simple Decoding Method for Neural Text Generation | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bidirectional Transformer Reranker for Grammatical Error Correction | May 22, 2023 | DecoderGrammatical Error Correction | CodeCode Available | 0 |
| Extrapolating Multilingual Understanding Models as Multilingual Generators | May 22, 2023 | DenoisingLanguage Modeling | —Unverified | 0 |
| Distilling ChatGPT for Explainable Automated Student Answer Assessment | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Evaluating Pragmatic Abilities of Image Captioners on A3DS | May 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PrOnto: Language Model Evaluations for 859 Languages | May 22, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 0 |
| Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents | May 22, 2023 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 |
| LMGQS: A Large-scale Dataset for Query-focused Summarization | May 22, 2023 | DiversityLanguage Modeling | —Unverified | 0 |
| The Influence of ChatGPT on Artificial Intelligence Related Crypto Assets: Evidence from a Synthetic Control Analysis | May 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Text-based Person Search without Parallel Image-Text Data | May 22, 2023 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Making Language Models Better Tool Learners with Execution Feedback | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SPARSEFIT: Few-shot Prompting with Sparse Fine-tuning for Jointly Generating Predictions and Natural Language Explanations | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MvP: Multi-view Prompting Improves Aspect Sentiment Tuple Prediction | May 22, 2023 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| Observations on LLMs for Telecom Domain: Capabilities and Limitations | May 22, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Training Diffusion Models with Reinforcement Learning | May 22, 2023 | Decision MakingDenoising | CodeCode Available | 2 |
| Word Embeddings Are Steers for Language Models | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GPT-SW3: An Autoregressive Language Model for the Nordic Languages | May 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Federated Learning of Medical Concepts Embedding using BEHRT | May 22, 2023 | Federated LearningLanguage Modeling | CodeCode Available | 0 |
| Enhance Reasoning Ability of Visual-Language Models via Large Language Models | May 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Infor-Coef: Information Bottleneck-based Dynamic Token Downsampling for Compact and Efficient language model | May 21, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Pilot Study on Dialogue-Level Dependency Parsing for Chinese | May 21, 2023 | Dependency ParsingLanguage Modeling | —Unverified | 0 |