| Radiology-GPT: A Large Language Model for Radiology | Jun 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models | Jun 14, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Revealing the structure of language model capabilities | Jun 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| NoCoLA: The Norwegian Corpus of Linguistic Acceptability | Jun 13, 2023 | Binary ClassificationDiagnostic | CodeCode Available | 0 |
| PauseSpeech: Natural Speech Synthesis via Pre-trained Language Model and Pause-based Prosody Modeling | Jun 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AVIS: Autonomous Visual Information Seeking with Large Language Model Agent | Jun 13, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Large-scale Language Model Rescoring on Long-form Data | Jun 13, 2023 | FormLanguage Modeling | —Unverified | 0 |
| InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions | Jun 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing | Jun 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large language models and (non-)linguistic recursion | Jun 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Augmenting Language Models with Long-Term Memory | Jun 12, 2023 | FormIn-Context Learning | —Unverified | 0 |
| Weakly supervised information extraction from inscrutable handwritten document images | Jun 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RoBERTweet: A BERT Language Model for Romanian Tweets | Jun 11, 2023 | Language IdentificationLanguage Modeling | —Unverified | 0 |
| Language-Guided Traffic Simulation via Scene-Level Diffusion | Jun 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC | Jun 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Models Can Learn Exceptions to Syntactic Rules | Jun 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| S^3: Increasing GPU Utilization during Generative Inference for Higher Throughput | Jun 9, 2023 | GPULanguage Modeling | —Unverified | 0 |
| Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect? | Jun 9, 2023 | Adversarial TextLanguage Modeling | —Unverified | 0 |
| Robot Task Planning Based on Large Language Model Representing Knowledge with Directed Graph Structures | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding | Jun 8, 2023 | dialog state trackingLanguage Modeling | —Unverified | 0 |
| Mapping Brains with Language Models: A Survey | Jun 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Language Model Integration for Neural Machine Translation | Jun 8, 2023 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding | Jun 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions | Jun 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Long-form analogies generated by chatGPT lack human-like psycholinguistic properties | Jun 7, 2023 | FormLanguage Modeling | —Unverified | 0 |
| Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairs | Jun 7, 2023 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers | Jun 7, 2023 | Document ClassificationLanguage Modeling | —Unverified | 0 |
| Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering | Jun 7, 2023 | Graph Question AnsweringLanguage Modeling | —Unverified | 0 |
| Benchmarking Foundation Models with Language-Model-as-an-Examiner | Jun 7, 2023 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization | Jun 7, 2023 | Abstractive Text SummarizationDecoder | —Unverified | 0 |
| Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems | Jun 7, 2023 | Conversational Response SelectionDecoder | CodeCode Available | 0 |
| Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization | Jun 7, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer | Jun 7, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning | Jun 7, 2023 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Soft-prompt Tuning for Large Language Models to Evaluate Bias | Jun 7, 2023 | FairnessLanguage Modeling | —Unverified | 0 |
| TKDP: Threefold Knowledge-enriched Deep Prompt Tuning for Few-shot Named Entity Recognition | Jun 6, 2023 | few-shot-nerFew-shot NER | CodeCode Available | 0 |
| Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias | Jun 6, 2023 | AttributeInductive Bias | —Unverified | 0 |
| Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics | Jun 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction | Jun 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A generative framework for conversational laughter: Its 'language model' and laughter sound synthesis | Jun 6, 2023 | DiversityLanguage Modeling | —Unverified | 0 |
| Iterative Translation Refinement with Large Language Models | Jun 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CELDA: Leveraging Black-box Language Model as Enhanced Classifier without Labels | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluation of AI Chatbots for Patient-Specific EHR Questions | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Information Flow Control in Machine Learning through Modular Model Architecture | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Benchmarking Middle-Trained Language Models for Neural Search | Jun 5, 2023 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| CTRL: Connect Collaborative and Language Model for CTR Prediction | Jun 5, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model | Jun 5, 2023 | Cross-Lingual TransferLanguage Modeling | —Unverified | 0 |
| A Scalable and Adaptive System to Infer the Industry Sectors of Companies: Prompt + Model Tuning of Generative Language Models | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoSiNES: Contrastive Siamese Network for Entity Standardization | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |