| Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect? | Jun 9, 2023 | Adversarial TextLanguage Modeling | —Unverified | 0 |
| S^3: Increasing GPU Utilization during Generative Inference for Higher Throughput | Jun 9, 2023 | GPULanguage Modeling | —Unverified | 0 |
| FinGPT: Open-Source Financial Large Language Models | Jun 9, 2023 | Algorithmic TradingLanguage Modeling | CodeCode Available | 6 |
| Simple and Controllable Music Generation | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding | Jun 8, 2023 | dialog state trackingLanguage Modeling | —Unverified | 0 |
| Hexatagging: Projective Dependency Parsing as Tagging | Jun 8, 2023 | Computational EfficiencyDependency Parsing | CodeCode Available | 1 |
| K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Improving Language Model Integration for Neural Machine Translation | Jun 8, 2023 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding | Jun 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance | Jun 8, 2023 | Conversational Question AnsweringLanguage Modeling | CodeCode Available | 2 |
| Mapping Brains with Language Models: A Survey | Jun 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Robot Task Planning Based on Large Language Model Representing Knowledge with Directed Graph Structures | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Jun 8, 2023 | Answer GenerationFact Checking | CodeCode Available | 2 |
| Soft-prompt Tuning for Large Language Models to Evaluate Bias | Jun 7, 2023 | FairnessLanguage Modeling | —Unverified | 0 |
| Privately generating tabular data using language models | Jun 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization | Jun 7, 2023 | Abstractive Text SummarizationDecoder | —Unverified | 0 |
| Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairs | Jun 7, 2023 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions | Jun 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems | Jun 7, 2023 | Conversational Response SelectionDecoder | CodeCode Available | 0 |
| Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers | Jun 7, 2023 | Document ClassificationLanguage Modeling | —Unverified | 0 |
| Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering | Jun 7, 2023 | Graph Question AnsweringLanguage Modeling | —Unverified | 0 |
| Benchmarking Foundation Models with Language-Model-as-an-Examiner | Jun 7, 2023 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Long-form analogies generated by chatGPT lack human-like psycholinguistic properties | Jun 7, 2023 | FormLanguage Modeling | —Unverified | 0 |
| Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning | Jun 7, 2023 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer | Jun 7, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization | Jun 7, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Inference-Time Intervention: Eliciting Truthful Answers from a Language Model | Jun 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LLMZip: Lossless Text Compression using Large Language Models | Jun 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| On the Difference of BERT-style and CLIP-style Text Encoders | Jun 6, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images! | Jun 6, 2023 | counterfactualData Augmentation | CodeCode Available | 1 |
| TKDP: Threefold Knowledge-enriched Deep Prompt Tuning for Few-shot Named Entity Recognition | Jun 6, 2023 | few-shot-nerFew-shot NER | CodeCode Available | 0 |
| A generative framework for conversational laughter: Its 'language model' and laughter sound synthesis | Jun 6, 2023 | DiversityLanguage Modeling | —Unverified | 0 |
| Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics | Jun 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction | Jun 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Iterative Translation Refinement with Large Language Models | Jun 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias | Jun 6, 2023 | AttributeInductive Bias | —Unverified | 0 |
| Semantically-Prompted Language Models Improve Visual Descriptions | Jun 5, 2023 | ClassificationDescriptive | —Unverified | 0 |
| AutoScrum: Automating Project Planning Using Large Language Models | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Scalable and Adaptive System to Infer the Industry Sectors of Companies: Prompt + Model Tuning of Generative Language Models | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Information Flow Control in Machine Learning through Modular Model Architecture | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoSiNES: Contrastive Siamese Network for Entity Standardization | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Benchmarking Middle-Trained Language Models for Neural Search | Jun 5, 2023 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| CTRL: Connect Collaborative and Language Model for CTR Prediction | Jun 5, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model | Jun 5, 2023 | Cross-Lingual TransferLanguage Modeling | —Unverified | 0 |
| COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Conversational Recommendation Systems via Counterfactual Data Simulation | Jun 5, 2023 | Conversational Recommendationcounterfactual | CodeCode Available | 1 |
| CELDA: Leveraging Black-box Language Model as Enhanced Classifier without Labels | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |