QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search Jun 11, 2023 Domain Adaptation Language Modeling
Code Code Available 1RoBERTweet: A BERT Language Model for Romanian Tweets Jun 11, 2023 Language Identification Language Modeling
— Unverified 0Language-Guided Traffic Simulation via Scene-Level Diffusion Jun 10, 2023 Language Modeling Language Modelling
— Unverified 0Universal Language Modelling agent Jun 10, 2023 Language Modelling Word Translation
— Unverified 0Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC Jun 10, 2023 Language Modeling Language Modelling
— Unverified 0Large Language Models Are Semi-Parametric Reinforcement Learning Agents Jun 9, 2023 Language Modeling Language Modelling
Code Code Available 1Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena Jun 9, 2023 Chatbot Language Modelling
Code Code Available 7PoET: A generative model of protein families as sequences-of-sequences Jun 9, 2023 Language Modelling Multiple Sequence Alignment
Code Code Available 114 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon Jun 9, 2023 Language Modeling Language Modelling
Code Code Available 1AVScan2Vec: Feature Learning on Antivirus Scan Data for Production-Scale Malware Corpora Jun 9, 2023 Language Modelling
— Unverified 0Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions Jun 9, 2023 Hallucination Language Modeling
Code Code Available 1Language Models Can Learn Exceptions to Syntactic Rules Jun 9, 2023 Language Modeling Language Modelling
Code Code Available 0Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect? Jun 9, 2023 Adversarial Text Language Modeling
— Unverified 0Word sense extension Jun 9, 2023 Language Modelling Word Sense Disambiguation
Code Code Available 0FinGPT: Open-Source Financial Large Language Models Jun 9, 2023 Algorithmic Trading Language Modeling
Code Code Available 6How Can Recommender Systems Benefit from Large Language Models: A Survey Jun 9, 2023 Ethics Feature Engineering
Code Code Available 3S^3: Increasing GPU Utilization during Generative Inference for Higher Throughput Jun 9, 2023 GPU Language Modeling
— Unverified 0Simple and Controllable Music Generation Jun 8, 2023 Language Modeling Language Modelling
Code Code Available 6Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding Jun 8, 2023 dialog state tracking Language Modeling
— Unverified 0Hexatagging: Projective Dependency Parsing as Tagging Jun 8, 2023 Computational Efficiency Dependency Parsing
Code Code Available 1Improving Language Model Integration for Neural Machine Translation Jun 8, 2023 Automatic Speech Recognition Language Modeling
— Unverified 0InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding Jun 8, 2023 Language Modeling Language Modelling
— Unverified 0In-Context Learning through the Bayesian Prism Jun 8, 2023 Bayesian Inference In-Context Learning
Code Code Available 0Advancing Italian Biomedical Information Extraction with Transformers-based Models: Methodological Insights and Multicenter Practical Application Jun 8, 2023 Language Modelling Large Language Model
— Unverified 0Multi-Modal Classifiers for Open-Vocabulary Object Detection Jun 8, 2023 Language Modelling Large Language Model
Code Code Available 1Mapping Brains with Language Models: A Survey Jun 8, 2023 Language Modeling Language Modelling
— Unverified 0PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance Jun 8, 2023 Conversational Question Answering Language Modeling
Code Code Available 2PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization Jun 8, 2023 Language Modelling Large Language Model
Code Code Available 2K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization Jun 8, 2023 Language Modeling Language Modelling
Code Code Available 2RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit Jun 8, 2023 Answer Generation Fact Checking
Code Code Available 2Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts Jun 8, 2023 Language Modeling Language Modelling
Code Code Available 0Robot Task Planning Based on Large Language Model Representing Knowledge with Directed Graph Structures Jun 8, 2023 Language Modeling Language Modelling
Code Code Available 0T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification Jun 8, 2023 Classification Cross-Lingual Transfer
Code Code Available 0Soft-prompt Tuning for Large Language Models to Evaluate Bias Jun 7, 2023 Fairness Language Modeling
— Unverified 0Privately generating tabular data using language models Jun 7, 2023 Language Modeling Language Modelling
Code Code Available 1Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization Jun 7, 2023 Abstractive Text Summarization Decoder
— Unverified 0Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems Jun 7, 2023 Conversational Response Selection Decoder
Code Code Available 0Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairs Jun 7, 2023 Data Augmentation Language Modeling
Code Code Available 0Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers Jun 7, 2023 Document Classification Language Modeling
— Unverified 0Benchmarking Foundation Models with Language-Model-as-an-Examiner Jun 7, 2023 Benchmarking Language Modeling
— Unverified 0Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization Jun 7, 2023 Automatic Speech Recognition Decoder
— Unverified 0Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks Jun 7, 2023 Cross-Modal Retrieval Language Modelling
Code Code Available 2Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering Jun 7, 2023 Graph Question Answering Language Modeling
— Unverified 0Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning Jun 7, 2023 Diagnostic Language Modeling
— Unverified 0Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions Jun 7, 2023 Language Modeling Language Modelling
— Unverified 0ModuleFormer: Modularity Emerges from Mixture-of-Experts Jun 7, 2023 Language Modelling Lightweight Deployment
Code Code Available 2Long-form analogies generated by chatGPT lack human-like psycholinguistic properties Jun 7, 2023 Form Language Modeling
— Unverified 0Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer Jun 7, 2023 Domain Adaptation Language Modeling
— Unverified 0Inference-Time Intervention: Eliciting Truthful Answers from a Language Model Jun 6, 2023 Language Modeling Language Modelling
Code Code Available 2LLMZip: Lossless Text Compression using Large Language Models Jun 6, 2023 Language Modeling Language Modelling
Code Code Available 1