Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability May 17, 2023 Language Modeling Language Modelling
— Unverified 0Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents May 17, 2023 Language Modelling text-classification
— Unverified 0Token-wise Decomposition of Autoregressive Language Model Hidden States for Analyzing Model Predictions May 17, 2023 Language Modeling Language Modelling
Code Code Available 0Controllable Speaking Styles Using a Large Language Model May 17, 2023 Language Modeling Language Modelling
— Unverified 0Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback May 17, 2023 In-Context Learning Language Modeling
Code Code Available 2DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning May 17, 2023 Clustering Language Modeling
Code Code Available 1AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression May 17, 2023 Knowledge Distillation Language Modeling
Code Code Available 1A Better Way to Do Masked Language Model Scoring May 17, 2023 Language Modeling Language Modelling
Code Code Available 1A Survey on Zero Pronoun Translation May 17, 2023 Language Modelling Large Language Model
— Unverified 0CageViT: Convolutional Activation Guided Efficient Vision Transformer May 17, 2023 Computational Efficiency image-classification
— Unverified 0PaLM 2 Technical Report May 17, 2023 Code Generation Common Sense Reasoning
— Unverified 0PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering May 17, 2023 Benchmarking Diagnostic
Code Code Available 1SLiC-HF: Sequence Likelihood Calibration with Human Feedback May 17, 2023 Language Modeling Language Modelling
— Unverified 0Generation of 3D Molecules in Pockets via Language Model May 17, 2023 3D Molecule Generation Drug Design
— Unverified 0SatLM: Satisfiability-Aided Language Models Using Declarative Prompting May 16, 2023 Arithmetic Reasoning Language Modeling
Code Code Available 1Application-Agnostic Language Modeling for On-Device ASR May 16, 2023 Automatic Speech Recognition Language Modeling
— Unverified 0SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification May 16, 2023 Decoder Language Modeling
Code Code Available 3MPI-rical: Data-Driven MPI Distributed Parallelism Assistance with Transformers May 16, 2023 Code Completion Code Generation
Code Code Available 1StructGPT: A General Framework for Large Language Model to Reason over Structured Data May 16, 2023 Language Modeling Language Modelling
Code Code Available 2Pre-Training to Learn in Context May 16, 2023 In-Context Learning Language Modeling
Code Code Available 1Towards Unifying Multi-Lingual and Cross-Lingual Summarization May 16, 2023 Language Modeling Language Modelling
— Unverified 0CWTM: Leveraging Contextualized Word Embeddings from BERT for Neural Topic Modeling May 16, 2023 Document Classification Language Modelling
Code Code Available 0Dual-Alignment Pre-training for Cross-lingual Sentence Embedding May 16, 2023 Language Modeling Language Modelling
Code Code Available 1NeuSTIP: A Novel Neuro-Symbolic Model for Link and Time Prediction in Temporal Knowledge Graphs May 15, 2023 Knowledge Graph Completion Knowledge Graphs
— Unverified 0Large Language Model Guided Tree-of-Thought May 15, 2023 Language Modeling Language Modelling
Code Code Available 2NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist May 15, 2023 Controllable Language Modelling Dialogue Generation
Code Code Available 3Natural Language Decomposition and Interpretation of Complex Utterances May 15, 2023 Language Modeling Language Modelling
— Unverified 0Estimating the Causal Effects of Natural Logic Features in Neural NLI Models May 15, 2023 Language Modelling
— Unverified 0A Language Model of Java Methods with Train/Test Deduplication May 15, 2023 Descriptive Language Modeling
Code Code Available 0DarkBERT: A Language Model for the Dark Side of the Internet May 15, 2023 Diversity Language Modeling
— Unverified 0Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text May 15, 2023 graph construction In-Context Learning
— Unverified 0Knowledge Rumination for Pre-trained Language Models May 15, 2023 Language Modeling Language Modelling
Code Code Available 1Unsupervised Sentence Representation Learning with Frequency-induced Adversarial Tuning and Incomplete Sentence Filtering May 15, 2023 Language Modelling Representation Learning
Code Code Available 0Watermarking Text Generated by Black-Box Language Models May 14, 2023 Adversarial Robustness Language Modelling
Code Code Available 1Improving End-to-End SLU performance with Prosodic Attention and Distillation May 14, 2023 intent-classification Intent Classification
Code Code Available 1Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction May 14, 2023 Language Modelling
Code Code Available 1Scalable Educational Question Generation with Pre-trained Language Models May 13, 2023 Language Modeling Language Modelling
Code Code Available 0Pre-trained Language Model with Prompts for Temporal Knowledge Graph Completion May 13, 2023 Knowledge Graph Completion Knowledge Graphs
Code Code Available 1The Machine Psychology of Cooperation: Can GPT models operationalise prompts for altruism, cooperation, competitiveness and selfishness in economic games? May 13, 2023 Experimental Design Language Modelling
Code Code Available 1Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts May 13, 2023 Language Modelling Meeting Summarization
— Unverified 0MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers May 12, 2023 Decoder Density Estimation
— Unverified 0Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2 into a Robot Language Model for Grounded Task Planning May 12, 2023 Language Modeling Language Modelling
— Unverified 0Using Language Models to Detect Alarming Student Responses May 12, 2023 Language Modeling Language Modelling
— Unverified 0Two-in-One: A Model Hijacking Attack Against Text Generation Models May 12, 2023 Classification Face Recognition
— Unverified 0Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation May 12, 2023 Cross-Lingual Transfer Dialogue Generation
Code Code Available 0Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation May 12, 2023 Fairness Language Modeling
Code Code Available 1LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development May 12, 2023 Knowledge Probing Language Modeling
Code Code Available 1Text2Cohort: Facilitating Intuitive Access to Biomedical Data with Natural Language Cohort Discovery May 12, 2023 Language Modelling Large Language Model
Code Code Available 0ArtGPT-4: Towards Artistic-understanding Large Vision-Language Models with Enhanced Adapter May 12, 2023 Image Comprehension Language Modelling
Code Code Available 1Self-Chained Image-Language Model for Video Localization and Question Answering May 11, 2023 Language Modeling Language Modelling
Code Code Available 1