| SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification | May 16, 2023 | DecoderLanguage Modeling | CodeCode Available | 3 |
| StructGPT: A General Framework for Large Language Model to Reason over Structured Data | May 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MPI-rical: Data-Driven MPI Distributed Parallelism Assistance with Transformers | May 16, 2023 | Code CompletionCode Generation | CodeCode Available | 1 |
| Pre-Training to Learn in Context | May 16, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Towards Unifying Multi-Lingual and Cross-Lingual Summarization | May 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dual-Alignment Pre-training for Cross-lingual Sentence Embedding | May 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NeuSTIP: A Novel Neuro-Symbolic Model for Link and Time Prediction in Temporal Knowledge Graphs | May 15, 2023 | Knowledge Graph CompletionKnowledge Graphs | —Unverified | 0 |
| Natural Language Decomposition and Interpretation of Complex Utterances | May 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model Guided Tree-of-Thought | May 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Language Model of Java Methods with Train/Test Deduplication | May 15, 2023 | DescriptiveLanguage Modeling | CodeCode Available | 0 |
| DarkBERT: A Language Model for the Dark Side of the Internet | May 15, 2023 | DiversityLanguage Modeling | —Unverified | 0 |
| Knowledge Rumination for Pre-trained Language Models | May 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving End-to-End SLU performance with Prosodic Attention and Distillation | May 14, 2023 | intent-classificationIntent Classification | CodeCode Available | 1 |
| Scalable Educational Question Generation with Pre-trained Language Models | May 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Pre-trained Language Model with Prompts for Temporal Knowledge Graph Completion | May 13, 2023 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 1 |
| MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers | May 12, 2023 | DecoderDensity Estimation | —Unverified | 0 |
| Using Language Models to Detect Alarming Student Responses | May 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2 into a Robot Language Model for Grounded Task Planning | May 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation | May 12, 2023 | FairnessLanguage Modeling | CodeCode Available | 1 |
| LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development | May 12, 2023 | Knowledge ProbingLanguage Modeling | CodeCode Available | 1 |
| Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation | May 12, 2023 | Cross-Lingual TransferDialogue Generation | CodeCode Available | 0 |
| Two-in-One: A Model Hijacking Attack Against Text Generation Models | May 12, 2023 | ClassificationFace Recognition | —Unverified | 0 |
| Self-Chained Image-Language Model for Video Localization and Question Answering | May 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Masked Audio Text Encoders are Effective Multi-Modal Rescorers | May 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach | May 11, 2023 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Musketeer: Joint Training for Multi-task Vision Language Model with Task Explanation Prompts | May 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Detecting Idiomatic Multiword Expressions in Clinical Terminology using Definition-Based Representation Learning | May 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Good are Commercial Large Language Models on African Languages? | May 11, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| How to Index Item IDs for Recommendation Foundation Models | May 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Enriching language models with graph-based context information to better understand textual data | May 10, 2023 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| Bot or Human? Detecting ChatGPT Imposters with A Single Question | May 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM | May 10, 2023 | GPULanguage Modeling | —Unverified | 0 |
| Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer | May 10, 2023 | AttributeLanguage Modeling | —Unverified | 0 |
| Automatic Evaluation of Attribution by Large Language Models | May 10, 2023 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge | May 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Privacy-Preserving Prompt Tuning for Large Language Model Services | May 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards an Automatic Optimisation Model Generator Assisted with Generative Pre-trained Transformer | May 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PLM-GNN: A Webpage Classification Method based on Joint Pre-trained Language Model and Graph Neural Network | May 9, 2023 | Graph Neural NetworkLanguage Modeling | —Unverified | 0 |
| A Taxonomy of Foundation Model based Systems through the Lens of Software Architecture | May 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DeepTextMark: A Deep Learning-Driven Text Watermarking Approach for Identifying Large Language Model Generated Text | May 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Language Model Programs | May 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Estimating related words computationally using language model from the Mahabharata - an Indian epic | May 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Detection of depression on social networks using transformers and ensembles | May 9, 2023 | Depression DetectionLanguage Modeling | CodeCode Available | 0 |
| Effects of sub-word segmentation on performance of transformer language models | May 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatGPT: Vision and Challenges | May 8, 2023 | EthicsLanguage Modeling | —Unverified | 0 |
| Accessible Instruction-Following Agent | May 8, 2023 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| GersteinLab at MEDIQA-Chat 2023: Clinical Note Summarization from Doctor-Patient Conversations through Fine-tuning and In-context Learning | May 8, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Learning Summary-Worthy Visual Representation for Abstractive Summarization in Video | May 8, 2023 | Abstractive Text SummarizationLanguage Modeling | —Unverified | 0 |
| A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues | May 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PromptRank: Unsupervised Keyphrase Extraction Using Prompt | May 8, 2023 | DecoderKeyphrase Extraction | CodeCode Available | 1 |