| Token-Level Fitting Issues of Seq2seq Models | May 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MultiModal-GPT: A Vision and Language Model for Dialogue with Humans | May 8, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 3 |
| Toeplitz Neural Network for Sequence Modeling | May 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Scene Text Recognition with Image-Text Matching-guided Dictionary | May 8, 2023 | Image-text matchingLanguage Modeling | —Unverified | 0 |
| Empowering Language Model with Guided Knowledge Fusion for Biomedical Document Re-ranking | May 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Refining the Responses of LLMs by Themselves | May 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Pre-training Language Model as a Multi-perspective Course Learner | May 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Low-Resource Approach to the Grammatical Error Correction of Ukrainian | May 5, 2023 | Grammatical Error CorrectionLanguage Modeling | CodeCode Available | 0 |
| Simulating H.P. Lovecraft horror literature with the ChatGPT large language model | May 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Mixed Large Language Model Signals for Science Question Answering | May 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation | May 5, 2023 | DecoderDomain Adaptation | —Unverified | 0 |
| Now It Sounds Like You: Learning Personalized Vocabulary On Device | May 5, 2023 | Federated LearningLanguage Modeling | —Unverified | 0 |
| Retrieval Augmented Chest X-Ray Report Generation using OpenAI GPT models | May 5, 2023 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic | May 5, 2023 | Epistemic ReasoningLanguage Modeling | CodeCode Available | 1 |
| Towards antigenic peptide discovery with better MHC-I binding prediction and improved benchmark methodology | May 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging BERT Language Model for Arabic Long Document Classification | May 4, 2023 | ClassificationDocument Classification | —Unverified | 0 |
| Masked Structural Growth for 2x Faster Language Model Pre-training | May 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Gpt-4: A Review on Advancements and Opportunities in Natural Language Processing | May 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuning | May 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction | May 4, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Personalized Abstractive Summarization by Tri-agent Generation Pipeline | May 4, 2023 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction | May 4, 2023 | Contrastive Learningdocument understanding | —Unverified | 0 |
| Interpretable Sentence Representation with Variational Autoencoders and Attention | May 4, 2023 | DisentanglementInductive Bias | —Unverified | 0 |
| On the Expressivity Role of LayerNorm in Transformers' Attention | May 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence | May 4, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Surveying Generative AI's Economic Expectations | May 4, 2023 | ArticlesLanguage Modeling | —Unverified | 0 |
| Using Language Models on Low-end Hardware | May 3, 2023 | ClassificationLanguage Modeling | —Unverified | 0 |
| Towards Imperceptible Document Manipulations against Neural Ranking Models | May 3, 2023 | Adversarial TextLanguage Modeling | —Unverified | 0 |
| Entity Tracking in Language Models | May 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Defending against Insertion-based Textual Backdoor Attacks via Attribution | May 3, 2023 | Backdoor AttackLanguage Modeling | CodeCode Available | 0 |
| ChatGraph: Interpretable Text Classification by Converting ChatGPT Knowledge to Graphs | May 3, 2023 | ClassificationDecision Making | CodeCode Available | 0 |
| WangLab at MEDIQA-Chat 2023: Clinical Note Generation from Doctor-Patient Conversations using Large Language Models | May 3, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| CodeGen2: Lessons for Training LLMs on Programming and Natural Languages | May 3, 2023 | Causal Language ModelingDecoder | CodeCode Available | 5 |
| Zero-Shot Listwise Document Reranking with a Large Language Model | May 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers | May 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| KEPLET: Knowledge-Enhanced Pretrained Language Model with Topic Entity Awareness | May 2, 2023 | Entity LinkingLanguage Modeling | —Unverified | 0 |
| FreeLM: Fine-Tuning-Free Language Model | May 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Huatuo-26M, a Large-scale Chinese Medical QA Dataset | May 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| How to Unleash the Power of Large Language Models for Few-shot Relation Extraction? | May 2, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Self-Evaluation Guided Beam Search for Reasoning | May 1, 2023 | Arithmetic ReasoningGSM8K | —Unverified | 0 |
| An Iterative Algorithm for Rescaled Hyperbolic Functions Regression | May 1, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Apr 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Working Memory Capacity of ChatGPT: An Empirical Study | Apr 30, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| TALLRec: An Effective and Efficient Tuning Framework to Align Large Language Model with Recommendation | Apr 30, 2023 | Domain GeneralizationIn-Context Learning | CodeCode Available | 2 |
| A Review of ChatGPT Applications in Education, Marketing, Software Engineering, and Healthcare: Benefits, Drawbacks, and Research Directions | Apr 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis | Apr 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards autonomous system: flexible modular production system enhanced with large language model agents | Apr 28, 2023 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation | Apr 28, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| ChatGPT Evaluation on Sentence Level Relations: A Focus on Temporal, Causal, and Discourse Relations | Apr 28, 2023 | Discourse ParsingIn-Context Learning | —Unverified | 0 |
| Explainable Verbal Reasoner Plus (EVR+): A Natural Language Reasoning Framework that Supports Diverse Compositional Reasoning | Apr 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |