| Rethinking Memory and Communication Cost for Efficient Large Language Model Training | Oct 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Transformers and Large Language Models for Chemistry and Drug Discovery | Oct 9, 2023 | Drug Discovery, Language Modeling | Unverified | 0 |
| Scaling Studies for Efficient Parameter Search and Parallelism for Large Language Model Pre-training | Oct 9, 2023 | Decoder, GPU | Unverified | 0 |
| Factual and Personalized Recommendations using Language Models and Reinforcement Learning | Oct 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| The Importance of Prompt Tuning for Automated Neuron Explanations | Oct 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Estimating Numbers without Regression | Oct 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | Oct 9, 2023 | Action Recognition, Image Generation | Code Available | 4 |
| Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting | Oct 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Transformer Fusion with Optimal Transport | Oct 9, 2023 | image-classification, Image Classification | Code Available | 1 |
| Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution | Oct 9, 2023 | Attribute, Language Modeling | Code Available | 0 |
| A Meta-Learning Perspective on Transformers for Causal Language Modeling | Oct 9, 2023 | Causal Language Modeling, Language Modeling | Unverified | 0 |
| Guiding Language Model Reasoning with Planning Tokens | Oct 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| GraphLLM: Boosting Graph Reasoning Ability of Large Language Model | Oct 9, 2023 | Graph Learning, Language Modeling | Code Available | 1 |
| Transcending the Attention Paradigm: Representation Learning from Geospatial Social Media Data | Oct 9, 2023 | Benchmarking, Language Modeling | Code Available | 0 |
| CCAE: A Corpus of Chinese-based Asian Englishes | Oct 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| NEFTune: Noisy Embeddings Improve Instruction Finetuning | Oct 9, 2023 | Language Modeling, Language Modelling | Code Available | 6 |
| Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback | Oct 8, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Breaking Down Word Semantics from Pre-trained Language Models through Layer-wise Dimension Selection | Oct 8, 2023 | Binary Classification, Language Modeling | Unverified | 0 |
| Synslator: An Interactive Machine Translation Tool with Online Learning | Oct 8, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Optimizing Large Language Models to Expedite the Development of Smart Contracts | Oct 8, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| InstructDET: Diversifying Referring Object Detection with Generalized Instructions | Oct 8, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| MindfulDiary: Harnessing Large Language Model to Support Psychiatric Patients' Journaling | Oct 8, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Generative Spoken Language Model based on continuous word-sized audio tokens | Oct 8, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) Challenge | Oct 8, 2023 | ARC, Language Modeling | Code Available | 1 |
| UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model | Oct 8, 2023 | Decoder, Language Modeling | Code Available | 1 |