| Transcending the Attention Paradigm: Representation Learning from Geospatial Social Media Data | Oct 9, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Scaling Studies for Efficient Parameter Search and Parallelism for Large Language Model Pre-training | Oct 9, 2023 | DecoderGPU | —Unverified | 0 |
| CCAE: A Corpus of Chinese-based Asian Englishes | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Meta-Learning Perspective on Transformers for Causal Language Modeling | Oct 9, 2023 | Causal Language ModelingLanguage Modeling | —Unverified | 0 |
| Guiding Language Model Reasoning with Planning Tokens | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Factual and Personalized Recommendations using Language Models and Reinforcement Learning | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Estimating Numbers without Regression | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Breaking Down Word Semantics from Pre-trained Language Models through Layer-wise Dimension Selection | Oct 8, 2023 | Binary ClassificationLanguage Modeling | —Unverified | 0 |
| Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generative Spoken Language Model based on continuous word-sized audio tokens | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data | Oct 8, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Optimizing Large Language Models to Expedite the Development of Smart Contracts | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Synslator: An Interactive Machine Translation Tool with Online Learning | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MindfulDiary: Harnessing Large Language Model to Support Psychiatric Patients' Journaling | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tree-GPT: Modular Large Language Model Expert System for Forest Remote Sensing Image Understanding and Interactive Analysis | Oct 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models | Oct 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Question-focused Summarization by Decomposing Articles into Facts and Opinions and Retrieving Entities | Oct 7, 2023 | ArticlesDecision Making | —Unverified | 0 |
| ILuvUI: Instruction-tuned LangUage-Vision modeling of UIs from Machine Conversations | Oct 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From task structures to world models: What do LLMs know? | Oct 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BrainSCUBA: Fine-Grained Natural Language Captions of Visual Cortex Selectivity | Oct 6, 2023 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Functional Interpolation for Relative Positions Improves Long Context Transformers | Oct 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Quantized Transformer Language Model Implementations on Edge Devices | Oct 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TRAM: Bridging Trust Regions and Sharpness Aware Minimization | Oct 5, 2023 | Cross-Lingual TransferDomain Generalization | CodeCode Available | 0 |
| Neural Language Model Pruning for Automatic Speech Recognition | Oct 5, 2023 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Controllable Multi-document Summarization: Coverage & Coherence Intuitive Policy with Large Language Model Based Rewards | Oct 5, 2023 | Document SummarizationLanguage Modeling | —Unverified | 0 |