| Parameter-Efficient Conversational Recommender System as a Language Processing Task | Jan 25, 2024 | Dialogue GenerationKnowledge Graphs | CodeCode Available | 1 |
| LocMoE: A Low-Overhead MoE for Large Language Model Training | Jan 25, 2024 | AllLanguage Modeling | —Unverified | 0 |
| TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation | Jan 25, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support | Jan 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards 3D Molecule-Text Interpretation in Language Models | Jan 25, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification | Jan 24, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| Large language model empowered participatory urban planning | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fluent dreaming for language models | Jan 24, 2024 | Adversarial AttackLanguage Modeling | CodeCode Available | 1 |
| MambaByte: Token-free Selective State Space Model | Jan 24, 2024 | Computational EfficiencyInductive Bias | —Unverified | 0 |
| Supporting Sensemaking of Large Language Model Outputs at Scale | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Unified Approach to Emotion Detection and Task-Oriented Dialogue Modeling | Jan 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Malaysian Language Model Based on Mistral for Enhanced Local Language Understanding | Jan 24, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| MaLA-500: Massive Language Adaptation of Large Language Models | Jan 24, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatterBox: Multi-round Multimodal Referring and Grounding | Jan 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MLLMReID: Multimodal Large Language Model-based Person Re-identification | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DsDm: Model-Aware Dataset Selection with Datamodels | Jan 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study | Jan 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| XAI for All: Can Large Language Models Simplify Explainable AI? | Jan 23, 2024 | AllDecision Making | —Unverified | 0 |
| Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels | Jan 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How well can a large language model explain business processes as perceived by users? | Jan 23, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote Control | Jan 23, 2024 | Deep Reinforcement LearningKnowledge Distillation | —Unverified | 0 |
| In-Context Language Learning: Architectures and Algorithms | Jan 23, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| Generating Zero-shot Abstractive Explanations for Rumour Verification | Jan 23, 2024 | Few-Shot LearningInformativeness | CodeCode Available | 0 |