| Learning to Maximize Mutual Information for Chain-of-Thought Distillation | Mar 5, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 0 |
| Evaluating and Optimizing Educational Content with Large Language Model Judgments | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DPPA: Pruning Method for Large Language Model to Model Merging | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment | Mar 5, 2024 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| MeanCache: User-Centric Semantic Caching for LLM Web Services | Mar 5, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| Towards Training A Chinese Large Language Model for Anesthesiology | Mar 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Word Importance Explains How Prompts Affect Language Model Outputs | Mar 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection | Mar 5, 2024 | Concept AlignmentExplanation Generation | —Unverified | 0 |
| Socratic Reasoning Improves Positive Text Rewriting | Mar 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RegionGPT: Towards Region Understanding Vision Language Model | Mar 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Non-autoregressive Sequence-to-Sequence Vision-Language Models | Mar 4, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 |
| NoteLLM: A Retrievable Large Language Model for Note Recommendation | Mar 4, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| Towards Intent-Based Network Management: Large Language Models for Intent Extraction in 5G Core Networks | Mar 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE Transformers | Mar 4, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation | Mar 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Based Evolutionary Optimizer: Reasoning with elitism | Mar 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLM vs. Lawyers: Identifying a Subset of Summary Judgments in a Large UK Case Law Dataset | Mar 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics | Mar 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Revisiting Dynamic Evaluation: Online Adaptation for Large Language Models | Mar 3, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| SyllabusQA: A Course Logistics Question Answering Dataset | Mar 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| OVEL: Large Language Model as Memory Manager for Online Video Entity Linking | Mar 3, 2024 | Entity LinkingLanguage Modeling | —Unverified | 0 |
| SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code | Mar 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning | Mar 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks | Mar 2, 2024 | Computer SecurityLanguage Modeling | —Unverified | 0 |