| Parameter-Efficient Conversational Recommender System as a Language Processing Task | Jan 25, 2024 | Dialogue GenerationKnowledge Graphs | CodeCode Available | 1 |
| Improving Natural Language Capability of Code Large Language Model | Jan 25, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| Accelerating Retrieval-Augmented Language Model Serving with Speculation | Jan 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LocMoE: A Low-Overhead MoE for Large Language Model Training | Jan 25, 2024 | AllLanguage Modeling | —Unverified | 0 |
| TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation | Jan 25, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| Large language model empowered participatory urban planning | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification | Jan 24, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| Fluent dreaming for language models | Jan 24, 2024 | Adversarial AttackLanguage Modeling | CodeCode Available | 1 |
| MambaByte: Token-free Selective State Space Model | Jan 24, 2024 | Computational EfficiencyInductive Bias | —Unverified | 0 |
| Supporting Sensemaking of Large Language Model Outputs at Scale | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MaLA-500: Massive Language Adaptation of Large Language Models | Jan 24, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Unified Approach to Emotion Detection and Task-Oriented Dialogue Modeling | Jan 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Malaysian Language Model Based on Mistral for Enhanced Local Language Understanding | Jan 24, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| ChatterBox: Multi-round Multimodal Referring and Grounding | Jan 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MLLMReID: Multimodal Large Language Model-based Person Re-identification | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data | Jan 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Training-Free Action Recognition and Goal Inference with Dynamic Frame Selection | Jan 23, 2024 | Action RecognitionLanguage Modeling | —Unverified | 0 |
| DsDm: Model-Aware Dataset Selection with Datamodels | Jan 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Small Language Model Meets with Reinforced Vision Vocabulary | Jan 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote Control | Jan 23, 2024 | Deep Reinforcement LearningKnowledge Distillation | —Unverified | 0 |
| In-Context Language Learning: Architectures and Algorithms | Jan 23, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels | Jan 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How well can a large language model explain business processes as perceived by users? | Jan 23, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| XAI for All: Can Large Language Models Simplify Explainable AI? | Jan 23, 2024 | AllDecision Making | —Unverified | 0 |
| Generating Zero-shot Abstractive Explanations for Rumour Verification | Jan 23, 2024 | Few-Shot LearningInformativeness | CodeCode Available | 0 |
| Comparing Pre-trained Human Language Models: Is it Better with Human Context as Groups, Individual Traits, or Both? | Jan 23, 2024 | Age EstimationLanguage Modeling | —Unverified | 0 |
| ChatGraph: Chat with Your Graphs | Jan 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study | Jan 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can Large Language Models Write Parallel Code? | Jan 23, 2024 | Code CompletionCode Generation | CodeCode Available | 1 |
| CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing | Jan 22, 2024 | AudioCapsAudio-Visual Synchronization | —Unverified | 0 |
| SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning | Jan 22, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| West-of-N: Synthetic Preferences for Self-Improving Reward Models | Jan 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers | Jan 22, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Large Language Model based Multi-Agents: A Survey of Progress and Challenges | Jan 21, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 5 |
| Training microrobots to swim by a large language model | Jan 21, 2024 | Decision MakingFew-Shot Learning | —Unverified | 0 |
| Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion | Jan 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LLMRA: Multi-modal Large Language Model based Restoration Assistant | Jan 21, 2024 | Image RestorationLanguage Modeling | —Unverified | 0 |
| MolTailor: Tailoring Chemical Molecular Representation to Specific Tasks via Text Prompts | Jan 21, 2024 | Drug DiscoveryLanguage Modeling | CodeCode Available | 1 |
| With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text Generation | Jan 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| AttentionLego: An Open-Source Building Block For Spatially-Scalable Large Language Model Accelerator With Processing-In-Memory Technology | Jan 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Integration of Large Language Models in Control of EHD Pumps for Precise Color Synthesis | Jan 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Using Large Language Model for End-to-End Chinese ASR and NER | Jan 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Embedding Ontologies via Incorporating Extensional and Intensional Knowledge | Jan 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion | Jan 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Using LLMs to discover emerging coded antisemitic hate-speech in extremist social media | Jan 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually | Jan 19, 2024 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| Critical Data Size of Language Models from a Grokking Perspective | Jan 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Jan 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning | Jan 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |