MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources | Jun 7, 2024 | Language Modeling, Language Modelling | Unverified 0
Large Generative Graph Models | Jun 7, 2024 | Language Modelling, World Knowledge | Unverified 0
Are Large Language Models the New Interface for Data Pipelines? | Jun 6, 2024 | AutoML, Explainable artificial intelligence | Unverified 0
HORAE: A Domain-Agnostic Language for Automated Service Regulation | Jun 6, 2024 | Language Modeling, Language Modelling | Code Available 0
Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis | Jun 6, 2024 | Decoder, Inductive Bias | Code Available 2
What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages | Jun 6, 2024 | Language Modeling, Language Modelling | Unverified 0
LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model | Jun 6, 2024 | Language Modeling, Language Modelling | Unverified 0
Simplified and Generalized Masked Diffusion for Discrete Data | Jun 6, 2024 | Language Modeling, Language Modelling | Code Available 2
Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation | Jun 6, 2024 | Language Model Evaluation, Language Modeling | Unverified 0
Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning | Jun 6, 2024 | Attribute, Language Modelling | Code Available 1
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model | Jun 6, 2024 | Language Modeling, Language Modelling | Unverified 0
BLSP-Emo: Towards Empathetic Large Speech-Language Models | Jun 6, 2024 | Emotion Recognition, Instruction Following | Code Available 2
BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning | Jun 6, 2024 | Graph Reconstruction, Language Modeling | Unverified 0
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data | Jun 6, 2024 | Denoising, Language Modeling | Code Available 2
Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt | Jun 6, 2024 | Language Modelling, Large Language Model | Code Available 2
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness | Jun 6, 2024 | Language Modeling, Language Modelling | Unverified 0
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs | Jun 6, 2024 | Language Modelling, Large Language Model | Unverified 0
Scaling and evaluating sparse autoencoders | Jun 6, 2024 | Language Modelling | Code Available 4
Confabulation: The Surprising Value of Large Language Model Hallucinations | Jun 6, 2024 | Hallucination, Language Modeling | Unverified 0
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation | Jun 6, 2024 | Language Modelling, Low-Rank Matrix Completion | Code Available 1
Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model | Jun 6, 2024 | Chinese Word Segmentation, Language Modeling | Code Available 0
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Jun 6, 2024 | Language Modeling, Language Modelling | Code Available 4
Tool-Planner: Task Planning with Clusters across Multiple Tools | Jun 6, 2024 | Language Modelling, Large Language Model | Code Available 2
Every Answer Matters: Evaluating Commonsense with Probabilistic Measures | Jun 6, 2024 | Common Sense Reasoning, Language Modeling | Code Available 0
Queue management for slo-oriented large language model serving | Jun 5, 2024 | Blocking, GPU | Code Available 1
Ranking Manipulation for Conversational Search Engines | Jun 5, 2024 | Conversational Search, Language Modeling | Code Available 0
Item-Language Model for Conversational Recommendation | Jun 5, 2024 | Conversational Recommendation, Dialogue Understanding | Unverified 0
Exploring Robustness in Doctor-Patient Conversation Summarization: An Analysis of Out-of-Domain SOAP Notes | Jun 5, 2024 | Conversation Summarization, Language Modeling | Unverified 0
Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task | Jun 5, 2024 | Knowledge Tracing, Language Modeling | Unverified 0
Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning | Jun 5, 2024 | Diagnostic, Language Modeling | Unverified 0
LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback | Jun 5, 2024 | Few-Shot Learning, Language Modeling | Code Available 0
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models | Jun 5, 2024 | Diversity, Language Modeling | Code Available 2
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization | Jun 5, 2024 | Clinical Knowledge, Denoising | Unverified 0
Prompt-based Visual Alignment for Zero-shot Policy Transfer | Jun 5, 2024 | Autonomous Driving, Language Modelling | Unverified 0
The Task-oriented Queries Benchmark (ToQB) | Jun 5, 2024 | Language Modelling, Large Language Model | Unverified 0
PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM | Jun 5, 2024 | Language Modelling, Large Language Model | Code Available 2
From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation | Jun 5, 2024 | Language Modeling, Language Modelling | Unverified 0
Exploring User Retrieval Integration towards Large Language Models for Cross-Domain Sequential Recommendation | Jun 5, 2024 | Contrastive Learning, Language Modelling | Code Available 0
PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs | Jun 5, 2024 | Language Modelling, Large Language Model | Code Available 1
Error-preserving Automatic Speech Recognition of Young English Learners' Language | Jun 5, 2024 | Automatic Speech Recognition, Language Modelling | Code Available 0
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Jun 5, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available 2
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs | Jun 5, 2024 | Knowledge Distillation, Language Modeling | Unverified 0
Does your data spark joy? Performance gains from domain upsampling at the end of training | Jun 5, 2024 | GSM8K, HumanEval | Unverified 0
Xmodel-LM Technical Report | Jun 5, 2024 | Language Modeling, Language Modelling | Code Available 1
SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms | Jun 5, 2024 | Language Modeling, Language Modelling | Code Available 1
Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models | Jun 5, 2024 | Few-Shot Learning, Language Modeling | Code Available 2
From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks | Jun 4, 2024 | Image Captioning, Language Modelling | Code Available 2
OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step | Jun 4, 2024 | Language Modeling, Language Modelling | Unverified 0
Order-Independence Without Fine Tuning | Jun 4, 2024 | Language Modelling, Multiple-choice | Code Available 0
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing | Jun 4, 2024 | Decoder, Language Modeling | Unverified 0