| Efficient Controlled Language Generation with Low-Rank Autoregressive Reward Models | Jul 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EventChat: Implementation and user-centric evaluation of a large language model-driven conversational recommender system for exploring leisure events in an SME context | Jul 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Integrating Randomness in Large Language Models: A Linear Congruential Generator Approach for Generating Clinically Relevant Content | Jul 4, 2024 | Fact SelectionLanguage Modeling | CodeCode Available | 0 |
| Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR Correction | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ConText at WASSA 2024 Empathy and Personality Shared Task: History-Dependent Embedding Utterance Representations for Empathy and Emotion Prediction in Conversations | Jul 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Self Consistency in LLMs through Probabilistic Tokenization | Jul 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chain-of-Thought Augmentation with Logit Contrast for Enhanced Reasoning in Language Models | Jul 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Mysterious Case of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics of Meta's Llama 2 Model | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization | Jul 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unlocking the Potential of Model Merging for Low-Resource Languages | Jul 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MAMA: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| On the Effectiveness of Acoustic BPE in Decoder-Only TTS | Jul 4, 2024 | DecoderDiversity | —Unverified | 0 |
| Narrow Transformer: StarCoder-Based Java-LM For Desktop | Jul 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Uncertainty-Guided Optimization on Large Language Model Search Trees | Jul 4, 2024 | Bayesian OptimizationEfficient Exploration | CodeCode Available | 0 |
| Supporting Cross-language Cross-project Bug Localization Using Pre-trained Language Models | Jul 3, 2024 | Contrastive LearningCPU | —Unverified | 0 |
| RDBE: Reasoning Distillation-Based Evaluation Enhances Automatic Essay Scoring | Jul 3, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Raw Text is All you Need: Knowledge-intensive Multi-turn Instruction Tuning for Large Language Model | Jul 3, 2024 | AllLanguage Modeling | —Unverified | 0 |
| SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning | Jul 3, 2024 | Few-Shot LearningGeneral Knowledge | —Unverified | 0 |
| Towards Federated RLHF with Aggregated Client Preference for LLMs | Jul 3, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models | Jul 3, 2024 | Extractive Question-AnsweringKnowledge Distillation | —Unverified | 0 |
| InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output | Jul 3, 2024 | ArticlesImage Comprehension | —Unverified | 0 |
| LLMcap: Large Language Model for Unsupervised PCAP Failure Detection | Jul 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model Agents for Improving Engagement with Behavior Change Interventions: Application to Digital Mindfulness | Jul 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Reduce: Towards Improving Performance of Large Language Models on Structured Data | Jul 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CogErgLLM: Exploring Large Language Model Systems Design Perspective Using Cognitive Ergonomics | Jul 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective | Jul 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Training of Language Models with Compact and Consistent Next Token Distributions | Jul 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLMs Plagiarize: Ensuring Responsible Sourcing of Large Language Model Training Data Through Knowledge Graph Comparison | Jul 2, 2024 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Is Your Large Language Model Knowledgeable or a Choices-Only Cheater? | Jul 2, 2024 | Graph MiningLanguage Modeling | CodeCode Available | 0 |
| Fake News Detection and Manipulation Reasoning via Large Vision-Language Models | Jul 2, 2024 | Binary ClassificationFake News Detection | —Unverified | 0 |
| An End-to-End Speech Summarization Using Large Language Model | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Accompanied Singing Voice Synthesis with Fully Text-controlled Melody | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Assessing the Effectiveness of GPT-4o in Climate Change Evidence Synthesis and Systematic Assessments: Preliminary Insights | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model | Jul 2, 2024 | Dialogue GenerationDiversity | —Unverified | 0 |
| Lightweight Large Language Model for Medication Enquiry: Med-Pal | Jul 2, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior | Jul 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SeqMate: A Novel Large Language Model Pipeline for Automating RNA Sequencing | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Modal Video Dialog State Tracking in the Wild | Jul 2, 2024 | dialog state trackingGraph structure learning | —Unverified | 0 |
| Why do LLaVA Vision-Language Models Reply to Images in English? | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Synthetic Multimodal Question Generation | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Jul 2, 2024 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| Memory^3: Language Modeling with Explicit Memory | Jul 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Needle in the Haystack for Memory Based Large Language Models | Jul 1, 2024 | DecoderGPU | —Unverified | 0 |
| Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving | Jul 1, 2024 | Autonomous DrivingCommon Sense Reasoning | —Unverified | 0 |
| Optimization of Retrieval-Augmented Generation Context with Outlier Detection | Jul 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything | Jul 1, 2024 | Image to textLanguage Modeling | —Unverified | 0 |
| Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation | Jul 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Jul 1, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| First Place Solution of 2023 Global Artificial Intelligence Technology Innovation Competition Track 1 | Jul 1, 2024 | DenoisingDiagnostic | —Unverified | 0 |