| Narrow Transformer: StarCoder-Based Java-LM For Desktop | Jul 4, 2024 | Language Modeling | Unverified | 0 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language Modeling | Code Available | 1 |
| MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization | Jul 4, 2024 | Language Modeling | Unverified | 0 |
| RDBE: Reasoning Distillation-Based Evaluation Enhances Automatic Essay Scoring | Jul 3, 2024 | Decoder, Language Modeling | Unverified | 0 |
| Large Language Model Agents for Improving Engagement with Behavior Change Interventions: Application to Digital Mindfulness | Jul 3, 2024 | Language Modeling | Unverified | 0 |
| LLMcap: Large Language Model for Unsupervised PCAP Failure Detection | Jul 3, 2024 | Language Modeling | Unverified | 0 |
| Towards Federated RLHF with Aggregated Client Preference for LLMs | Jul 3, 2024 | Federated Learning, Language Modeling | Unverified | 0 |
| Learning to Reduce: Towards Improving Performance of Large Language Models on Structured Data | Jul 3, 2024 | Language Modeling | Unverified | 0 |
| MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models | Jul 3, 2024 | Extractive Question-Answering, Knowledge Distillation | Unverified | 0 |
| CogErgLLM: Exploring Large Language Model Systems Design Perspective Using Cognitive Ergonomics | Jul 3, 2024 | Language Modeling | Unverified | 0 |
| Supporting Cross-language Cross-project Bug Localization Using Pre-trained Language Models | Jul 3, 2024 | Contrastive Learning, CPU | Unverified | 0 |
| Efficient Training of Language Models with Compact and Consistent Next Token Distributions | Jul 3, 2024 | Language Modeling | Unverified | 0 |
| Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective | Jul 3, 2024 | Language Modeling | Unverified | 0 |
| SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning | Jul 3, 2024 | Few-Shot Learning, General Knowledge | Unverified | 0 |
| InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output | Jul 3, 2024 | Articles, Image Comprehension | Unverified | 0 |
| Raw Text is All you Need: Knowledge-intensive Multi-turn Instruction Tuning for Large Language Model | Jul 3, 2024 | All, Language Modeling | Unverified | 0 |
| Lightweight Large Language Model for Medication Enquiry: Med-Pal | Jul 2, 2024 | Chatbot, Language Modeling | Unverified | 0 |
| Assessing the Effectiveness of GPT-4o in Climate Change Evidence Synthesis and Systematic Assessments: Preliminary Insights | Jul 2, 2024 | Language Modeling | Unverified | 0 |
| SeqMate: A Novel Large Language Model Pipeline for Automating RNA Sequencing | Jul 2, 2024 | Language Modeling | Unverified | 0 |
| LLMs Plagiarize: Ensuring Responsible Sourcing of Large Language Model Training Data Through Knowledge Graph Comparison | Jul 2, 2024 | Knowledge Graphs, Language Modeling | Unverified | 0 |
| Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model | Jul 2, 2024 | Dialogue Generation, Diversity | Unverified | 0 |
| Language Model Alignment in Multilingual Trolley Problems | Jul 2, 2024 | Decision Making, Ethics | Code Available | 1 |
| An End-to-End Speech Summarization Using Large Language Model | Jul 2, 2024 | Language Modeling | Unverified | 0 |
| Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior | Jul 2, 2024 | Language Modeling | Code Available | 0 |
| Accompanied Singing Voice Synthesis with Fully Text-controlled Melody | Jul 2, 2024 | Language Modeling | Unverified | 0 |
| Is Your Large Language Model Knowledgeable or a Choices-Only Cheater? | Jul 2, 2024 | Graph Mining, Language Modeling | Code Available | 0 |
| Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Jul 2, 2024 | Few-Shot Learning, Language Modeling | Code Available | 0 |
| PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning | Jul 2, 2024 | Language Modeling | Unverified | 0 |
| Why do LLaVA Vision-Language Models Reply to Images in English? | Jul 2, 2024 | Language Modeling | Unverified | 0 |
| GPTCast: a weather language model for precipitation nowcasting | Jul 2, 2024 | Language Modeling | Code Available | 1 |
| Multi-Modal Video Dialog State Tracking in the Wild | Jul 2, 2024 | dialog state tracking, Graph structure learning | Unverified | 0 |
| Synthetic Multimodal Question Generation | Jul 2, 2024 | Language Modeling | Unverified | 0 |
| A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding | Jul 2, 2024 | document understanding, Key Information Extraction | Code Available | 2 |
| Fake News Detection and Manipulation Reasoning via Large Vision-Language Models | Jul 2, 2024 | Binary Classification, Fake News Detection | Unverified | 0 |
| AutoFlow: Automated Workflow Generation for Large Language Model Agents | Jul 1, 2024 | AI Agent, Language Modeling | Code Available | 2 |
| Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything | Jul 1, 2024 | Image to text, Language Modeling | Unverified | 0 |
| SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model | Jul 1, 2024 | Knowledge Tracing, Language Modeling | Code Available | 1 |
| An Empirical Comparison of Generative Approaches for Product Attribute-Value Identification | Jul 1, 2024 | Attribute, Attribute Mining | Code Available | 0 |
| Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Jul 1, 2024 | Knowledge Graphs, Language Modeling | Code Available | 0 |
| FoldGPT: Simple and Effective Large Language Model Compression Scheme | Jul 1, 2024 | Language Modeling | Unverified | 0 |
| Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving | Jul 1, 2024 | Autonomous Driving, Common Sense Reasoning | Unverified | 0 |
| First Place Solution of 2023 Global Artificial Intelligence Technology Innovation Competition Track 1 | Jul 1, 2024 | Denoising, Diagnostic | Unverified | 0 |
| Large Language Model Enhanced Knowledge Representation Learning: A Survey | Jul 1, 2024 | Language Modeling | Unverified | 0 |
| Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation | Jul 1, 2024 | Language Modeling | Code Available | 0 |
| Optimization of Retrieval-Augmented Generation Context with Outlier Detection | Jul 1, 2024 | Language Modeling | Unverified | 0 |
| Tree Search for Language Model Agents | Jul 1, 2024 | Language Modeling | Code Available | 3 |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Jul 1, 2024 | Hallucination, Language Modeling | Code Available | 0 |
| Memory^3: Language Modeling with Explicit Memory | Jul 1, 2024 | Language Modeling | Unverified | 0 |
| CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents | Jul 1, 2024 | Language Modeling | Code Available | 3 |
| RegMix: Data Mixture as Regression for Language Model Pre-training | Jul 1, 2024 | Common Sense Reasoning, Language Modeling | Code Available | 2 |