| Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Oct 15, 2024 | Hallucination, Large Language Model | Code Available | 0 | 5 |
| Large Language Model Augmented Narrative Driven Recommendations | Jun 4, 2023 | Data Augmentation, Language Modeling | Code Available | 0 | 5 |
| StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation | Aug 6, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Large Language Model Assisted Adversarial Robustness Neural Architecture Search | Jun 8, 2024 | Adversarial Robustness, Combinatorial Optimization | Code Available | 0 | 5 |
| Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Adaptive Graph Pruning for Multi-Agent Communication | Jun 3, 2025 | Code Generation, Large Language Model | Code Available | 0 | 5 |
| Language Model Behavior: A Comprehensive Survey | Mar 20, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Feb 9, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Jun 17, 2024 | Code Generation, Code Search | Code Available | 0 | 5 |
| Correcting misinformation on social media with a large language model | Mar 17, 2024 | Fact Checking, Language Modeling | Code Available | 0 | 5 |
| Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro | Jan 1, 2025 | Data Augmentation, Language Modeling | Code Available | 0 | 5 |
| Large Language Model Critics for Execution-Free Evaluation of Code Changes | Jan 28, 2025 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation | Oct 10, 2022 | counterfactual, Data Augmentation | Code Available | 0 | 5 |
| CoPrUS: Consistency Preserving Utterance Synthesis towards more realistic benchmark dialogues | Dec 10, 2024 | Data Augmentation, Language Modeling | Code Available | 0 | 5 |
| Knowledge Grafting of Large Language Models | May 24, 2025 | Continual Learning, Knowledge Distillation | Code Available | 0 | 5 |
| Automated Privacy Information Annotation in Large Language Model Interactions | May 27, 2025 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution | May 29, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| KL Penalty Control via Perturbation for Direct Preference Optimization | Feb 18, 2025 | Chatbot, Language Modeling | Code Available | 0 | 5 |
| Conversations in Galician: a Large Language Model for an Underrepresented Language | Nov 7, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral 7B Model and Data Augmentation | Aug 6, 2024 | Data Augmentation, Language Modeling | Code Available | 0 | 5 |
| Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language Models | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney Disease | Mar 6, 2025 | Chunking, Language Modeling | Code Available | 0 | 5 |
| Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | May 21, 2025 | Benchmarking, Language Modeling | Code Available | 0 | 5 |
| Conversational Feedback in Scripted versus Spontaneous Dialogues: A Comparative Analysis | Sep 27, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Conversational AI Powered by Large Language Models Amplifies False Memories in Witness Interviews | Aug 8, 2024 | Chatbot, Language Modelling | Code Available | 0 | 5 |
| Automated Generation and Tagging of Knowledge Components from Multiple-Choice Questions | May 30, 2024 | Language Modelling, Large Language Model | Code Available | 0 | 5 |
| Keep It Private: Unsupervised Privatization of Online Text | May 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| KL-Divergence Guided Temperature Sampling | Jun 2, 2023 | Conversational Question Answering, Diversity | Code Available | 0 | 5 |
| Controlling Large Language Model with Latent Actions | Mar 27, 2025 | CoLA, Language Modeling | Code Available | 0 | 5 |
| Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards | May 22, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance | Dec 23, 2024 | Articles, Language Modeling | Code Available | 0 | 5 |
| Evaluating Biases in Context-Dependent Health Questions | Mar 7, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Automated Bug Report Prioritization in Large Open-Source Projects | Apr 22, 2025 | Large Language Model, text-classification | Code Available | 0 | 5 |
| Controlled LLM Decoding via Discrete Auto-regressive Biasing | Feb 6, 2025 | Large Language Model, Text Generation | Code Available | 0 | 5 |
| Aligning Sentence Simplification with ESL Learner's Proficiency for Language Acquisition | Feb 17, 2025 | Diversity, Language Acquisition | Code Available | 0 | 5 |
| Jamba: A Hybrid Transformer-Mamba Language Model | Mar 28, 2024 | GPU, Language Modeling | Code Available | 0 | 5 |
| Evaluating Cultural Adaptability of a Large Language Model via Simulation of Synthetic Personas | Aug 13, 2024 | Articles, Language Modeling | Code Available | 0 | 5 |
| Evaluating Dependencies in Fact Editing for Language Models: Specificity and Implication Awareness | Dec 4, 2023 | knowledge editing, Language Modeling | Code Available | 0 | 5 |
| KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Oct 21, 2024 | Chatbot, Language Modeling | Code Available | 0 | 5 |
| "I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities | Dec 26, 2024 | Domain Adaptation, Language Modeling | Code Available | 0 | 5 |
| Iterative Counterfactual Data Augmentation | Feb 25, 2025 | counterfactual, Data Augmentation | Code Available | 0 | 5 |
| Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators | Apr 21, 2025 | Code Generation, Instruction Following | Code Available | 0 | 5 |
| I Want to Break Free! Persuasion and Anti-Social Behavior of LLMs in Multi-Agent Settings with Social Hierarchy | Oct 9, 2024 | Large Language Model, Persuasiveness | Code Available | 0 | 5 |
| Is Your Large Language Model Knowledgeable or a Choices-Only Cheater? | Jul 2, 2024 | Graph Mining, Language Modeling | Code Available | 0 | 5 |
| Item-side Fairness of Large Language Model-based Recommendation System | Feb 23, 2024 | Fairness, Language Modeling | Code Available | 0 | 5 |
| KCluster: An LLM-based Clustering Approach to Knowledge Component Discovery | May 9, 2025 | Clustering, Descriptive | Code Available | 0 | 5 |
| Know Your Needs Better: Towards Structured Understanding of Marketer Demands with Analogical Reasoning Augmented LLMs | Jan 9, 2024 | Language Modelling, Large Language Model | Code Available | 0 | 5 |
| Leveraging Large Language Models for Automated Dialogue Analysis | Sep 12, 2023 | General Knowledge, Language Modeling | Code Available | 0 | 5 |
| Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents | Feb 18, 2025 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Contrastive Cross-Course Knowledge Tracing via Concept Graph Guided Knowledge Transfer | May 14, 2025 | Knowledge Tracing, Large Language Model | Code Available | 0 | 5 |