A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories May 2, 2025 Code Generation Text Generation
— Unverified 0Ensuring Reproducibility in Generative AI Systems for General Use Cases: A Framework for Regression Testing and Open Datasets May 2, 2025 Code Generation GPR
Code Code Available 0A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts May 2, 2025 Linguistic steganography Text Generation
— Unverified 0Graph Synthetic Out-of-Distribution Exposure with Large Language Models Apr 29, 2025 Out of Distribution (OOD) Detection Text Generation
— Unverified 0Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts Apr 29, 2025 All Diversity
— Unverified 0YoChameleon: Personalized Vision and Language Generation Apr 29, 2025 Image Generation Text Generation
— Unverified 0Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models Apr 29, 2025 Diversity Sensitivity
— Unverified 0A Platform for Generating Educational Activities to Teach English as a Second Language Apr 28, 2025 Text Generation
— Unverified 0Anyprefer: An Agentic Framework for Preference Data Synthesis Apr 27, 2025 Medical Image Analysis Text Generation
— Unverified 0TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation Apr 25, 2025 Attribute Text Generation
— Unverified 0Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Apr 24, 2025 Image Generation Text Generation
— Unverified 0How Effective are Generative Large Language Models in Performing Requirements Classification? Apr 23, 2025 Classification Text Generation
— Unverified 0Distilling semantically aware orders for autoregressive image generation Apr 23, 2025 Image Generation Text Generation
— Unverified 0ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs Apr 23, 2025 Decision Making Knowledge Graphs
Code Code Available 0(Im)possibility of Automated Hallucination Detection in Large Language Models Apr 23, 2025 Hallucination Language Identification
— Unverified 0FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering Apr 20, 2025 counterfactual Fairness
— Unverified 0FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models Apr 20, 2025 Descriptive Ethics
— Unverified 0LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation Apr 20, 2025 Attribute Image Segmentation
— Unverified 0Density Measures for Language Generation Apr 19, 2025 Hallucination Text Generation
— Unverified 0Sparks of Science: Hypothesis Generation Using Structured Paper Data Apr 17, 2025 Language Modelling Text Generation
— Unverified 0Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation Apr 16, 2025 GSM8K Math
— Unverified 0Enhancing multimodal analogical reasoning with Logic Augmented Generation Apr 15, 2025 Knowledge Graphs Text Generation
Code Code Available 0Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items Apr 15, 2025 Benchmarking Multiple-choice
— Unverified 0Joint Action Language Modelling for Transparent Policy Execution Apr 14, 2025 Language Modelling Text Generation
— Unverified 0Transferable text data distillation by trajectory matching Apr 14, 2025 ARC Large Language Model
— Unverified 0ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation Apr 11, 2025 Diversity Language Modeling
— Unverified 0MedHal: An Evaluation Dataset for Medical Hallucination Detection Apr 11, 2025 Hallucination Natural Language Inference
— Unverified 0Large Language Models as Span Annotators Apr 11, 2025 Data-to-Text Generation Machine Translation
— Unverified 0DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization? Apr 10, 2025 Machine Translation nlg evaluation
— Unverified 0HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation Apr 9, 2025 Text Generation
Code Code Available 0Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use Apr 7, 2025 GSM8K Math
— Unverified 0Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling Apr 7, 2025 Information Retrieval Language Modeling
— Unverified 0IMPersona: Evaluating Individual Level LM Impersonation Apr 6, 2025 Text Generation
Code Code Available 0Evaluating Compact LLMs for Zero-Shot Iberian Language Tasks on End-User Devices Apr 4, 2025 Text Generation
— Unverified 0Stance-Driven Multimodal Controlled Statement Generation: New Dataset and Task Apr 4, 2025 Marketing multimodal generation
— Unverified 0Sample, Don't Search: Rethinking Test-Time Alignment for Language Models Apr 4, 2025 GSM8K Mathematical Reasoning
— Unverified 0Align to Structure: Aligning Large Language Models with Structural Information Apr 4, 2025 Document Summarization Text Generation
Code Code Available 0State-of-the-Art Translation of Text-to-Gloss using mBART : A case study of Bangla Apr 3, 2025 Data Augmentation Text Generation
— Unverified 0CoLa -- Learning to Interactively Collaborate with Large LMs Apr 3, 2025 CoLA Text Generation
— Unverified 0Pel, A Programming Language for Orchestrating AI Agents Apr 3, 2025 Code Generation Text Generation
— Unverified 0LVMed-R2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation Apr 2, 2025 Diagnostic Medical Report Generation
— Unverified 0ContrastScore: Towards Higher Quality, Less Biased, More Efficient Evaluation Metrics with Contrastive Evaluation Apr 2, 2025 Machine Translation Text Generation
— Unverified 0GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments Apr 1, 2025 Hallucination Text Generation
— Unverified 0Repetitions are not all alike: distinct mechanisms sustain repetition in language models Apr 1, 2025 All In-Context Learning
— Unverified 0ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations Apr 1, 2025 Articles RAG
— Unverified 0Synthesized Annotation Guidelines are Knowledge-Lite Boosters for Clinical Information Extraction Apr 1, 2025 Few-Shot Learning named-entity-recognition
— Unverified 0A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System Apr 1, 2025 Dialogue Generation Ensemble Learning
— Unverified 0Multi-Agent LLM Judge: automatic personalized LLM judge design for evaluating natural language generation applications Apr 1, 2025 Text Generation
— Unverified 0Adaptive Layer-skipping in Pre-trained LLMs Mar 31, 2025 Text Generation
— Unverified 0Optimizing Humor Generation in Large Language Models: Temperature Configurations and Architectural Trade-offs Mar 31, 2025 Model Selection Text Generation
— Unverified 0