Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling Oct 15, 2024 Instruction Following Knowledge Distillation
— Unverified 0Survey and Evaluation of Converging Architecture in LLMs based on Footsteps of Operations Oct 15, 2024 Text Generation
— Unverified 0Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning Oct 14, 2024 Natural Language Understanding Text Generation
— Unverified 0LLM-based Code-Switched Text Generation for Grammatical Error Correction Oct 14, 2024 Grammatical Error Correction Synthetic Data Generation
— Unverified 0Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback Oct 14, 2024 Text Generation
Code Code Available 0Large Language Models Are Active Critics in NLG Evaluation Oct 14, 2024 nlg evaluation Prompt Engineering
— Unverified 0Local and Global Decoding in Text Generation Oct 14, 2024 Language Modeling Language Modelling
Code Code Available 0TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Oct 14, 2024 2k Benchmarking
Code Code Available 1Multilingual Controlled Generation And Gold-Standard-Agnostic Evaluation of Code-Mixed Sentences Oct 14, 2024 Sentence Text Generation
— Unverified 0MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling Oct 14, 2024 Denoising Image Generation
— Unverified 0QE-EBM: Using Quality Estimators as Energy Loss for Machine Translation Oct 14, 2024 Machine Translation NMT
— Unverified 0First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending Oct 14, 2024 Image Generation Text Generation
Code Code Available 1BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation Oct 13, 2024 Natural Language Understanding parameter-efficient fine-tuning
— Unverified 0Text4Seg: Reimagining Image Segmentation as Text Generation Oct 13, 2024 Image Segmentation Referring Expression
Code Code Available 2FedEx-LoRA: Exact Aggregation for Federated and Efficient Fine-Tuning of Foundation Models Oct 12, 2024 Arithmetic Reasoning Federated Learning
Code Code Available 1Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks Oct 10, 2024 8k Diversity
— Unverified 0GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment Oct 10, 2024 Text Generation
Code Code Available 1Thought2Text: Text Generation from EEG Signal using Large Language Models (LLMs) Oct 10, 2024 EEG Text Generation
Code Code Available 2StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models Oct 10, 2024 Question Answering Reinforcement Learning (RL)
Code Code Available 1Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models Oct 10, 2024 Text Generation
— Unverified 0Localizing Factual Inconsistencies in Attributable Text Generation Oct 9, 2024 Text Generation
Code Code Available 0Signal Watermark on Large Language Models Oct 9, 2024 Text Generation
Code Code Available 0Guaranteed Generation from Large Language Models Oct 9, 2024 Text Generation
Code Code Available 0Parameter Efficient Fine-tuning via Explained Variance Adaptation Oct 9, 2024 All image-classification
Code Code Available 1Decoding Decoded: Understanding Hyperparameter Effects in Open-Ended Text Generation Oct 8, 2024 Text Generation
Code Code Available 0Integrating Planning into Single-Turn Long-Form Text Generation Oct 8, 2024 Articles Form
— Unverified 0TRACE: Temporal Grounding Video LLM via Causal Event Modeling Oct 8, 2024 Text Generation Video Understanding
Code Code Available 2Conformal Structured Prediction Oct 8, 2024 Conformal Prediction image-classification
Code Code Available 0ParallelSpec: Parallel Drafter for Efficient Speculative Decoding Oct 8, 2024 Language Modeling Language Modelling
— Unverified 0KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server Oct 8, 2024 Federated Learning Knowledge Distillation
Code Code Available 0RevisEval: Improving LLM-as-a-Judge via Response-Adapted References Oct 7, 2024 Instruction Following Text Generation
— Unverified 0LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies Oct 7, 2024 RAG Retrieval
Code Code Available 0Evaluating the Correctness of Inference Patterns Used by LLMs for Judgment Oct 6, 2024 Text Generation
— Unverified 0Graded Suspiciousness of Adversarial Texts to Human Oct 6, 2024 Adversarial Attack Adversarial Text
— Unverified 0Control Large Language Models via Divide and Conquer Oct 6, 2024 Text Generation
— Unverified 0Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective Oct 6, 2024 CPU GPU
Code Code Available 1Inner-Probe: Discovering Copyright-related Data Generation in LLM Architecture Oct 6, 2024 Contrastive Learning Prompt Engineering
— Unverified 0Algorithmic Capabilities of Random Transformers Oct 6, 2024 Text Generation
Code Code Available 1Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training Oct 6, 2024 Diversity Image Generation
— Unverified 0PAD: Personalized Alignment of LLMs at Decoding-Time Oct 5, 2024 Text Generation
— Unverified 0Mechanistic Behavior Editing of Language Models Oct 5, 2024 Bayesian Optimization Specificity
Code Code Available 0Crafting Narrative Closures: Zero-Shot Learning with SSM Mamba for Short Story Ending Generation Oct 4, 2024 Mamba Sentence
— Unverified 0Using Prompts to Guide Large Language Models in Imitating a Real Person's Language Style Oct 4, 2024 Language Modelling Large Language Model
— Unverified 0FaithCAMERA: Construction of a Faithful Dataset for Ad Text Generation Oct 4, 2024 Informativeness Sentence
Code Code Available 0Decoding Game: On Minimax Optimality of Heuristic Text Generation Strategies Oct 4, 2024 Text Generation
— Unverified 0Steering Large Language Models between Code Execution and Textual Reasoning Oct 4, 2024 Code Generation Math
Code Code Available 2Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments Oct 4, 2024 Event Extraction Text Generation
— Unverified 0ToolGen: Unified Tool Retrieval and Calling via Generation Oct 4, 2024 Retrieval Text Generation
Code Code Available 2On Uncertainty In Natural Language Processing Oct 4, 2024 Conformal Prediction text-classification
— Unverified 0Reward-RAG: Enhancing RAG with Reward Driven Supervision Oct 3, 2024 RAG Retrieval
— Unverified 0