Adaptive Pruning for Large Language Models with Structural Importance Awareness Dec 19, 2024 Text Generation zero-shot-classification
— Unverified 0A Survey of RWKV Dec 19, 2024 Natural Language Understanding Survey
Code Code Available 1Movie2Story: A framework for understanding videos and telling stories in the form of novel text Dec 19, 2024 Dataset Generation Fairness
— Unverified 0ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling Dec 18, 2024 Language Modeling Language Modelling
Code Code Available 1Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation Dec 18, 2024 Cross-Lingual Transfer Text Generation
Code Code Available 0LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer Dec 18, 2024 Attribute Text Generation
Code Code Available 3Curriculum Learning for Cross-Lingual Data-to-Text Generation With Noisy Data Dec 18, 2024 Data-to-Text Generation Text Generation
— Unverified 0Typhoon 2: A Family of Open Text and Multimodal Thai Large Language Models Dec 18, 2024 document understanding Image Captioning
Code Code Available 1Federated Learning and RAG Integration: A Scalable Approach for Medical Large Language Models Dec 18, 2024 Federated Learning Privacy Preserving
— Unverified 0Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation Dec 17, 2024 Classification parameter-efficient fine-tuning
Code Code Available 0MOPO: Multi-Objective Prompt Optimization for Affective Text Generation Dec 17, 2024 Conditional Text Generation Text Generation
— Unverified 0Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health Dec 17, 2024 Text Generation
Code Code Available 0DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation Dec 17, 2024 Form Language Modeling
Code Code Available 0The Open Source Advantage in Large Language Models (LLMs) Dec 16, 2024 Computational Efficiency Machine Translation
— Unverified 0VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting Dec 16, 2024 Informativeness Large Language Model
Code Code Available 0LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts Dec 16, 2024 General Knowledge Instruction Following
Code Code Available 2GaLore+: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection Dec 15, 2024 Arithmetic Reasoning Text Generation
— Unverified 0AD-LLM: Benchmarking Large Language Models for Anomaly Detection Dec 15, 2024 Anomaly Detection Benchmarking
Code Code Available 1Understanding and Mitigating Memorization in Diffusion Models for Tabular Data Dec 15, 2024 Data Augmentation Memorization
— Unverified 0Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models Dec 15, 2024 Contrastive Learning Decoder
— Unverified 0Cultural Palette: Pluralising Culture Alignment via Multi-agent Palette Dec 15, 2024 Large Language Model Question Answering
— Unverified 0RWKV-Lite: Deeply Compressed RWKV for Resource-Constrained Devices Dec 14, 2024 Computational Efficiency Text Generation
— Unverified 0Generative Modeling with Diffusion Dec 14, 2024 Denoising Text Generation
Code Code Available 0Benchmarking Linguistic Diversity of Large Language Models Dec 13, 2024 Benchmarking Diversity
Code Code Available 0AutoPatent: A Multi-Agent Framework for Automatic Patent Generation Dec 13, 2024 Text Generation
Code Code Available 2Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy Dec 12, 2024 Cross-Lingual Transfer Text Generation
— Unverified 0Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection Dec 11, 2024 Text Detection Text Generation
— Unverified 0Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation Dec 11, 2024 Reading Comprehension Text Generation
— Unverified 0Concept Bottleneck Large Language Models Dec 11, 2024 Language Modeling Language Modelling
Code Code Available 1Forking Paths in Neural Text Generation Dec 10, 2024 Text Generation
— Unverified 0Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation Dec 10, 2024 Text Generation
Code Code Available 0Label-Confidence-Aware Uncertainty Estimation in Natural Language Generation Dec 10, 2024 Text Generation Uncertainty Quantification
— Unverified 0TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation Dec 10, 2024 General Knowledge Text Generation
— Unverified 0Copyright-Protected Language Generation via Adaptive Model Fusion Dec 9, 2024 Code Generation model
Code Code Available 0The Narrow Gate: Localized Image-Text Communication in Vision-Language Models Dec 9, 2024 Text Generation
— Unverified 0Flow Matching Guide and Code Dec 9, 2024 Text Generation
Code Code Available 7Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt Dec 8, 2024 Text Generation
— Unverified 0Compositional Image Retrieval via Instruction-Aware Contrastive Learning Dec 7, 2024 Contrastive Learning Image Retrieval
Code Code Available 0A Scoping Review of ChatGPT Research in Accounting and Finance Dec 7, 2024 Text Generation
— Unverified 0PromptRefine: Enhancing Few-Shot Performance on Low-Resource Indic Languages with Example Selection from Related Example Banks Dec 7, 2024 Cross-Lingual Question Answering Diversity
— Unverified 0Reducing Tool Hallucination via Reliability Alignment Dec 5, 2024 Hallucination Text Generation
— Unverified 0The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation Dec 5, 2024 Blocking Image Generation
— Unverified 0One Communication Round is All It Needs for Federated Fine-Tuning Foundation Models Dec 5, 2024 All Image Generation
— Unverified 0Exploring AI Text Generation, Retrieval-Augmented Generation, and Detection Technologies: a Comprehensive Overview Dec 5, 2024 Information Retrieval Misinformation
— Unverified 0Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning Dec 5, 2024 Comment Generation Decoder
Code Code Available 0Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges Dec 4, 2024 Code Generation Image Comprehension
— Unverified 0Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective Dec 4, 2024 Image Generation Text Generation
— Unverified 0Enhancing Trust in Large Language Models with Uncertainty-Aware Fine-Tuning Dec 3, 2024 Causal Language Modeling Language Modeling
— Unverified 0Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity Dec 3, 2024 Text Generation
— Unverified 0Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey Dec 3, 2024 Cross-Modal Retrieval Natural Language Understanding
— Unverified 0