Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0Towards Understanding the Influence of Reward Margin on Preference Model Performance Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM Apr 7, 2024 Cross-Lingual Transfer Language Modeling
— Unverified 0Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model Apr 7, 2024 Action Recognition Decision Making
— Unverified 0What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models Apr 6, 2024 Knowledge Distillation Language Modeling
— Unverified 0Binary Classifier Optimization for Large Language Model Alignment Apr 6, 2024 Language Modeling Language Modelling
— Unverified 0Autonomous Artificial Intelligence Agents for Clinical Decision Making in Oncology Apr 6, 2024 Decision Making Language Modelling
— Unverified 0Large Language Model (LLM) AI text generation detection based on transformer deep learning algorithm Apr 6, 2024 Language Modeling Language Modelling
— Unverified 0Physics Event Classification Using Large Language Models Apr 5, 2024 Chatbot Classification
Code Code Available 0player2vec: A Language Modeling Approach to Understand Player Behavior in Games Apr 5, 2024 Language Modeling Language Modelling
— Unverified 0BuDDIE: A Business Document Dataset for Multi-task Information Extraction Apr 5, 2024 Document Classification document understanding
— Unverified 0Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model Apr 5, 2024 Language Modeling Language Modelling
— Unverified 0Implicit Bias of AdamW: _ Norm Constrained Optimization Apr 5, 2024 Language Modeling Language Modelling
— Unverified 0A Comparison of Methods for Evaluating Generative IR Apr 5, 2024 Information Retrieval Language Modelling
Code Code Available 0Forget NLI, Use a Dictionary: Zero-Shot Topic Classification for Low-Resource Languages with Application to Luxembourgish Apr 5, 2024 Language Modelling Natural Language Inference
Code Code Available 0Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving Apr 5, 2024 Data Augmentation In-Context Learning
— Unverified 0Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval Apr 5, 2024 Decoder Language Modeling
Code Code Available 0Edisum: Summarizing and Explaining Wikipedia Edits at Scale Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 0Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought Apr 4, 2024 Extractive Question-Answering Knowledge Distillation
— Unverified 0Diverse and Tailored Image Generation for Zero-shot Multi-label Classification Apr 4, 2024 Image Generation Language Modelling
— Unverified 0DeViDe: Faceted medical knowledge for improved medical vision-language pre-training Apr 4, 2024 Language Modelling Large Language Model
— Unverified 0Mitigating LLM Hallucinations via Conformal Abstention Apr 4, 2024 Conformal Prediction Generative Question Answering
— Unverified 0Multi-modal Learning for WebAssembly Reverse Engineering Apr 4, 2024 Language Modelling Self-Supervised Learning
— Unverified 0SemGrasp: Semantic Grasp Generation via Language Aligned Discretization Apr 4, 2024 Grasp Generation Language Modeling
— Unverified 0Towards Pareto Optimal Throughput in Small Language Model Serving Apr 4, 2024 Language Modeling Language Modelling
— Unverified 0OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning Apr 4, 2024 Descriptive Diversity
— Unverified 0Bias Amplification in Language Model Evolution: An Iterated Learning Perspective Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 0Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization Apr 4, 2024 GPU Language Modeling
Code Code Available 0RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis Apr 4, 2024 Language Modeling Language Modelling
— Unverified 0Standardizing Knowledge Engineering Practices with a Reference Architecture Apr 4, 2024 Language Modeling Language Modelling
— Unverified 0Understanding Language Modeling Paradigm Adaptations in Recommender Systems: Lessons Learned and Open Challenges Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 0Manipulating and Mitigating Generative Model Biases without Retraining Apr 3, 2024 Backdoor Attack Language Modelling
— Unverified 0Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian Apr 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0PhonologyBench: Evaluating Phonological Skills of Large Language Models Apr 3, 2024 Diagnostic Grapheme-to-Phoneme Conversion
— Unverified 0Testing the Effect of Code Documentation on Large Language Model Code Understanding Apr 3, 2024 Code Generation Language Modeling
— Unverified 0Towards Large Language Model driven Reference-less Translation Evaluation for English and Indian Languages Apr 3, 2024 Language Modeling Language Modelling
— Unverified 0LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models Apr 3, 2024 Language Modeling Language Modelling
Code Code Available 0Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison Apr 3, 2024 Diversity In-Context Learning
— Unverified 0Vocabulary Attack to Hijack Large Language Model Applications Apr 3, 2024 Language Modeling Language Modelling
— Unverified 0Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model Apr 3, 2024 Knowledge Graphs Language Modeling
— Unverified 0Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models Apr 3, 2024 Language Modelling Negation
— Unverified 0Attributions toward Artificial Agents in a modified Moral Turing Test Apr 3, 2024 Language Modelling
— Unverified 0Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers Apr 3, 2024 Language Modeling Language Modelling
— Unverified 0CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech Apr 3, 2024 Language Modeling Language Modelling
— Unverified 0From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives Apr 3, 2024 Decision Making Language Modeling
— Unverified 0Improving Topic Relevance Model by Mix-structured Summarization and LLM-based Data Augmentation Apr 3, 2024 Data Augmentation Language Modeling
— Unverified 0FPT: Feature Prompt Tuning for Few-shot Readability Assessment Apr 3, 2024 16k Few-Shot Text Classification
Code Code Available 0ANGOFA: Leveraging OFA Embedding Initialization and Synthetic Data for Angolan Language Model Apr 3, 2024 Language Modeling Language Modelling
Code Code Available 0AWOL: Analysis WithOut synthesis using Language Apr 3, 2024 Language Modelling
— Unverified 0