X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model Apr 7, 2024 Action Recognition Decision Making
— Unverified 0How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM Apr 7, 2024 Cross-Lingual Transfer Language Modeling
— Unverified 0PairAug: What Can Augmented Image-Text Pairs Do for Radiology? Apr 7, 2024 Data Augmentation image-classification
Code Code Available 1SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget Apr 7, 2024 Language Modelling Large Language Model
Code Code Available 1SilverSight: A Multi-Task Chinese Financial Large Language Model Based on Adaptive Semantic Space Learning Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0Hidden You Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Logic Chain Injection Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0Towards Understanding the Influence of Reward Margin on Preference Model Performance Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0Large Language Model (LLM) AI text generation detection based on transformer deep learning algorithm Apr 6, 2024 Language Modeling Language Modelling
— Unverified 0What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models Apr 6, 2024 Knowledge Distillation Language Modeling
— Unverified 0Autonomous Artificial Intelligence Agents for Clinical Decision Making in Oncology Apr 6, 2024 Decision Making Language Modelling
— Unverified 0Binary Classifier Optimization for Large Language Model Alignment Apr 6, 2024 Language Modeling Language Modelling
— Unverified 0Physics Event Classification Using Large Language Models Apr 5, 2024 Chatbot Classification
Code Code Available 0Implicit Bias of AdamW: _ Norm Constrained Optimization Apr 5, 2024 Language Modeling Language Modelling
— Unverified 0Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving Apr 5, 2024 Data Augmentation In-Context Learning
— Unverified 0Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model Apr 5, 2024 Language Modeling Language Modelling
— Unverified 0Forget NLI, Use a Dictionary: Zero-Shot Topic Classification for Low-Resource Languages with Application to Luxembourgish Apr 5, 2024 Language Modelling Natural Language Inference
Code Code Available 0Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation Apr 5, 2024 Contrastive Learning Language Modeling
Code Code Available 1player2vec: A Language Modeling Approach to Understand Player Behavior in Games Apr 5, 2024 Language Modeling Language Modelling
— Unverified 0BuDDIE: A Business Document Dataset for Multi-task Information Extraction Apr 5, 2024 Document Classification document understanding
— Unverified 0Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval Apr 5, 2024 Decoder Language Modeling
Code Code Available 0A Comparison of Methods for Evaluating Generative IR Apr 5, 2024 Information Retrieval Language Modelling
Code Code Available 0BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models Apr 5, 2024 Factual probe General Knowledge
Code Code Available 1Mitigating LLM Hallucinations via Conformal Abstention Apr 4, 2024 Conformal Prediction Generative Question Answering
— Unverified 0CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 1CONFLARE: CONFormal LArge language model REtrieval Apr 4, 2024 Conformal Prediction Language Modeling
Code Code Available 1Bias Amplification in Language Model Evolution: An Iterated Learning Perspective Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 0Understanding Language Modeling Paradigm Adaptations in Recommender Systems: Lessons Learned and Open Challenges Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 0Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization Apr 4, 2024 GPU Language Modeling
Code Code Available 0AutoWebGLM: A Large Language Model-based Web Navigating Agent Apr 4, 2024 Decision Making Language Modeling
Code Code Available 4OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning Apr 4, 2024 Descriptive Diversity
— Unverified 0Multi-modal Learning for WebAssembly Reverse Engineering Apr 4, 2024 Language Modelling Self-Supervised Learning
— Unverified 0Standardizing Knowledge Engineering Practices with a Reference Architecture Apr 4, 2024 Language Modeling Language Modelling
— Unverified 0RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis Apr 4, 2024 Language Modeling Language Modelling
— Unverified 0nicolay-r at SemEval-2024 Task 3: Using Flan-T5 for Reasoning Emotion Cause in Conversations with Chain-of-Thought on Emotion States Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 1Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought Apr 4, 2024 Extractive Question-Answering Knowledge Distillation
— Unverified 0Edisum: Summarizing and Explaining Wikipedia Edits at Scale Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 0SemGrasp: Semantic Grasp Generation via Language Aligned Discretization Apr 4, 2024 Grasp Generation Language Modeling
— Unverified 0Diverse and Tailored Image Generation for Zero-shot Multi-label Classification Apr 4, 2024 Image Generation Language Modelling
— Unverified 0Towards Pareto Optimal Throughput in Small Language Model Serving Apr 4, 2024 Language Modeling Language Modelling
— Unverified 0DeViDe: Faceted medical knowledge for improved medical vision-language pre-training Apr 4, 2024 Language Modelling Large Language Model
— Unverified 0MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 4Sailor: Open Language Models for South-East Asia Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 4Attributions toward Artificial Agents in a modified Moral Turing Test Apr 3, 2024 Language Modelling
— Unverified 0CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech Apr 3, 2024 Language Modeling Language Modelling
— Unverified 0From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives Apr 3, 2024 Decision Making Language Modeling
— Unverified 0Improving Topic Relevance Model by Mix-structured Summarization and LLM-based Data Augmentation Apr 3, 2024 Data Augmentation Language Modeling
— Unverified 0Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison Apr 3, 2024 Diversity In-Context Learning
— Unverified 0AWOL: Analysis WithOut synthesis using Language Apr 3, 2024 Language Modelling
— Unverified 0