| Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers | Apr 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM | Apr 7, 2024 | Cross-Lingual TransferLanguage Modeling | —Unverified | 0 |
| X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model | Apr 7, 2024 | Action RecognitionDecision Making | —Unverified | 0 |
| Towards Understanding the Influence of Reward Margin on Preference Model Performance | Apr 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse | Apr 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling | Apr 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hidden You Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Logic Chain Injection | Apr 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model (LLM) AI text generation detection based on transformer deep learning algorithm | Apr 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models | Apr 6, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Binary Classifier Optimization for Large Language Model Alignment | Apr 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Physics Event Classification Using Large Language Models | Apr 5, 2024 | ChatbotClassification | CodeCode Available | 0 |
| Implicit Bias of AdamW: _ Norm Constrained Optimization | Apr 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval | Apr 5, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation | Apr 5, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models | Apr 5, 2024 | Factual probeGeneral Knowledge | CodeCode Available | 1 |
| Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving | Apr 5, 2024 | Data AugmentationIn-Context Learning | —Unverified | 0 |
| Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model | Apr 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| player2vec: A Language Modeling Approach to Understand Player Behavior in Games | Apr 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CONFLARE: CONFormal LArge language model REtrieval | Apr 4, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| Understanding Language Modeling Paradigm Adaptations in Recommender Systems: Lessons Learned and Open Challenges | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Bias Amplification in Language Model Evolution: An Iterated Learning Perspective | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Edisum: Summarizing and Explaining Wikipedia Edits at Scale | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis | Apr 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AutoWebGLM: A Large Language Model-based Web Navigating Agent | Apr 4, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| Standardizing Knowledge Engineering Practices with a Reference Architecture | Apr 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Pareto Optimal Throughput in Small Language Model Serving | Apr 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning | Apr 4, 2024 | DescriptiveDiversity | —Unverified | 0 |
| SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | Apr 4, 2024 | Grasp GenerationLanguage Modeling | —Unverified | 0 |
| MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| nicolay-r at SemEval-2024 Task 3: Using Flan-T5 for Reasoning Emotion Cause in Conversations with Chain-of-Thought on Emotion States | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Sailor: Open Language Models for South-East Asia | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization | Apr 4, 2024 | GPULanguage Modeling | CodeCode Available | 0 |
| Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought | Apr 4, 2024 | Extractive Question-AnsweringKnowledge Distillation | —Unverified | 0 |
| CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Calibrating the Confidence of Large Language Models by Eliciting Fidelity | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Vocabulary Attack to Hijack Large Language Model Applications | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models | Apr 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ANGOFA: Leveraging OFA Embedding Initialization and Synthetic Data for Angolan Language Model | Apr 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Testing the Effect of Code Documentation on Large Language Model Code Understanding | Apr 3, 2024 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Improving Topic Relevance Model by Mix-structured Summarization and LLM-based Data Augmentation | Apr 3, 2024 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison | Apr 3, 2024 | DiversityIn-Context Learning | —Unverified | 0 |
| Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian | Apr 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model | Apr 3, 2024 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Towards Large Language Model driven Reference-less Translation Evaluation for English and Indian Languages | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives | Apr 3, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Apr 3, 2024 | 16kFew-Shot Text Classification | CodeCode Available | 0 |
| PhonologyBench: Evaluating Phonological Skills of Large Language Models | Apr 3, 2024 | DiagnosticGrapheme-to-Phoneme Conversion | —Unverified | 0 |
| Enhancing Human-Computer Interaction in Chest X-ray Analysis using Vision and Language Model with Eye Gaze Patterns | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |