| Player-Driven Emergence in LLM-Driven Game Narrative | Apr 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Class Membership Relations in Knowledge Graphs using Large Language Models | Apr 25, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| Tele-FLM Technical Report | Apr 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically | Apr 25, 2024 | Inductive BiasLanguage Modeling | CodeCode Available | 0 |
| REBEL: Reinforcement Learning via Regressing Relative Rewards | Apr 25, 2024 | continuous-controlContinuous Control | CodeCode Available | 2 |
| How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites | Apr 25, 2024 | 4kLanguage Modeling | —Unverified | 0 |
| Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model | Apr 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Using Artificial Intelligence to Unlock Crowdfunding Success for Small Businesses | Apr 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Step Differences in Instructional Video | Apr 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Attacks on Third-Party APIs of Large Language Models | Apr 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CORM: Cache Optimization with Recent Message for Large Language Model Inference | Apr 24, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Beyond ESM2: Graph-Enhanced Protein Sequence Modeling with Efficient Clustering | Apr 24, 2024 | ClusteringDiversity | —Unverified | 0 |
| Detecting Conceptual Abstraction in LLMs | Apr 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generalization Measures for Zero-Shot Cross-Lingual Transfer | Apr 24, 2024 | Cross-Lingual TransferLanguage Model Evaluation | —Unverified | 0 |
| Nyonic Technical Report | Apr 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Knowledge Graph Completion using Structural and Textual Embeddings | Apr 24, 2024 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 0 |
| Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering | Apr 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents | Apr 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Efficient Patient Recruitment for Clinical Trials: Application of a Prompt-Based Learning Model | Apr 24, 2024 | Extractive SummarizationLanguage Modeling | —Unverified | 0 |
| A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry | Apr 24, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm | Apr 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Detection of circular permutations by Protein Language Models | Apr 23, 2024 | Computational EfficiencyLanguage Modeling | CodeCode Available | 0 |
| Setting up the Data Printer with Improved English to Ukrainian Machine Translation | Apr 23, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Apr 23, 2024 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference | Apr 23, 2024 | DecoderIn-Context Learning | —Unverified | 0 |
| Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation | Apr 23, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Pegasus-v1 Technical Report | Apr 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RealTCD: Temporal Causal Discovery from Interventional Data with Large Language Model | Apr 23, 2024 | Causal Discoverygraph construction | —Unverified | 0 |
| Multi-Head Mixture-of-Experts | Apr 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies | Apr 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ClinicalAgent: Clinical Trial Multi-Agent System with Large Language Model-based Reasoning | Apr 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Retrieval Augmented Generation for Domain-specific Question Answering | Apr 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction | Apr 22, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Apr 22, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Q-Tuning: Queue-based Prompt Tuning for Lifelong Few-shot Language Learning | Apr 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding the role of FFNs in driving multilingual behaviour in LLMs | Apr 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels | Apr 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Pixels and Predictions: Potential of GPT-4V in Meteorological Imagery Analysis and Forecast Communication | Apr 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OpenELM: An Efficient Language Model Family with Open Training and Inference Framework | Apr 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| From LLM to NMT: Advancing Low-Resource Machine Translation with Claude | Apr 22, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Apr 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Multimodal Automated Interpretability Agent | Apr 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models | Apr 22, 2024 | HallucinationInformativeness | CodeCode Available | 1 |
| CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment | Apr 22, 2024 | Action Quality AssessmentAction Recognition | CodeCode Available | 1 |
| PARAMANU-GANITA: Language Model with Mathematical Capabilities | Apr 22, 2024 | Domain AdaptationGSM8K | —Unverified | 0 |
| Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone | Apr 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Apr 22, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| Automated Text Mining of Experimental Methodologies from Biomedical Literature | Apr 21, 2024 | ArticlesClassification | —Unverified | 0 |
| Socratic Planner: Self-QA-Based Zero-Shot Planning for Embodied Instruction Following | Apr 21, 2024 | In-Context LearningInstruction Following | —Unverified | 0 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPULanguage Modeling | CodeCode Available | 1 |