| PuoBERTa: Training and evaluation of a curated language model for Setswana | Oct 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model | Oct 13, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction | Oct 13, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| The Consensus Game: Language Model Generation via Equilibrium Search | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large language models can replicate cross-cultural differences in personality | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models for Scientific Synthesis, Inference and Explanation | Oct 12, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Language Models are Universal Embedders | Oct 12, 2023 | Code SearchLanguage Modeling | CodeCode Available | 1 |
| Multimodal Large Language Model for Visual Navigation | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Toward Joint Language Modeling for Speech Units and Text | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Evaluating Generalist Agents: An Automated Benchmark in Open World | Oct 12, 2023 | BenchmarkingDiversity | CodeCode Available | 1 |
| Mapping Memes to Words for Multimodal Hateful Meme Classification | Oct 12, 2023 | Hateful Meme ClassificationLanguage Modeling | CodeCode Available | 1 |
| Harnessing Large Language Models' Empathetic Response Generation Capabilities for Online Mental Health Counselling Support | Oct 12, 2023 | Empathetic Response GenerationLanguage Modeling | —Unverified | 0 |
| GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models | Oct 12, 2023 | Answer GenerationHallucination | CodeCode Available | 0 |
| Context Compression for Auto-regressive Transformers with Sentinel Tokens | Oct 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GameGPT: Multi-agent Collaborative Framework for Game Development | Oct 12, 2023 | Code GenerationHallucination | —Unverified | 0 |
| Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability | Oct 12, 2023 | Causal Language ModelingIn-Context Learning | CodeCode Available | 0 |
| Expanding the Vocabulary of BERT for Knowledge Base Construction | Oct 12, 2023 | Knowledge Base ConstructionKnowledge Base Population | CodeCode Available | 0 |
| Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning | Oct 12, 2023 | Image CaptioningImage-text Retrieval | —Unverified | 0 |
| HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science | Oct 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LangNav: Language as a Perceptual Representation for Navigation | Oct 11, 2023 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Crosslingual Structural Priming and the Pre-Training Dynamics of Bilingual Language Models | Oct 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Measuring Feature Sparsity in Language Models | Oct 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models | Oct 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving | Oct 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation | Oct 11, 2023 | Caption GenerationDecoder | —Unverified | 0 |
| A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models | Oct 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PHALM: Building a Knowledge Graph from Scratch by Prompting Humans and a Language Model | Oct 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MatChat: A Large Language Model and Application Service Platform for Materials Science | Oct 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLark: A Multimodal Instruction-Following Language Model for Music | Oct 11, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Cognate Transformer for Automated Phonological Reconstruction and Cognate Reflex Prediction | Oct 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Ferret: Refer and Ground Anything Anywhere at Any Granularity | Oct 11, 2023 | HallucinationLanguage Modeling | CodeCode Available | 5 |
| Fast-ELECTRA for Efficient Pre-training | Oct 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ClausewitzGPT Framework: A New Frontier in Theoretical Large Language Model Enhanced Information Operations | Oct 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Answer Candidate Type Selection: Text-to-Text Language Model for Closed Book Question Answering Meets Knowledge Graphs | Oct 10, 2023 | Graph Question AnsweringKnowledge Graphs | —Unverified | 0 |
| Prosody Analysis of Audiobooks | Oct 10, 2023 | AttributeLanguage Modeling | CodeCode Available | 0 |
| Acoustic Model Fusion for End-to-end Speech Recognition | Oct 10, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Dobby: A Conversational Service Robot Driven by GPT-4 | Oct 10, 2023 | AI AgentDecision Making | —Unverified | 0 |
| The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Mistral 7B | Oct 10, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MuseChat: A Conversational Music Recommendation System for Videos | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach | Oct 10, 2023 | BenchmarkingCode Generation | CodeCode Available | 1 |
| SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA | Oct 10, 2023 | DiversityIn-Context Learning | CodeCode Available | 0 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Bridging Items and Language: A Transition Paradigm for Large Language Model-Based Recommendation | Oct 10, 2023 | AttributeLanguage Modeling | —Unverified | 0 |
| Making Large Language Models Perform Better in Knowledge Graph Completion | Oct 10, 2023 | In-Context LearningKnowledge Graph Completion | CodeCode Available | 2 |
| Get the gist? Using large language models for few-shot decontextualization | Oct 10, 2023 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency | Oct 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model | Oct 10, 2023 | Code GenerationCode Translation | —Unverified | 0 |
| Rethinking Memory and Communication Cost for Efficient Large Language Model Training | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |