| Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model | Oct 13, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| PuoBERTa: Training and evaluation of a curated language model for Setswana | Oct 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction | Oct 13, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| Welfare Diplomacy: Benchmarking Language Model Cooperation | Oct 13, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Large language models can replicate cross-cultural differences in personality | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Toward Joint Language Modeling for Speech Units and Text | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Models are Universal Embedders | Oct 12, 2023 | Code SearchLanguage Modeling | CodeCode Available | 1 |
| Large Language Models for Scientific Synthesis, Inference and Explanation | Oct 12, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning | Oct 12, 2023 | Image CaptioningImage-text Retrieval | —Unverified | 0 |
| Harnessing Large Language Models' Empathetic Response Generation Capabilities for Online Mental Health Counselling Support | Oct 12, 2023 | Empathetic Response GenerationLanguage Modeling | —Unverified | 0 |
| GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models | Oct 12, 2023 | Answer GenerationHallucination | CodeCode Available | 0 |
| Context Compression for Auto-regressive Transformers with Sentinel Tokens | Oct 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Multimodal Large Language Model for Visual Navigation | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GameGPT: Multi-agent Collaborative Framework for Game Development | Oct 12, 2023 | Code GenerationHallucination | —Unverified | 0 |
| Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability | Oct 12, 2023 | Causal Language ModelingIn-Context Learning | CodeCode Available | 0 |
| Towards Evaluating Generalist Agents: An Automated Benchmark in Open World | Oct 12, 2023 | BenchmarkingDiversity | CodeCode Available | 1 |
| Expanding the Vocabulary of BERT for Knowledge Base Construction | Oct 12, 2023 | Knowledge Base ConstructionKnowledge Base Population | CodeCode Available | 0 |
| Mapping Memes to Words for Multimodal Hateful Meme Classification | Oct 12, 2023 | Hateful Meme ClassificationLanguage Modeling | CodeCode Available | 1 |
| HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science | Oct 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Measuring Feature Sparsity in Language Models | Oct 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LangNav: Language as a Perceptual Representation for Navigation | Oct 11, 2023 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Crosslingual Structural Priming and the Pre-Training Dynamics of Bilingual Language Models | Oct 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models | Oct 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving | Oct 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation | Oct 11, 2023 | Caption GenerationDecoder | —Unverified | 0 |