| Non-autoregressive Sequence-to-Sequence Vision-Language Models | Mar 4, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 |
| DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation | Mar 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rethinking LLM Language Adaptation: A Case Study on Chinese Mixtral | Mar 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Towards Intent-Based Network Management: Large Language Models for Intent Extraction in 5G Core Networks | Mar 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RegionGPT: Towards Region Understanding Vision Language Model | Mar 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SyllabusQA: A Course Logistics Question Answering Dataset | Mar 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| OVEL: Large Language Model as Memory Manager for Online Video Entity Linking | Mar 3, 2024 | Entity LinkingLanguage Modeling | —Unverified | 0 |
| Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics | Mar 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Revisiting Dynamic Evaluation: Online Adaptation for Large Language Models | Mar 3, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| GuardT2I: Defending Text-to-Image Models from Adversarial Prompts | Mar 3, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 3 |
| NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Mar 2, 2024 | 16kCPU | CodeCode Available | 1 |
| Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning | Mar 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks | Mar 2, 2024 | Computer SecurityLanguage Modeling | —Unverified | 0 |
| IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Mar 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| OpenGraph: Towards Open Graph Foundation Models | Mar 2, 2024 | Data AugmentationGraph Learning | CodeCode Available | 3 |
| SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code | Mar 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chaining thoughts and LLMs to learn DNA structural biophysics | Mar 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LAB: Large-Scale Alignment for ChatBots | Mar 2, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 5 |
| An Interpretable Ensemble of Graph and Language Models for Improving Search Relevance in E-Commerce | Mar 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Merging Text Transformer Models from Different Initializations | Mar 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) | Mar 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AXOLOTL: Fairness through Assisted Self-Debiasing of Large Language Model Outputs | Mar 1, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Resonance RoPE: Improving Context Length Generalization of Large Language Models | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LLM-Ensemble: Optimal Large Language Model Ensemble Method for E-commerce Product Attribute Value Extraction | Feb 29, 2024 | AttributeAttribute Extraction | —Unverified | 0 |
| FAC^2E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition | Feb 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PaECTER: Patent-level Representation Learning using Citation-informed Transformers | Feb 29, 2024 | Citation PredictionLanguage Modeling | —Unverified | 0 |
| Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model | Feb 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VIXEN: Visual Text Comparison Network for Image Difference Captioning | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings | Feb 29, 2024 | Conditional Text GenerationDecoder | CodeCode Available | 1 |
| FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Feb 29, 2024 | GPULanguage Modeling | CodeCode Available | 5 |
| PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval | Feb 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Protein Structure Prediction Approach Leveraging Transformer and CNN Integration | Feb 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Merino: Entropy-driven Design for Generative Language Models on IoT Devices | Feb 28, 2024 | CPULanguage Modeling | —Unverified | 0 |
| ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training | Feb 28, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners | Feb 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery | Feb 28, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Diffusion Language Models Are Versatile Protein Learners | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Feb 28, 2024 | Computational Efficiencyimage-classification | —Unverified | 0 |
| Multi-FAct: Assessing Factuality of Multilingual LLMs using FActScore | Feb 28, 2024 | DiversityForm | CodeCode Available | 0 |
| SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model | Feb 28, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Grounding Language Models for Visual Entity Recognition | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Prospect Personalized Recommendation on Large Language Model-based Agent Platform | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ICE-SEARCH: A Language Model-Driven Feature Selection Approach | Feb 28, 2024 | Diabetes PredictionDisease Prediction | —Unverified | 0 |
| Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization | Feb 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CogBench: a large language model walks into a psychology lab | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction | Feb 28, 2024 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information | Feb 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |