| Learning to Maximize Mutual Information for Chain-of-Thought Distillation | Mar 5, 2024 | Knowledge Distillation, Language Modeling | Code Available | 0 |
| Evaluating and Optimizing Educational Content with Large Language Model Judgments | Mar 5, 2024 | Language Modeling | Code Available | 0 |
| DPPA: Pruning Method for Large Language Model to Model Merging | Mar 5, 2024 | Language Modeling | Code Available | 0 |
| Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment | Mar 5, 2024 | Contrastive Learning, Data Augmentation | Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| MeanCache: User-Centric Semantic Caching for LLM Web Services | Mar 5, 2024 | Federated Learning, Language Modeling | Unverified | 0 |
| Towards Training A Chinese Large Language Model for Anesthesiology | Mar 5, 2024 | Language Modeling | Unverified | 0 |
| Word Importance Explains How Prompts Affect Language Model Outputs | Mar 5, 2024 | Language Modeling | Unverified | 0 |
| SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection | Mar 5, 2024 | Concept Alignment, Explanation Generation | Unverified | 0 |
| Socratic Reasoning Improves Positive Text Rewriting | Mar 5, 2024 | Language Modeling | Unverified | 0 |
| RegionGPT: Towards Region Understanding Vision Language Model | Mar 4, 2024 | Language Modeling | Unverified | 0 |
| Non-autoregressive Sequence-to-Sequence Vision-Language Models | Mar 4, 2024 | Decoder, Language Modeling | Code Available | 0 |
| NoteLLM: A Retrievable Large Language Model for Note Recommendation | Mar 4, 2024 | Contrastive Learning, Language Modeling | Unverified | 0 |
| Towards Intent-Based Network Management: Large Language Models for Intent Extraction in 5G Core Networks | Mar 4, 2024 | Language Modeling | Unverified | 0 |
| How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE Transformers | Mar 4, 2024 | Few-Shot Learning, Language Modeling | Unverified | 0 |
| DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation | Mar 4, 2024 | Language Modeling | Unverified | 0 |
| Large Language Model-Based Evolutionary Optimizer: Reasoning with elitism | Mar 4, 2024 | Language Modeling | Unverified | 0 |
| LLM vs. Lawyers: Identifying a Subset of Summary Judgments in a Large UK Case Law Dataset | Mar 4, 2024 | Language Modeling | Code Available | 0 |
| Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics | Mar 3, 2024 | Language Modeling | Code Available | 0 |
| Revisiting Dynamic Evaluation: Online Adaptation for Large Language Models | Mar 3, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| SyllabusQA: A Course Logistics Question Answering Dataset | Mar 3, 2024 | Language Modeling | Code Available | 0 |
| OVEL: Large Language Model as Memory Manager for Online Video Entity Linking | Mar 3, 2024 | Entity Linking, Language Modeling | Unverified | 0 |
| SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code | Mar 2, 2024 | Language Modeling | Unverified | 0 |
| Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning | Mar 2, 2024 | Language Modeling | Unverified | 0 |
| AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks | Mar 2, 2024 | Computer Security, Language Modeling | Unverified | 0 |
| Chaining thoughts and LLMs to learn DNA structural biophysics | Mar 2, 2024 | Language Modeling | Code Available | 0 |
| BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) | Mar 1, 2024 | Language Modeling | Unverified | 0 |
| AXOLOTL: Fairness through Assisted Self-Debiasing of Large Language Model Outputs | Mar 1, 2024 | Fairness, Language Modeling | Unverified | 0 |
| LLM-Ensemble: Optimal Large Language Model Ensemble Method for E-commerce Product Attribute Value Extraction | Feb 29, 2024 | Attribute, Attribute Extraction | Unverified | 0 |
| A Protein Structure Prediction Approach Leveraging Transformer and CNN Integration | Feb 29, 2024 | Language Modeling | Unverified | 0 |
| FAC^2E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition | Feb 29, 2024 | Language Modeling | Unverified | 0 |
| VIXEN: Visual Text Comparison Network for Image Difference Captioning | Feb 29, 2024 | Language Modeling | Code Available | 0 |
| PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval | Feb 29, 2024 | Language Modeling | Unverified | 0 |
| PaECTER: Patent-level Representation Learning using Citation-informed Transformers | Feb 29, 2024 | Citation Prediction, Language Modeling | Unverified | 0 |
| Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model | Feb 29, 2024 | Language Modeling | Unverified | 0 |
| Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Feb 28, 2024 | Computational Efficiency, image-classification | Unverified | 0 |
| Prospect Personalized Recommendation on Large Language Model-based Agent Platform | Feb 28, 2024 | Language Modeling | Code Available | 0 |
| Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information | Feb 28, 2024 | Language Modeling | Unverified | 0 |
| Merino: Entropy-driven Design for Generative Language Models on IoT Devices | Feb 28, 2024 | CPU, Language Modeling | Unverified | 0 |
| Multi-FAct: Assessing Factuality of Multilingual LLMs using FActScore | Feb 28, 2024 | Diversity, Form | Code Available | 0 |
| MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery | Feb 28, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction | Feb 28, 2024 | Image Captioning, Language Modeling | Unverified | 0 |
| Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners | Feb 28, 2024 | Language Modeling | Unverified | 0 |
| ICE-SEARCH: A Language Model-Driven Feature Selection Approach | Feb 28, 2024 | Diabetes Prediction, Disease Prediction | Unverified | 0 |
| Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization | Feb 28, 2024 | Language Modeling | Unverified | 0 |
| BASES: Large-scale Web Search User Simulation with Large Language Model based Agents | Feb 27, 2024 | Information Retrieval, Language Modeling | Unverified | 0 |
| Large Language Model for Participatory Urban Planning | Feb 27, 2024 | Language Modeling | Unverified | 0 |
| A Neural Rewriting System to Solve Algorithmic Problems | Feb 27, 2024 | Language Modeling | Unverified | 0 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8k, Language Modeling | Code Available | 0 |
| OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web | Feb 27, 2024 | Language Modeling | Unverified | 0 |