| A Survey on the Memory Mechanism of Large Language Model based Agents | Apr 21, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Generating Daylight-driven Architectural Design via Diffusion Models | Apr 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Intrusion Detection at Scale with the Assistance of a Command-line Language Model | Apr 20, 2024 | Intrusion Detection, Language Modeling | Unverified | 0 |
| F5C-finder: An Explainable and Ensemble Biological Language Model for Predicting 5-Formylcytidine Modifications on mRNA | Apr 20, 2024 | Ensemble Learning, Language Modeling | Code Available | 0 |
| Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Apr 19, 2024 | Language Modeling, Language Modelling | Code Available | 4 |
| LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | Apr 19, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Heterogeneous Subgraph Transformer for Fake News Detection | Apr 19, 2024 | Fake News Detection, Language Modeling | Code Available | 0 |
| Beyond Self-Consistency: Ensemble Reasoning Boosts Consistency and Accuracy of LLMs in Cancer Staging | Apr 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| FineRec: Exploring Fine-grained Sequential Recommendation | Apr 19, 2024 | Attribute, Diversity | Code Available | 1 |
| Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model | Apr 19, 2024 | Human-Object Interaction Detection, Language Modeling | Unverified | 0 |
| LiMe: a Latin Corpus of Late Medieval Criminal Sentences | Apr 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| DeepLocalization: Using change point detection for Temporal Action Localization | Apr 18, 2024 | Action Localization, Change Point Detection | Unverified | 0 |
| Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration | Apr 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Augmenting emotion features in irony detection with Large language modeling | Apr 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Aligning Language Models to Explicitly Handle Ambiguity | Apr 18, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| From r to Q^*: Your Language Model is Secretly a Q-Function | Apr 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| RAGAR, Your Falsehood Radar: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models | Apr 18, 2024 | Fact Checking, Language Modeling | Unverified | 0 |
| Length Generalization of Causal Transformers without Position Encoding | Apr 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Skeleton: A New Framework for Accelerating Language Models via Task Neuron Localized Prompt Tuning | Apr 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting | Apr 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhance Robustness of Language Models Against Variation Attack through Graph Integration | Apr 18, 2024 | Diversity, Language Modeling | Unverified | 0 |
| MCRanker: Generating Diverse Criteria On-the-Fly to Improve Point-wise LLM Rankers | Apr 18, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Language Models Still Struggle to Zero-shot Reason about Time Series | Apr 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Stepwise Alignment for Constrained Language Model Policy Optimization | Apr 17, 2024 | Computational Efficiency, Language Modeling | Code Available | 0 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Apr 17, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory | Apr 17, 2024 | Computational Efficiency, Language Modeling | Unverified | 0 |
| ViLLM-Eval: A Comprehensive Evaluation Suite for Vietnamese Large Language Models | Apr 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese | Apr 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding | Apr 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| E2ETune: End-to-End Knob Tuning via Fine-tuned Generative Language Model | Apr 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| VG4D: Vision-Language Model Goes 4D Video Recognition | Apr 17, 2024 | Action Recognition, Autonomous Driving | Code Available | 1 |
| Lightweight Unsupervised Federated Learning with Pretrained Vision Language Model | Apr 17, 2024 | Federated Learning, Language Modeling | Unverified | 0 |
| Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models | Apr 17, 2024 | Form, Language Model Evaluation | Code Available | 0 |
| Prompt-Guided Generation of Structured Chest X-Ray Report Using a Pre-trained LLM | Apr 17, 2024 | Anatomy, Language Modeling | Unverified | 0 |
| Grounded Language Agent for Product Search via Intelligent Web Interactions | Apr 16, 2024 | Domain Adaptation, In-Context Learning | Code Available | 0 |
| Forcing Diffuse Distributions out of Language Models | Apr 16, 2024 | Dataset Generation, Diversity | Code Available | 1 |
| Fewer Truncations Improve Language Modeling | Apr 16, 2024 | Combinatorial Optimization, Hallucination | Unverified | 0 |
| Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| More Room for Language: Investigating the Effect of Retrieval on Language Models | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Autoregressive Pre-Training on Pixels and Texts | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Spiral of Silence: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering | Apr 16, 2024 | Information Retrieval, Language Modeling | Code Available | 1 |
| Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification | Apr 16, 2024 | Feature Engineering, Language Modeling | Code Available | 3 |
| Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training | Apr 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Future Language Modeling from Temporal Document History | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| HLAT: High-quality Large Language Model Pre-trained on AWS Trainium | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Vocabulary-free Image Classification and Semantic Segmentation | Apr 16, 2024 | Classification, image-classification | Code Available | 0 |
| From a Lossless (~1.5:1) Compression Algorithm for Llama2 7B Weights to Variable Precision, Variable Range, Compressed Numeric Data Types for CNNs and LLMs | Apr 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering | Apr 16, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Exact and Efficient Unlearning for Large Language Model-based Recommendation | Apr 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |