| A Large Language Model Enhanced Sequential Recommender for Joint Video and Comment Recommendation | Mar 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model | Mar 20, 2024 | Drug DiscoveryKnowledge Distillation | CodeCode Available | 1 |
| Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation | Mar 19, 2024 | Gloss-free Sign Language TranslationLanguage Modeling | CodeCode Available | 1 |
| Subjective-Aligned Dataset and Metric for Text-to-Video Quality Assessment | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Training A Small Emotional Vision Language Model for Visual Art Comprehension | Mar 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model | Mar 15, 2024 | Cloze TestDistractor Generation | CodeCode Available | 1 |
| Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Mar 14, 2024 | Decoderimage-classification | CodeCode Available | 1 |
| MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions | Mar 12, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Mar 11, 2024 | AttributeDescriptive | CodeCode Available | 1 |
| ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model | Mar 11, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 |
| Personalized LoRA for Human-Centered Text Understanding | Mar 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes | Mar 9, 2024 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 |
| Algorithmic progress in language models | Mar 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | Mar 8, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought | Mar 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Mar 2, 2024 | 16kCPU | CodeCode Available | 1 |
| An Interpretable Ensemble of Graph and Language Models for Improving Search Relevance in E-Commerce | Mar 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Merging Text Transformer Models from Different Initializations | Mar 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Resonance RoPE: Improving Context Length Generalization of Large Language Models | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings | Feb 29, 2024 | Conditional Text GenerationDecoder | CodeCode Available | 1 |
| SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model | Feb 28, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Grounding Language Models for Visual Entity Recognition | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CogBench: a large language model walks into a psychology lab | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents | Feb 27, 2024 | Document ClassificationLanguage Modeling | CodeCode Available | 1 |
| XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection | Feb 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Language Model based Framework for New Concept Placement in Ontologies | Feb 27, 2024 | Contrastive LearningEntity Linking | CodeCode Available | 1 |
| Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression | Feb 25, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| MATHWELL: Generating Educational Math Word Problems Using Teacher Annotations | Feb 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Empowering Large Language Model Agents through Action Learning | Feb 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 |
| Self-Retrieval: End-to-End Information Retrieval with One Large Language Model | Feb 23, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| RelayAttention for Efficient Large Language Model Serving with Long System Prompts | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Watermarking Makes Language Models Radioactive | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models | Feb 22, 2024 | Information RetrievalInstruction Following | CodeCode Available | 1 |
| Balanced Data Sampling for Language Model Training with Clustering | Feb 22, 2024 | ClusteringLanguage Modeling | CodeCode Available | 1 |
| Uncertainty-Aware Evaluation for Vision-Language Models | Feb 22, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| Q-Probe: A Lightweight Approach to Reward Maximization for Language Models | Feb 22, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials | Feb 22, 2024 | Chart Question AnsweringLanguage Modeling | CodeCode Available | 1 |
| LLMBind: A Unified Modality-Task Integration Framework | Feb 22, 2024 | AI AgentAudio Generation | CodeCode Available | 1 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Analysing The Impact of Sequence Composition on Language Model Pre-Training | Feb 21, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation | Feb 21, 2024 | DiversityIn-Context Learning | CodeCode Available | 1 |
| Breaking the HISCO Barrier: Automatic Occupational Standardization with OccCANINE | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |