| DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | May 29, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Kotlin ML Pack: Technical Report | May 29, 2024 | Code GenerationHumanEval | —Unverified | 0 |
| Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | May 29, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series | May 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Learning from Litigation: Graphs and LLMs for Retrieval and Reasoning in eDiscovery | May 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards a theory of how the structure of language is acquired by deep neural networks | May 28, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| Black-Box Detection of Language Model Watermarks | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning diverse attacks on large language models for robust red-teaming and safety tuning | May 28, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | May 28, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Don't Forget to Connect! Improving RAG with Graph-based Reranking | May 28, 2024 | Abstract Meaning RepresentationLanguage Modeling | —Unverified | 0 |
| Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| IAPT: Instruction-Aware Prompt Tuning for Large Language Models | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters | May 28, 2024 | GPULanguage Modeling | CodeCode Available | 0 |
| Detection-Correction Structure via General Language Model for Grammatical Error Correction | May 28, 2024 | Grammatical Error CorrectionLanguage Modeling | CodeCode Available | 1 |
| XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise Inference | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Circuits in Pretrained Transformers | May 28, 2024 | In-Context Learningknowledge editing | CodeCode Available | 2 |
| Automated Real-World Sustainability Data Generation from Images of Buildings | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Driven Curriculum Design for Mobile Networks | May 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Context-Aware Approach for Enhancing Data Imputation with Pre-trained Language Models | May 28, 2024 | ImputationLanguage Modeling | —Unverified | 0 |
| Facilitating Holistic Evaluations with LLMs: Insights from Scenario-Based Experiments | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass | May 28, 2024 | Code CompletionLanguage Modeling | CodeCode Available | 1 |
| SLMRec: Distilling Large Language Models into Small for Sequential Recommendation | May 28, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model | May 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| 4-bit Shampoo for Memory-Efficient Network Training | May 28, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| The Economic Implications of Large Language Model Selection on Earnings and Return on Investment: A Decision Theoretic Model | May 27, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| DeeperImpact: Optimizing Sparse Learned Index Structures | May 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Salutary Labeling with Zero Human Annotation | May 27, 2024 | Active LearningLanguage Modeling | —Unverified | 0 |
| Video Enriched Retrieval Augmented Generation Using Aligned Video Captions | May 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SMR: State Memory Replay for Long Sequence Modeling | May 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention | May 27, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| An Introduction to Vision-Language Modeling | May 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation | May 27, 2024 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| SelfCP: Compressing Over-Limit Prompt via the Frozen Large Language Model Itself | May 27, 2024 | DecoderIn-Context Learning | —Unverified | 0 |
| Glauber Generative Model: Discrete Diffusion Models via Binary Classification | May 27, 2024 | Binary ClassificationDenoising | —Unverified | 0 |
| Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs | May 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | May 27, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| Unlocking the Secrets of Linear Complexity Sequence Model from A Unified Perspective | May 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence | May 27, 2024 | Decision MakingDescriptive | —Unverified | 0 |
| The Expressive Capacity of State Space Models: A Formal Language Perspective | May 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TEII: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection | May 27, 2024 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| Interesting Scientific Idea Generation using Knowledge Graphs and LLMs: Evaluations with 100 Research Group Leaders | May 27, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| Benchmarking General-Purpose In-Context Learning | May 27, 2024 | BenchmarkingDecision Making | —Unverified | 0 |
| Advanced Language Model-based Translator for English-Vietnamese Translation | May 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Large Language Model-based multi-agent manufacturing system for intelligent shopfloor | May 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Code Repair with LLMs gives an Exploration-Exploitation Tradeoff | May 26, 2024 | Code RepairLanguage Modeling | —Unverified | 0 |
| CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion | May 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| Disentangling and Integrating Relational and Sensory Information in Transformer Architectures | May 26, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions | May 26, 2024 | Dialogue GenerationLanguage Modeling | —Unverified | 0 |
| KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge | May 26, 2024 | Graph EmbeddingInformativeness | CodeCode Available | 2 |