| Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | May 29, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | May 29, 2024 | Diversity, Language Modeling | Code Available | 1 |
| Learning from Litigation: Graphs and LLMs for Retrieval and Reasoning in eDiscovery | May 29, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series | May 29, 2024 | Language Modeling, Language Modelling | Code Available | 4 |
| Kotlin ML Pack: Technical Report | May 29, 2024 | Code Generation, HumanEval | Unverified | 0 |
| Towards a theory of how the structure of language is acquired by deep neural networks | May 28, 2024 | Articles, Language Modeling | Unverified | 0 |
| Black-Box Detection of Language Model Watermarks | May 28, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Learning diverse attacks on large language models for robust red-teaming and safety tuning | May 28, 2024 | Diversity, Language Modeling | Code Available | 1 |
| XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise Inference | May 28, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| SLMRec: Distilling Large Language Models into Small for Sequential Recommendation | May 28, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| Don't Forget to Connect! Improving RAG with Graph-based Reranking | May 28, 2024 | Abstract Meaning Representation, Language Modeling | Unverified | 0 |
| Detection-Correction Structure via General Language Model for Grammatical Error Correction | May 28, 2024 | Grammatical Error Correction, Language Modeling | Code Available | 1 |
| Large Language Model-Driven Curriculum Design for Mobile Networks | May 28, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Automated Real-World Sustainability Data Generation from Images of Buildings | May 28, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier | May 28, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Knowledge Circuits in Pretrained Transformers | May 28, 2024 | In-Context Learning, knowledge editing | Code Available | 2 |
| Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning | May 28, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model | May 28, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| A Context-Aware Approach for Enhancing Data Imputation with Pre-trained Language Models | May 28, 2024 | Imputation, Language Modeling | Unverified | 0 |
| Facilitating Holistic Evaluations with LLMs: Insights from Scenario-Based Experiments | May 28, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| IAPT: Instruction-Aware Prompt Tuning for Large Language Models | May 28, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | May 28, 2024 | Decision Making, Language Modeling | Code Available | 1 |
| Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass | May 28, 2024 | Code Completion, Language Modeling | Code Available | 1 |
| Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters | May 28, 2024 | GPU, Language Modeling | Code Available | 0 |
| 4-bit Shampoo for Memory-Efficient Network Training | May 28, 2024 | image-classification, Image Classification | Code Available | 1 |