| Low-Rank Adapting Models for Sparse Autoencoders | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Estimating the Probability of Sampling a Trained Neural Network at Random | Jan 31, 2025 | Inductive BiasLanguage Modeling | —Unverified | 0 |
| An Efficient Approach for Machine Translation on Low-resource Languages: A Case Study in Vietnamese-Chinese | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving LLM Unlearning Robustness via Random Perturbations | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Scalable-Softmax Is Superior for Attention | Jan 31, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| Structural Embedding Projection for Contextual Large Language Model Inference | Jan 31, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow | Jan 31, 2025 | Domain GeneralizationLanguage Modeling | —Unverified | 0 |
| Scaling Laws for Differentially Private Language Models | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards the Worst-case Robustness of Large Language Models | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models | Jan 31, 2025 | Caption GenerationLanguage Modeling | CodeCode Available | 4 |
| s1: Simple test-time scaling | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| Brain-inspired sparse training enables Transformers and LLMs to perform as fully connected | Jan 31, 2025 | GPULanguage Modeling | —Unverified | 0 |
| BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions | Jan 31, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities | Jan 31, 2025 | Code GenerationHallucination | —Unverified | 0 |
| Partially Rewriting a Transformer in Natural Language | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Fine-tuning LLaMA 2 interference: a comparative study of language implementations for optimal efficiency | Jan 30, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Exploring Audio Editing Features as User-Centric Privacy Defenses Against Large Language Model(LLM) Based Emotion Inference Attacks | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking | Jan 30, 2025 | Fact CheckingLanguage Modeling | —Unverified | 0 |
| Enhancing Large Language Model Efficiencyvia Symbolic Compression: A Formal Approach Towards Interpretability | Jan 30, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering | Jan 30, 2025 | General KnowledgeLanguage Modeling | —Unverified | 0 |
| Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation | Jan 30, 2025 | Deep Reinforcement LearningExperimental Design | —Unverified | 0 |
| Differentially Private Steering for Large Language Model Alignment | Jan 30, 2025 | HallucinationInference Attack | CodeCode Available | 0 |
| CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |