| Estimating the Probability of Sampling a Trained Neural Network at Random | Jan 31, 2025 | Inductive BiasLanguage Modeling | —Unverified | 0 |
| Brain-inspired sparse training enables Transformers and LLMs to perform as fully connected | Jan 31, 2025 | GPULanguage Modeling | —Unverified | 0 |
| BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions | Jan 31, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Scaling Laws for Differentially Private Language Models | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving LLM Unlearning Robustness via Random Perturbations | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities | Jan 31, 2025 | Code GenerationHallucination | —Unverified | 0 |
| An Efficient Approach for Machine Translation on Low-resource Languages: A Case Study in Vietnamese-Chinese | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards the Worst-case Robustness of Large Language Models | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| s1: Simple test-time scaling | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| Partially Rewriting a Transformer in Natural Language | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Structural Embedding Projection for Contextual Large Language Model Inference | Jan 31, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models | Jan 31, 2025 | Caption GenerationLanguage Modeling | CodeCode Available | 4 |
| Low-Rank Adapting Models for Sparse Autoencoders | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Scalable-Softmax Is Superior for Attention | Jan 31, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow | Jan 31, 2025 | Domain GeneralizationLanguage Modeling | —Unverified | 0 |
| Fine-tuning LLaMA 2 interference: a comparative study of language implementations for optimal efficiency | Jan 30, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering | Jan 30, 2025 | General KnowledgeLanguage Modeling | —Unverified | 0 |
| Enhancing Large Language Model Efficiencyvia Symbolic Compression: A Formal Approach Towards Interpretability | Jan 30, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking | Jan 30, 2025 | Fact CheckingLanguage Modeling | —Unverified | 0 |
| Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation | Jan 30, 2025 | Deep Reinforcement LearningExperimental Design | —Unverified | 0 |
| Exploring Audio Editing Features as User-Centric Privacy Defenses Against Large Language Model(LLM) Based Emotion Inference Attacks | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training | Jan 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Loss Functions and Operators Generated by f-Divergences | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Differentially Private Steering for Large Language Model Alignment | Jan 30, 2025 | HallucinationInference Attack | CodeCode Available | 0 |
| Vision-Language Model Selection and Reuse for Downstream Adaptation | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents | Jan 30, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study | Jan 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Perforated Backpropagation: A Neuroscience Inspired Extension to Artificial Neural Networks | Jan 29, 2025 | Drug DiscoveryLanguage Modeling | CodeCode Available | 0 |
| Prompt-oriented Output of Culture-Specific Items in Translated African Poetry by Large Language Model: An Initial Multi-layered Tabular Review | Jan 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction | Jan 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| 2SSP: A Two-Stage Framework for Structured Pruning of LLMs | Jan 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Leveraging Multimodal LLM for Inspirational User Interface Search | Jan 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models | Jan 29, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| DINT Transformer | Jan 29, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Learning Free Token Reduction for Multi-Modal Large Language Models | Jan 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From tools to thieves: Measuring and understanding public perceptions of AI through crowdsourced metaphors | Jan 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching | Jan 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights | Jan 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DReSS: Data-driven Regularized Structured Streamlining for Large Language Models | Jan 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant | Jan 29, 2025 | AllDecision Making | CodeCode Available | 0 |
| Implementation of a Generative AI Assistant in K-12 Education: The CyberScholar Initiative | Jan 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model | Jan 28, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 2 |
| RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings | Jan 28, 2025 | DenoisingDomain Generalization | CodeCode Available | 1 |
| "Ownership, Not Just Happy Talk": Co-Designing a Participatory Large Language Model for Journalism | Jan 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue | Jan 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling | Jan 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |