| The Differences Between Direct Alignment Algorithms are a Blur | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scalable Language Models with Posterior Inference of Latent Thought Vectors | Feb 3, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Position: Towards a Responsible LLM-empowered Multi-Agent Systems | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-Tuning | Feb 3, 2025 | Data ValuationLanguage Modeling | CodeCode Available | 0 |
| Scaling Embedding Layers in Language Models | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Explaining Context Length Scaling and Bounds for Language Models | Feb 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Learning to Learn Weight Generation via Local Consistency Diffusion | Feb 3, 2025 | Domain GeneralizationFew-Shot Learning | —Unverified | 0 |
| InfoBridge: Mutual Information estimation via Bridge Matching | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Latent Lexical Projection in Large Language Models: A Novel Approach to Implicit Representation Refinement | Feb 3, 2025 | Computational EfficiencyDiversity | —Unverified | 0 |
| Eliciting Language Model Behaviors with Investigator Agents | Feb 3, 2025 | Bayesian InferenceHallucination | —Unverified | 0 |
| Knowledge Synthesis of Photosynthesis Research Using a Large Language Model | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ConditionNET: Learning Preconditions and Effects for Execution Monitoring | Feb 3, 2025 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| An Inquiry into Datacenter TCO for LLM Inference with FP8 | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search | Feb 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Agent-Based Uncertainty Awareness Improves Automated Radiology Report Labeling with an Open-Source Large Language Model | Feb 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LIBRA: Measuring Bias of Large Language Model from a Local Context | Feb 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Language Models Use Trigonometry to Do Addition | Feb 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLM Safety Alignment is Divergence Estimation in Disguise | Feb 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Avoiding exp(R_max) scaling in RLHF through Preference-based Exploration | Feb 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Decision-informed Neural Networks with Large Language Model Integration for Portfolio Optimization | Feb 2, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Vision-centric Token Compression in Large Language Model | Feb 2, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| OrcaLoca: An LLM Agent Framework for Software Issue Localization | Feb 1, 2025 | Code SearchLanguage Modeling | —Unverified | 0 |
| Doing More with Less -- Implementing Routing Strategies in Large Language Model-Based Systems: An Extended Survey | Feb 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Token Filtering Efficiency in Large Language Model Training with Collider | Feb 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation | Feb 1, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| A statistically consistent measure of Semantic Variability using Language Models | Feb 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities | Jan 31, 2025 | Code GenerationHallucination | —Unverified | 0 |
| Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow | Jan 31, 2025 | Domain GeneralizationLanguage Modeling | —Unverified | 0 |
| Brain-inspired sparse training enables Transformers and LLMs to perform as fully connected | Jan 31, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Improving LLM Unlearning Robustness via Random Perturbations | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can AI Solve the Peer Review Crisis? A Large Scale Cross Model Experiment of LLMs' Performance and Biases in Evaluating over 1000 Economics Papers | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Efficient Approach for Machine Translation on Low-resource Languages: A Case Study in Vietnamese-Chinese | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Estimating the Probability of Sampling a Trained Neural Network at Random | Jan 31, 2025 | Inductive BiasLanguage Modeling | —Unverified | 0 |
| Structural Embedding Projection for Contextual Large Language Model Inference | Jan 31, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Towards the Worst-case Robustness of Large Language Models | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scaling Laws for Differentially Private Language Models | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions | Jan 31, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Resolving Editing-Unlearning Conflicts: A Knowledge Codebook Framework for Large Language Model Updating | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Vision-Language Model Selection and Reuse for Downstream Adaptation | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering | Jan 30, 2025 | General KnowledgeLanguage Modeling | —Unverified | 0 |
| Exploring Audio Editing Features as User-Centric Privacy Defenses Against Large Language Model(LLM) Based Emotion Inference Attacks | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation | Jan 30, 2025 | Deep Reinforcement LearningExperimental Design | —Unverified | 0 |
| Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking | Jan 30, 2025 | Fact CheckingLanguage Modeling | —Unverified | 0 |
| Differentially Private Steering for Large Language Model Alignment | Jan 30, 2025 | HallucinationInference Attack | CodeCode Available | 0 |