| Prompt-based Depth Pruning of Large Language Models | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model | Feb 4, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| JingFang: A Traditional Chinese Medicine Large Language Model of Expert-Level Medical Diagnosis and Syndrome Differentiation-Based Treatment | Feb 4, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Reviving The Classics: Active Reward Modeling in Large Language Model Alignment | Feb 4, 2025 | Computational EfficiencyExperimental Design | CodeCode Available | 2 |
| LLM-USO: Large Language Model-based Universal Sizing Optimizer | Feb 4, 2025 | Bayesian OptimizationLanguage Modeling | —Unverified | 0 |
| Unlocking Efficient Large Inference Models: One-Bit Unrolling Tips the Scales | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Flatten Graphs as Sequences: Transformers are Scalable Graph Generators | Feb 4, 2025 | DecoderGraph Generation | —Unverified | 0 |
| MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Connections between Schedule-Free Optimizers, AdEMAMix, and Accelerated SGD Variants | Feb 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Analyzing Similarity Metrics for Data Selection for Language Model Pretraining | Feb 4, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues | Feb 4, 2025 | Dialogue InterpretationDialogue Understanding | —Unverified | 0 |
| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Feb 4, 2025 | Collaborative InferenceLanguage Modeling | CodeCode Available | 1 |
| ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Synthesis of Photosynthesis Research Using a Large Language Model | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Eliciting Language Model Behaviors with Investigator Agents | Feb 3, 2025 | Bayesian InferenceHallucination | —Unverified | 0 |
| InfoBridge: Mutual Information estimation via Bridge Matching | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scaling Embedding Layers in Language Models | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Learn Weight Generation via Local Consistency Diffusion | Feb 3, 2025 | Domain GeneralizationFew-Shot Learning | —Unverified | 0 |
| Scalable Language Models with Posterior Inference of Latent Thought Vectors | Feb 3, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| The Differences Between Direct Alignment Algorithms are a Blur | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-Tuning | Feb 3, 2025 | Data ValuationLanguage Modeling | CodeCode Available | 0 |
| Latent Lexical Projection in Large Language Models: A Novel Approach to Implicit Representation Refinement | Feb 3, 2025 | Computational EfficiencyDiversity | —Unverified | 0 |
| Explaining Context Length Scaling and Bounds for Language Models | Feb 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Inquiry into Datacenter TCO for LLM Inference with FP8 | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Position: Towards a Responsible LLM-empowered Multi-Agent Systems | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods | Feb 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Simulating Rumor Spreading in Social Networks using LLM Agents | Feb 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ConditionNET: Learning Preconditions and Effects for Execution Monitoring | Feb 3, 2025 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| Language Models Use Trigonometry to Do Addition | Feb 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search | Feb 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Agent-Based Uncertainty Awareness Improves Automated Radiology Report Labeling with an Open-Source Large Language Model | Feb 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Vision-centric Token Compression in Large Language Model | Feb 2, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Decision-informed Neural Networks with Large Language Model Integration for Portfolio Optimization | Feb 2, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| LLM Safety Alignment is Divergence Estimation in Disguise | Feb 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Avoiding exp(R_max) scaling in RLHF through Preference-based Exploration | Feb 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LIBRA: Measuring Bias of Large Language Model from a Local Context | Feb 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A statistically consistent measure of Semantic Variability using Language Models | Feb 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Doing More with Less -- Implementing Routing Strategies in Large Language Model-Based Systems: An Extended Survey | Feb 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation | Feb 1, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Speculative Ensemble: Fast Large Language Model Ensemble via Speculation | Feb 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enhancing Token Filtering Efficiency in Large Language Model Training with Collider | Feb 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MetaOpenFOAM 2.0: Large Language Model Driven Chain of Thought for Automating CFD Simulation and Post-Processing | Feb 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| OrcaLoca: An LLM Agent Framework for Software Issue Localization | Feb 1, 2025 | Code SearchLanguage Modeling | —Unverified | 0 |
| Resolving Editing-Unlearning Conflicts: A Knowledge Codebook Framework for Large Language Model Updating | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can AI Solve the Peer Review Crisis? A Large Scale Cross Model Experiment of LLMs' Performance and Biases in Evaluating over 1000 Economics Papers | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |