| Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education | Feb 9, 2025 | Image Retrieval, Language Modeling | Unverified | 0 |
| RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative Gastrointestinal Cancer Care | Feb 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| μnit Scaling: Simple and Scalable FP8 LLM Training | Feb 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| The Complexity of Learning Sparse Superposed Features with Feedback | Feb 8, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging | Feb 8, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning | Feb 7, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| RAG-Verus: Repository-Level Program Verification with LLMs using Retrieval Augmented Generation | Feb 7, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Prot2Chat: Protein LLM with Early-Fusion of Text, Sequence and Structure | Feb 7, 2025 | Answer Generation, Decoder | Code Available | 0 |
| Learning the Language of NVMe Streams for Ransomware Detection | Feb 7, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions | Feb 7, 2025 | Anomaly Detection, Image-text Retrieval | Unverified | 0 |
| Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research | Feb 7, 2025 | Decision Making, Language Modeling | Code Available | 0 |
| Concept Navigation and Classification via Open-Source Large Language Model Processing | Feb 7, 2025 | Articles, Language Modeling | Unverified | 0 |
| ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters | Feb 6, 2025 | Decoder, Language Modeling | Code Available | 0 |
| FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing | Feb 6, 2025 | Attribute, Bias Detection | Unverified | 0 |
| DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation | Feb 6, 2025 | Diversity, Language Modeling | Unverified | 0 |
| Adaptive Semantic Prompt Caching with VectorQ | Feb 6, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Contextual Gradient Flow Modeling for Large Language Model Generalization in Multi-Scale Feature Spaces | Feb 6, 2025 | Domain Adaptation, Language Modeling | Unverified | 0 |
| Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation | Feb 6, 2025 | Autonomous Driving, Decision Making | Unverified | 0 |
| Verifiable Format Control for Large Language Model Generations | Feb 6, 2025 | Benchmarking, Instruction Following | Unverified | 0 |
| RWKV-UI: UI Understanding with Enhanced Perception and Reasoning | Feb 6, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Overcoming Vision Language Model Challenges in Diagram Understanding: A Proof-of-Concept with XML-Driven Large Language Models Solutions | Feb 5, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| On Fairness of Unified Multimodal Large Language Model for Image Generation | Feb 5, 2025 | Fairness, Image Generation | Unverified | 0 |
| Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model Guided Self-Debugging Code Generation | Feb 5, 2025 | Code Generation, Computational Efficiency | Unverified | 0 |
| Control Search Rankings, Control the World: What is a Good Search Engine? | Feb 5, 2025 | Ethics, Information Retrieval | Unverified | 0 |
| A Contemporary Survey of Large Language Model Assisted Program Analysis | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model as Universal Retriever in Industrial-Scale Recommender System | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Efficient Vision Language Model Fine-tuning for Text-based Person Anomaly Search | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference | Feb 5, 2025 | Computational Efficiency, Language Modeling | Unverified | 0 |
| GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| FinBloom: Knowledge Grounding Large Language Model with Real-time Financial Data | Feb 4, 2025 | Algorithmic Trading, Articles | Unverified | 0 |
| LLM-USO: Large Language Model-based Universal Sizing Optimizer | Feb 4, 2025 | Bayesian Optimization, Language Modeling | Unverified | 0 |
| Analyzing Similarity Metrics for Data Selection for Language Model Pretraining | Feb 4, 2025 | Decoder, Language Modeling | Unverified | 0 |
| ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling | Feb 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Flatten Graphs as Sequences: Transformers are Scalable Graph Generators | Feb 4, 2025 | Decoder, Graph Generation | Unverified | 0 |
| JingFang: A Traditional Chinese Medicine Large Language Model of Expert-Level Medical Diagnosis and Syndrome Differentiation-Based Treatment | Feb 4, 2025 | Diagnostic, Language Modeling | Unverified | 0 |
| Connections between Schedule-Free Optimizers, AdEMAMix, and Accelerated SGD Variants | Feb 4, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Automating Mathematical Proof Generation Using Large Language Model Agents and Knowledge Graphs | Feb 4, 2025 | Formal Logic, Knowledge Graphs | Unverified | 0 |
| EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues | Feb 4, 2025 | Dialogue Interpretation, Dialogue Understanding | Unverified | 0 |
| MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving | Feb 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Unlocking Efficient Large Inference Models: One-Bit Unrolling Tips the Scales | Feb 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Position: Stop Acting Like Language Model Agents Are Normal Agents | Feb 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Prompt-based Depth Pruning of Large Language Models | Feb 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model | Feb 4, 2025 | Instruction Following, Language Modeling | Unverified | 0 |
| Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models | Feb 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |