| Reinforced Large Language Model is a formal theorem prover | Feb 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Logical forms complement probability in understanding language model (and human) performance | Feb 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AIDE: Agentically Improve Visual Language Model with Domain Experts | Feb 13, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| On Mechanistic Circuits for Extractive Question-Answering | Feb 12, 2025 | Extractive Question-AnsweringLanguage Modeling | —Unverified | 0 |
| LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search | Feb 12, 2025 | Feature EngineeringGraph Learning | —Unverified | 0 |
| E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection | Feb 12, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation | Feb 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TANTE: Time-Adaptive Operator Learning via Neural Taylor Expansion | Feb 12, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification | Feb 12, 2025 | DecoderDescriptive | CodeCode Available | 2 |
| SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence | Feb 12, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Contextual Subspace Manifold Projection for Structural Refinement of Large Language Model Representations | Feb 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model | Feb 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples | Feb 12, 2025 | Distractor GenerationInformation Retrieval | —Unverified | 0 |
| LLM Pretraining with Continuous Concepts | Feb 12, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval | Feb 12, 2025 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| AI-VERDE: A Gateway for Egalitarian Access to Large Language Model-Based Resources For Educational Institutions | Feb 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MetaSC: Test-Time Safety Specification Optimization for Language Models | Feb 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification | Feb 11, 2025 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model | Feb 11, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems | Feb 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata | Feb 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Small Language Model Makes an Effective Long Text Extractor | Feb 11, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization | Feb 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RomanLens: Latent Romanization and its role in Multilinguality in LLMs | Feb 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | DecoderInformation Retrieval | CodeCode Available | 0 |
| Auditing Prompt Caching in Language Model APIs | Feb 11, 2025 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Implicit Language Models are RNNs: Balancing Parallelization and Expressivity | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AppVLM: A Lightweight Vision Language Model for Online App Control | Feb 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| K-ON: Stacking Knowledge On the Head Layer of Large Language Model | Feb 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Feb 10, 2025 | Hierarchical Reinforcement LearningLanguage Modeling | CodeCode Available | 4 |
| Recent Advances in Discrete Speech Tokens: A Review | Feb 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation | Feb 10, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE | Feb 10, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Rationalization Models for Text-to-SQL | Feb 10, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| μnit Scaling: Simple and Scalable FP8 LLM Training | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Feb 9, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 0 |
| Investigating Compositional Reasoning in Time Series Foundation Models | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform | Feb 9, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Effective Black-Box Multi-Faceted Attacks Breach Vision Large Language Model Guardrails | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enabling Autoregressive Models to Fill In Masked Tokens | Feb 9, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education | Feb 9, 2025 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ScaffoldGPT: A Scaffold-based GPT Model for Drug Optimization | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control | Feb 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative Gastrointestinal Cancer Care | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and Understanding | Feb 8, 2025 | DenoisingImage Generation | CodeCode Available | 1 |
| IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System | Feb 8, 2025 | DecoderLanguage Modeling | CodeCode Available | 11 |
| Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging | Feb 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |