| MLorc: Momentum Low-rank Compression for Large Language Model Adaptation | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Jun 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning | Jun 2, 2025 | Fact VerificationLanguage Modeling | CodeCode Available | 2 |
| Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing | Jun 1, 2025 | Document AIdocument understanding | CodeCode Available | 0 |
| HouseTS: A Large-Scale, Multimodal Spatiotemporal U.S. Housing Dataset | Jun 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer | Jun 1, 2025 | Audio captioningLanguage Modeling | —Unverified | 0 |
| EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | Jun 1, 2025 | Contrastive LearningDecoder | —Unverified | 0 |
| Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | Jun 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GigaAM: Efficient Self-Supervised Learner for Speech Recognition | Jun 1, 2025 | Automatic Speech RecognitionLanguage Modeling | CodeCode Available | 4 |
| NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction | Jun 1, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| A Large Language Model-Supported Threat Modeling Framework for Transportation Cyber-Physical Systems | Jun 1, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | May 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings | May 30, 2025 | ArticlesClustering | —Unverified | 0 |
| MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Drop Dropout on Single-Epoch Language Model Pretraining | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization | May 30, 2025 | FormLanguage Modeling | —Unverified | 0 |
| Transformers Are Universally Consistent | May 30, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Circuit Stability Characterizes Language Model Generalization | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Interpreting Large Text-to-Image Diffusion Models with Dictionary Learning | May 30, 2025 | Dictionary LearningImage Generation | CodeCode Available | 0 |
| From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning | May 30, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL | May 30, 2025 | Image GenerationLanguage Modeling | CodeCode Available | 2 |
| Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CREFT: Sequential Multi-Agent LLM for Character Relation Extraction | May 30, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis | May 30, 2025 | DiversityLanguage Modeling | CodeCode Available | 0 |
| HardTests: Synthesizing High-Quality Test Cases for LLM Coding | May 30, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Probing the Robustness Properties of Neural Speech Codecs | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| GradPower: Powering Gradients for Faster Language Model Pre-Training | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How much do language models memorize? | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Intuitionistic Fuzzy Sets for Large Language Model Data Annotation: A Novel Approach to Side-by-Side Preference Labeling | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | May 30, 2025 | ClassificationDisaster Response | CodeCode Available | 2 |
| FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation | May 30, 2025 | DiagnosticLanguage Model Evaluation | CodeCode Available | 0 |
| Reducing Latency in LLM-Based Natural Language Commands Processing for Robot Navigation | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model | May 29, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Hidden Persuasion: Detecting Manipulative Narratives on Social Media During the 2022 Russian Invasion of Ukraine | May 29, 2025 | Binary ClassificationClassification | —Unverified | 0 |
| Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Actor-Critic based Online Data Mixing For Language Model Pre-Training | May 29, 2025 | HumanEvalLanguage Modeling | —Unverified | 0 |
| Large Language Model Meets Constraint Propagation | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Position: Federated Foundation Language Model Post-Training Should Focus on Open-Source Models | May 29, 2025 | Federated LearningLanguage Modeling | —Unverified | 0 |