| MLorc: Momentum Low-rank Compression for Large Language Model Adaptation | Jun 2, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Jun 2, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning | Jun 2, 2025 | Fact Verification, Language Modeling | Code Available | 2 |
| Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing | Jun 1, 2025 | Document AI, document understanding | Code Available | 0 |
| HouseTS: A Large-Scale, Multimodal Spatiotemporal U.S. Housing Dataset | Jun 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction | Jun 1, 2025 | Decoder, Language Modeling | Unverified | 0 |
| EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | Jun 1, 2025 | Contrastive Learning, Decoder | Unverified | 0 |
| CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer | Jun 1, 2025 | Audio captioning, Language Modeling | Unverified | 0 |
| Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | Jun 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| GigaAM: Efficient Self-Supervised Learner for Speech Recognition | Jun 1, 2025 | Automatic Speech Recognition, Language Modeling | Code Available | 4 |
| A Large Language Model-Supported Threat Modeling Framework for Transportation Cyber-Physical Systems | Jun 1, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems | May 31, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | May 31, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation | May 31, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | May 31, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings | May 30, 2025 | Articles, Clustering | Unverified | 0 |
| Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation | May 30, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | May 30, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform | May 30, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | May 30, 2025 | Classification, Disaster Response | Code Available | 2 |
| ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL | May 30, 2025 | Image Generation, Language Modeling | Code Available | 2 |
| From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning | May 30, 2025 | Diversity, Language Modeling | Unverified | 0 |
| Transformers Are Universally Consistent | May 30, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| GradPower: Powering Gradients for Faster Language Model Pre-Training | May 30, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Drop Dropout on Single-Epoch Language Model Pretraining | May 30, 2025 | Language Modeling, Language Modelling | Code Available | 0 |