| LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization | Jun 20, 2025 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Challenges in Grounding Language in the Real World | Jun 20, 2025 | Language Modeling | Unverified | 0 |
| LLMs in Coding and their Impact on the Commercial Software Engineering Landscape | Jun 19, 2025 | Language Modeling | Unverified | 0 |
| Finance Language Model Evaluation (FLaME) | Jun 18, 2025 | Benchmarking, Language Model Evaluation | Unverified | 0 |
| From RAG to Agentic: Validating Islamic-Medicine Responses with LLM Agents | Jun 18, 2025 | Language Modeling | Unverified | 0 |
| Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks | Jun 18, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments | Jun 18, 2025 | Language Modeling | Code Available | 0 |
| Hypothesis Testing for Quantifying LLM-Human Misalignment in Multiple Choice Settings | Jun 17, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion | Jun 17, 2025 | Denoising, Image Generation | Unverified | 0 |
| Lightweight Relevance Grader in RAG | Jun 17, 2025 | Language Modeling | Code Available | 0 |
| Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition | Jun 17, 2025 | Data Augmentation, Language Modeling | Unverified | 0 |
| Interpreting Biomedical VLMs on High-Imbalance Out-of-Distributions: An Insight into BiomedCLIP on Radiology | Jun 17, 2025 | Language Modeling | Code Available | 0 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Jun 17, 2025 | Language Modeling | Code Available | 0 |
| ELI-Why: Evaluating the Pedagogical Utility of Language Model Explanations | Jun 17, 2025 | Language Modeling | Unverified | 0 |
| Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees | Jun 17, 2025 | Code Translation, HumanEval | Unverified | 0 |
| Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning | Jun 17, 2025 | Language Modeling | Unverified | 0 |
| ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | Jun 17, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue | Jun 16, 2025 | Diagnostic, Language Modeling | Unverified | 0 |
| Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR | Jun 16, 2025 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models | Jun 16, 2025 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Leveraging In-Context Learning for Language Model Agents | Jun 16, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems | Jun 16, 2025 | Decoder, Language Modeling | Unverified | 0 |
| GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining | Jun 16, 2025 | Denoising, Language Modeling | Unverified | 0 |
| Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization | Jun 16, 2025 | Causal Language Modeling, Instruction Following | Unverified | 0 |
| NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025 | Jun 16, 2025 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| EmoNews: A Spoken Dialogue System for Expressive News Conversations | Jun 16, 2025 | Language Modeling | Code Available | 0 |
| Value-Free Policy Optimization via Reward Partitioning | Jun 16, 2025 | Language Modeling | Code Available | 0 |
| CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model | Jun 16, 2025 | Decision Making, Financial Analysis | Unverified | 0 |
| VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation | Jun 16, 2025 | Data Visualization, Language Modeling | Code Available | 0 |
| SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation | Jun 15, 2025 | Language Modeling | Unverified | 0 |
| HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance | Jun 15, 2025 | Language Modeling | Unverified | 0 |
| From Human to Machine Psychology: A Conceptual Framework for Understanding Well-Being in Large Language Model | Jun 14, 2025 | Language Modeling | Unverified | 0 |
| Information fusion strategy integrating pre-trained language model and contrastive learning for materials knowledge mining | Jun 14, 2025 | Contrastive Learning, Language Modeling | Unverified | 0 |
| Is your batch size the problem? Revisiting the Adam-SGD gap in language modeling | Jun 14, 2025 | Language Modeling | Unverified | 0 |
| Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship in DeepSeek | Jun 14, 2025 | Language Modeling | Unverified | 0 |
| Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning | Jun 13, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| Improving Large Language Model Safety with Contrastive Representation Learning | Jun 13, 2025 | Language Modeling | Code Available | 0 |
| Taming Stable Diffusion for Computed Tomography Blind Super-Resolution | Jun 13, 2025 | Blind Super-Resolution, Computed Tomography (CT) | Unverified | 0 |
| FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations | Jun 13, 2025 | Fraud Detection, Language Modeling | Unverified | 0 |
| Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study | Jun 13, 2025 | Language Modeling | Unverified | 0 |
| Semantic Preprocessing for LLM-based Malware Analysis | Jun 13, 2025 | Language Modeling | Unverified | 0 |
| Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework | Jun 12, 2025 | Adversarial Attack, Diversity | Unverified | 0 |
| Automated Validation of Textual Constraints Against AutomationML via LLMs and SHACL | Jun 12, 2025 | Language Modeling | Code Available | 0 |
| DanceChat: Large Language Model-Guided Music-to-Dance Generation | Jun 12, 2025 | Language Modeling | Unverified | 0 |
| Discrete Audio Tokens: More Than a Survey! | Jun 12, 2025 | Language Modeling | Unverified | 0 |
| Slimming Down LLMs Without Losing Their Minds | Jun 12, 2025 | Computational Efficiency, GSM8K | Unverified | 0 |
| Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation | Jun 12, 2025 | Language Modeling | Unverified | 0 |
| MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | Jun 12, 2025 | CPU, GPU | Unverified | 0 |
| NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors | Jun 12, 2025 | Language Modeling | Code Available | 0 |
| Sequential-Parallel Duality in Prefix Scannable Models | Jun 12, 2025 | Language Modeling | Unverified | 0 |