| Prompt-Guided Turn-Taking Prediction | Jun 26, 2025 | Language Modeling | Unverified | 0 |
| World-aware Planning Narratives Enhance Large Vision-Language Model Planner | Jun 26, 2025 | Imitation Learning, Language Modeling | Unverified | 0 |
| V2X-REALM: Vision-Language Model-Based Robust End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling | Jun 26, 2025 | Autonomous Driving, Contrastive Learning | Unverified | 0 |
| Large Language Model-Driven Code Compliance Checking in Building Information Modeling | Jun 25, 2025 | Language Modeling | Unverified | 0 |
| GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Jun 25, 2025 | Language Modeling | Code Available | 1 |
| Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content | Jun 25, 2025 | Articles, Continual Pretraining | Unverified | 0 |
| OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling | Jun 25, 2025 | Language Modeling | Code Available | 2 |
| Towards Community-Driven Agents for Machine Learning Engineering | Jun 25, 2025 | Language Modeling | Code Available | 0 |
| SEED: A Structural Encoder for Embedding-Driven Decoding in Time Series Prediction with LLMs | Jun 25, 2025 | Language Modeling | Unverified | 0 |
| A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error Detection | Jun 25, 2025 | Language Modeling | Code Available | 0 |
| Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios | Jun 25, 2025 | Autonomous Driving, Decision Making | Unverified | 0 |
| Automatic Demonstration Selection for LLM-based Tabular Data Classification | Jun 25, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| Enterprise Large Language Model Evaluation Benchmark | Jun 25, 2025 | Language Model Evaluation, Language Modeling | Unverified | 0 |
| Language Modeling by Language Models | Jun 25, 2025 | Code Generation, Language Modeling | Code Available | 2 |
| AALC: Large Language Model Efficient Reasoning via Adaptive Accuracy-Length Control | Jun 25, 2025 | Language Modeling | Code Available | 0 |
| Narrative Shift Detection: A Hybrid Approach of Dynamic Topic Models and Large Language Models | Jun 25, 2025 | Articles, Change Point Detection | Code Available | 0 |
| PARALLELPROMPT: Extracting Parallelism from Large Language Model Queries | Jun 23, 2025 | Language Modeling | Unverified | 0 |
| A Large Language Model-based Multi-Agent Framework for Analog Circuits' Sizing Relationships Extraction | Jun 23, 2025 | Language Modeling | Unverified | 0 |
| GradualDiff-Fed: A Federated Learning Specialized Framework for Large Language Model | Jun 23, 2025 | Federated Learning, Language Modeling | Unverified | 0 |
| AdapThink: Adaptive Thinking Preferences for Reasoning Language Model | Jun 23, 2025 | Diversity, Language Modeling | Unverified | 0 |
| Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection | Jun 23, 2025 | Language Modeling | Unverified | 0 |
| ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Jun 22, 2025 | GPU, Image Generation | Code Available | 3 |
| Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster | Jun 22, 2025 | Decoder, Image Segmentation | Code Available | 2 |
| Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms | Jun 22, 2025 | Language Modeling | Unverified | 0 |
| Research on Model Parallelism and Data Parallelism Optimization Methods in Large Language Model-Based Recommendation Systems | Jun 21, 2025 | Language Modeling | Unverified | 0 |
| Reflective Verbal Reward Design for Pluralistic Alignment | Jun 21, 2025 | Language Modeling | Unverified | 0 |
| Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning? | Jun 21, 2025 | Language Modeling | Unverified | 0 |
| Challenges in Grounding Language in the Real World | Jun 20, 2025 | Language Modeling | Unverified | 0 |
| Computational Approaches to Understanding Large Language Model Impact on Writing and Information Ecosystems | Jun 20, 2025 | Language Modeling | Unverified | 0 |
| LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization | Jun 20, 2025 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| LLMs in Coding and their Impact on the Commercial Software Engineering Landscape | Jun 19, 2025 | Language Modeling | Unverified | 0 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Jun 19, 2025 | Language Modeling | Code Available | 1 |
| Watermarking Autoregressive Image Generation | Jun 19, 2025 | Image Generation, Language Modeling | Code Available | 2 |
| From RAG to Agentic: Validating Islamic-Medicine Responses with LLM Agents | Jun 18, 2025 | Language Modeling | Unverified | 0 |
| Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks | Jun 18, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments | Jun 18, 2025 | Language Modeling | Code Available | 0 |
| Show-o2: Improved Native Unified Multimodal Models | Jun 18, 2025 | Language Modeling | Code Available | 5 |
| Finance Language Model Evaluation (FLaME) | Jun 18, 2025 | Benchmarking, Language Model Evaluation | Unverified | 0 |
| BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models | Jun 17, 2025 | Benchmarking, Language Modeling | Code Available | 2 |
| Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition | Jun 17, 2025 | Data Augmentation, Language Modeling | Unverified | 0 |
| Hypothesis Testing for Quantifying LLM-Human Misalignment in Multiple Choice Settings | Jun 17, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| Lightweight Relevance Grader in RAG | Jun 17, 2025 | Language Modeling | Code Available | 0 |
| Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning | Jun 17, 2025 | Language Modeling | Unverified | 0 |
| From Bytes to Ideas: Language Modeling with Autoregressive U-Nets | Jun 17, 2025 | Language Modeling | Code Available | 7 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Jun 17, 2025 | Language Modeling | Code Available | 0 |
| ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | Jun 17, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion | Jun 17, 2025 | Denoising, Image Generation | Unverified | 0 |
| Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees | Jun 17, 2025 | Code Translation, HumanEval | Unverified | 0 |
| Interpreting Biomedical VLMs on High-Imbalance Out-of-Distributions: An Insight into BiomedCLIP on Radiology | Jun 17, 2025 | Language Modeling | Code Available | 0 |
| RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Jun 17, 2025 | Answer Generation, Language Modeling | Code Available | 1 |