| Customizing Speech Recognition Model with Large Language Model Feedback | Jun 5, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Zeroth-Order Optimization Finds Flat Minima | Jun 5, 2025 | Binary ClassificationLanguage Modeling | —Unverified | 0 |
| Improving Low-Resource Morphological Inflection via Self-Supervised Objectives | Jun 5, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Unleashing Hour-Scale Video Training for Long Video-Language Understanding | Jun 5, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| E-bike agents: Large Language Model-Driven E-Bike Accident Analysis and Severity Prediction | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training | Jun 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Accelerated Test-Time Scaling with Model-Free Speculative Sampling | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ConECT Dataset: Overcoming Data Scarcity in Context-Aware E-Commerce MT | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MesaNet: Sequence Modeling by Locally Optimal Test-Time Training | Jun 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial-Ground Robotic System | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model | Jun 5, 2025 | Instance SegmentationLanguage Modeling | CodeCode Available | 1 |
| HoliSafe: Holistic Safety Benchmarking and Modeling with Safety Meta Token for Vision-Language Model | Jun 5, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models | Jun 5, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The NTNU System at the S&I Challenge 2025 SLA Open Track | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Handle-based Mesh Deformation Guided By Vision Language Model | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sparse Autoencoders, Again? | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Robust Few-Shot Vision-Language Model Adaptation | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Clustering and Median Aggregation Improve Differentially Private Inference | Jun 5, 2025 | ClusteringLanguage Modeling | —Unverified | 0 |
| Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion | Jun 5, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Go-Browse: Training Web Agents with Structured Exploration | Jun 4, 2025 | Efficient ExplorationLanguage Modeling | —Unverified | 0 |
| Debate, Reflect, and Distill: Multi-Agent Feedback with Tree-Structured Preference Optimization for Efficient Language Model Enhancement | Jun 4, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Phi-Omni-ST: A multimodal language model for direct speech-to-speech translation | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | Jun 4, 2025 | counterfactualEconometrics | —Unverified | 0 |
| POSS: Position Specialist Generates Better Draft for Speculative Decoding | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EuroLLM-9B: Technical Report | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rectified Sparse Attention | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Statistical Physics of Language Model Reasoning | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products | Jun 4, 2025 | image-classificationImage Classification | —Unverified | 0 |
| MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | Jun 4, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| LaF-GRPO: In-Situ Navigation Instruction Generation for the Visually Impaired via GRPO with LLM-as-Follower Reward | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions | Jun 4, 2025 | Data AugmentationDiversity | —Unverified | 0 |
| Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | Jun 3, 2025 | Data AugmentationInstruction Following | —Unverified | 0 |
| A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Jun 3, 2025 | Decision MakingDiagnostic | CodeCode Available | 3 |
| TaxAgent: How Large Language Model Designs Fiscal Policy | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Trajectory Prediction Meets Large Language Models: A Survey | Jun 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator | Jun 3, 2025 | Grammatical Error CorrectionGrammatical Error Detection | —Unverified | 0 |
| EALG: Evolutionary Adversarial Generation of Language Model-Guided Generators for Combinatorial Optimization | Jun 3, 2025 | Combinatorial OptimizationLanguage Modeling | —Unverified | 0 |
| TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models | Jun 3, 2025 | DecoderKnowledge Distillation | —Unverified | 0 |
| SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Text Compression: Evaluating Tokenizers Across Scales | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hybrid AI for Responsive Multi-Turn Online Conversations with Novel Dynamic Routing and Feedback Adaptation | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating the Impact of Word Informativeness on Speech Emotion Recognition | Jun 2, 2025 | Emotion RecognitionInformativeness | —Unverified | 0 |
| Why Gradients Rapidly Increase Near the End of Training | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Challenging Language Model Agents | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |