| Parameterized Synthetic Text Generation with SimpleStories | Apr 12, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models | Apr 11, 2025 | ClusteringLanguage Modeling | CodeCode Available | 2 |
| Large Language Model Empowered Recommendation Meets All-domain Continual Pre-Training | Apr 11, 2025 | AllLanguage Modeling | —Unverified | 0 |
| Spatial Audio Processing with Large Language Model on Wearable Devices | Apr 11, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AstroLLaVA: towards the unification of astronomical data and natural language | Apr 11, 2025 | AstronomyImage Captioning | —Unverified | 0 |
| ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation | Apr 11, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling | Apr 11, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| MedRep: Medical Concept Representation for General Electronic Health Record Foundation Models | Apr 11, 2025 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting | Apr 11, 2025 | GPULanguage Modeling | —Unverified | 0 |
| TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning | Apr 11, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| EO-VLM: VLM-Guided Energy Overload Attacks on Vision Models | Apr 11, 2025 | Autonomous DrivingGPU | —Unverified | 0 |
| Data Metabolism: An Efficient Data Design Schema For Vision Language Model | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| JEPA4Rec: Learning Effective Language Representations for Sequential Recommendation via Joint Embedding Predictive Architecture | Apr 10, 2025 | Common Sense ReasoningDescriptive | —Unverified | 0 |
| Investigating Vision-Language Model for Point Cloud-based Vehicle Classification | Apr 10, 2025 | Autonomous DrivingClassification | —Unverified | 0 |
| An LLM-Driven Multi-Agent Debate System for Mendelian Diseases | Apr 10, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts | Apr 10, 2025 | Graph GenerationLanguage Modeling | —Unverified | 0 |
| VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model | Apr 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models | Apr 10, 2025 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Synthetic Fluency: Hallucinations, Confabulations, and the Creation of Irish Words in LLM-Generated Translations | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| DeepGreen: Effective LLM-Driven Green-washing Monitoring System Designed for Empirical Testing -- Evidence from China | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Token Level Routing Inference System for Edge Devices | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens | Apr 9, 2025 | Fact CheckingHallucination | —Unverified | 0 |
| The Method for Storing Patterns in Neural Networks-Memorization and Recall of QR code Patterns- | Apr 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts | Apr 9, 2025 | Dialogue EvaluationLanguage Modeling | CodeCode Available | 0 |
| A Multi-Phase Analysis of Blood Culture Stewardship: Machine Learning Prediction, Expert Recommendation Assessment, and LLM Automation | Apr 9, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Language Modeling for the Future of Finance: A Quantitative Survey into Metrics, Tasks, and Data Opportunities | Apr 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Apr 9, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 0 |
| PAYADOR: A Minimalist Approach to Grounding Language Models on Structured Data for Interactive Storytelling and Role-playing Games | Apr 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model | Apr 9, 2025 | Image Quality AssessmentImage Restoration | —Unverified | 0 |
| TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling | Apr 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Societal Impacts Research Requires Benchmarks for Creative Composition Tasks | Apr 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought | Apr 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Control | Apr 8, 2025 | energy managementLanguage Modeling | —Unverified | 0 |
| Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases | Apr 8, 2025 | Data IntegrationLanguage Modeling | —Unverified | 0 |
| DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation | Apr 7, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness | Apr 7, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Towards Visual Text Grounding of Multimodal Large Language Model | Apr 7, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling | Apr 7, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Apr 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| 'Neural howlround' in large language models: a self-reinforcing bias phenomenon, and a dynamic attenuation solution | Apr 7, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| A Taxonomy of Self-Handover | Apr 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Dream Within Huang Long Cave: AI-Driven Interactive Narrative for Family Storytelling and Emotional Reflection | Apr 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model (LLM) for Software Security: Code Analysis, Malware Analysis, Reverse Engineering | Apr 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization | Apr 6, 2025 | BenchmarkingCombinatorial Optimization | CodeCode Available | 1 |
| DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation | Apr 6, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression | Apr 6, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 0 |
| ZeroED: Hybrid Zero-shot Error Detection through Large Language Model Reasoning | Apr 6, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |