MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO May 19, 2025 Decoder Image Generation
Code Code Available 0R1dacted: Investigating Local Censorship in DeepSeek's R1 Language Model May 19, 2025 Language Modeling Language Modelling
— Unverified 0The Traitors: Deception and Trust in Multi-Agent Language Model Simulations May 19, 2025 Language Modeling Language Modelling
Code Code Available 0Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice May 19, 2025 All Hallucination
— Unverified 0Sat2Sound: A Unified Framework for Zero-Shot Soundscape Mapping May 19, 2025 Contrastive Learning Cross-Modal Retrieval
— Unverified 0SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence May 19, 2025 Language Modeling Language Modelling
Code Code Available 0Krikri: Advancing Open Large Language Models for Greek May 19, 2025 Code Generation Language Modeling
— Unverified 0TinyAlign: Boosting Lightweight Vision-Language Models by Mitigating Modal Alignment Bottlenecks May 19, 2025 Language Modeling Language Modelling
— Unverified 0On the Thinking-Language Modeling Gap in Large Language Models May 19, 2025 Language Modeling Language Modelling
— Unverified 0ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling May 19, 2025 Graph Generation Knowledge Distillation
— Unverified 0SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models May 19, 2025 Causal Inference Decision Making
— Unverified 0Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding May 19, 2025 Language Modeling Language Modelling
Code Code Available 0Structure-Aware Corpus Construction and User-Perception-Aligned Metrics for Large-Language-Model Code Completion May 19, 2025 Code Completion Language Modeling
— Unverified 0ReSW-VL: Representation Learning for Surgical Workflow Analysis Using Vision-Language Model May 19, 2025 Language Modeling Language Modelling
— Unverified 0Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation May 19, 2025 Knowledge Distillation Language Modeling
— Unverified 0VocalAgent: Large Language Models for Vocal Health Diagnostics with Safety-Aware Evaluation May 19, 2025 Diagnostic Language Modeling
— Unverified 0VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection May 19, 2025 Autonomous Driving Language Modeling
— Unverified 0CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling May 19, 2025 Decoder Language Modeling
— Unverified 0A Physics-Inspired Optimizer: Velocity Regularized Adam May 19, 2025 image-classification Image Classification
— Unverified 0IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment May 19, 2025 Language Modeling Language Modelling
— Unverified 0CIE: Controlling Language Model Text Generations Using Continuous Signals May 19, 2025 continuous-control Continuous Control
Code Code Available 0Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation May 19, 2025 Language Modeling Language Modelling
— Unverified 0A*-Decoding: Token-Efficient Inference Scaling May 19, 2025 Language Modeling Language Modelling
— Unverified 0CALM: Co-evolution of Algorithms and Language Model for Automatic Heuristic Design May 18, 2025 GPU Language Modeling
— Unverified 0Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems May 18, 2025 Computational Efficiency Language Modeling
— Unverified 0From n-gram to Attention: How Model Architectures Learn and Propagate Bias in Language Modeling May 18, 2025 Language Modeling Language Modelling
— Unverified 0Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-training May 18, 2025 Contrastive Learning In-Context Learning
Code Code Available 0DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design May 18, 2025 Language Modeling Language Modelling
— Unverified 0Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations May 18, 2025 Language Modeling Language Modelling
Code Code Available 0mCLM: A Function-Infused and Synthesis-Friendly Modular Chemical Language Model May 18, 2025 Language Modeling Language Modelling
— Unverified 0NeuroGen: Neural Network Parameter Generation via Large Language Models May 18, 2025 Language Modeling Language Modelling
— Unverified 0Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering May 18, 2025 Language Modeling Language Modelling
— Unverified 0Self-Destructive Language Model May 18, 2025 Language Modeling Language Modelling
— Unverified 0LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems May 18, 2025 Language Modeling Language Modelling
— Unverified 0SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment May 18, 2025 Language Modeling Language Modelling
— Unverified 0SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation May 17, 2025 Code Generation Language Modeling
— Unverified 0TinyRS-R1: Compact Multimodal Language Model for Remote Sensing May 17, 2025 Language Modeling Language Modelling
— Unverified 0Recursive Question Understanding for Complex Question Answering over Heterogeneous Personal Data May 17, 2025 Language Modeling Language Modelling
— Unverified 0Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features May 17, 2025 Language Modeling Language Modelling
Code Code Available 0PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging May 17, 2025 Image Segmentation Language Modeling
— Unverified 0LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades May 17, 2025 Language Modeling Language Modelling
— Unverified 0Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission May 17, 2025 Language Modeling Language Modelling
— Unverified 0An Explanation of Intrinsic Self-Correction via Linear Representations and Latent Concepts May 17, 2025 Concept Alignment Language Modeling
— Unverified 0CorBenchX: Large-Scale Chest X-Ray Error Dataset and Vision-Language Model Benchmark for Report Error Correction May 17, 2025 Language Modeling Language Modelling
— Unverified 0Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors May 17, 2025 counterfactual Instruction Following
Code Code Available 0Chain-of-Model Learning for Language Model May 17, 2025 Language Modeling Language Modelling
Code Code Available 0Efficiently Building a Domain-Specific Large Language Model from Scratch: A Case Study of a Classical Chinese Large Language Model May 17, 2025 Language Modeling Language Modelling
— Unverified 0Maximizing Asynchronicity in Event-based Neural Networks May 16, 2025 Event-based vision Language Modeling
— Unverified 0Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline May 16, 2025 Abstractive Text Summarization Language Modeling
Code Code Available 0On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating May 16, 2025 Language Modeling Language Modelling
— Unverified 0