| Assessing and Enhancing the Robustness of LLM-based Multi-Agent Systems Through Chaos Engineering | May 6, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Validating the Effectiveness of a Large Language Model-based Approach for Identifying Children's Development across Various Free Play Settings in Kindergarten | May 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Soft Best-of-n Sampling for Model Alignment | May 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Guided Encoder-Decoder Framework: Integrating Multiple Physical Models for Agricultural Ecosystem Modeling | May 5, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Leveraging Protein Language Model Embeddings for Catalytic Turnover Prediction of Adenylate Kinase Orthologs in a Low-Data Regime | May 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Automatic Proficiency Assessment in L2 English Learners | May 5, 2025 | Deep LearningLanguage Modeling | —Unverified | 0 |
| SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning | May 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices | May 5, 2025 | 4kLanguage Modeling | —Unverified | 0 |
| Radio: Rate-Distortion Optimization for Large Language Model Compression | May 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bielik 11B v2 Technical Report | May 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Technical Report: Evaluating Goal Drift in Language Model Agents | May 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Chemical Reaction and Retrosynthesis Prediction with Large Language Model and Dual-task Learning | May 5, 2025 | Drug DesignDrug Discovery | —Unverified | 0 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Large Language Model Partitioning for Low-Latency Inference at the Edge | May 5, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control | May 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution Alignment | May 5, 2025 | 3D Object RetrievalLanguage Modeling | CodeCode Available | 0 |
| Bielik v3 Small: Technical Report | May 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models | May 4, 2025 | AttributeHallucination | —Unverified | 0 |
| R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation | May 4, 2025 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| MemEngine: A Unified and Modular Library for Developing Advanced Memory of LLM-based Agents | May 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| DNAZEN: Enhanced Gene Sequence Representations via Mixed Granularities of Coding Units | May 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What do Language Model Probabilities Represent? From Distribution Estimation to Response Prediction | May 4, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia Learning | May 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Vision and Intention Boost Large Language Model in Long-Term Action Anticipation | May 3, 2025 | Action AnticipationIn-Context Learning | —Unverified | 0 |
| Intra-Layer Recurrence in Transformers for Language Modeling | May 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Accelerating Large Language Model Reasoning via Speculative Search | May 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Facilitating Video Story Interaction with Multi-Agent Collaborative System | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | May 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| On the Limitations of Steering in Language Model Alignment | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Seeking to Collide: Online Safety-Critical Scenario Generation for Autonomous Driving with Retrieval Augmented Large Language Models | May 2, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Large Language Model-Driven Dynamic Assessment of Grammatical Accuracy in English Language Learner Writing | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Vision Language Model Adaptations for Radiology Report Generation in Low-Resource Languages | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PipeSpec: Breaking Stage Dependencies in Hierarchical LLM Decoding | May 2, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments | May 2, 2025 | Dataset GenerationLanguage Modeling | —Unverified | 0 |
| CodeSSM: Towards State Space Models for Code Understanding | May 2, 2025 | Clone DetectionLanguage Modeling | —Unverified | 0 |
| LLM Watermarking Using Mixtures and Statistical-to-Computational Gaps | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AdCare-VLM: Leveraging Large Vision Language Model (LVLM) to Monitor Long-Term Medication Adherence and Care | May 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Patchwork: A Unified Framework for RAG Serving | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Red Teaming Large Language Models for Healthcare | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KoACD: The First Korean Adolescent Dataset for Cognitive Distortion Analysis | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MoxE: Mixture of xLSTM Experts with Entropy-Aware Routing for Efficient Language Modeling | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension | May 1, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Data Therapist: Eliciting Domain Knowledge from Subject Matter Experts Using Large Language Models | May 1, 2025 | Data VisualizationLanguage Modeling | —Unverified | 0 |
| Visual Test-time Scaling for GUI Agent Grounding | May 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Multi-Granularity Retrieval Framework for Visually-Rich Documents | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Urban Air Mobility as a System of Systems: An LLM-Enhanced Holonic Approach | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Survey on Large Language Model based Human-Agent Systems | May 1, 2025 | Human Agent CollaborationLanguage Modeling | CodeCode Available | 0 |
| LLM-Based Threat Detection and Prevention Framework for IoT Ecosystems | May 1, 2025 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |