| ResNetVLLM-2: Addressing ResNetVLLM's Multi-Modal Hallucinations | Apr 20, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| Automatic Text Summarization (ATS) for Research Documents in Sorani Kurdish | Apr 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| OmniV-Med: Scaling Medical Vision-Language Model for Universal Visual Understanding | Apr 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ResNetVLLM -- Multi-modal Vision LLM for the Video Understanding Task | Apr 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts | Apr 19, 2025 | Conversational Question Answering, Language Modeling | Unverified | 0 |
| SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation | Apr 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model Enhanced Particle Swarm Optimization for Hyperparameter Tuning for Deep Learning Models | Apr 19, 2025 | Deep Learning, Language Modeling | Unverified | 0 |
| A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling | Apr 19, 2025 | Diversity, Image Retrieval | Unverified | 0 |
| Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management | Apr 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| PV-VLM: A Multimodal Vision-Language Approach Incorporating Sky Images for Intra-Hour Photovoltaic Power Forecasting | Apr 18, 2025 | energy management, Language Modeling | Unverified | 0 |
| System of Agentic AI for the Discovery of Metal-Organic Frameworks | Apr 18, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Apr 18, 2025 | Anomaly Detection, Autonomous Driving | Code Available | 0 |
| A mean teacher algorithm for unlearning of language models | Apr 18, 2025 | Continual Learning, Language Modeling | Code Available | 0 |
| Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Apr 18, 2025 | image-classification, Image Classification | Unverified | 0 |
| Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation | Apr 18, 2025 | Anomaly Segmentation, Language Modeling | Code Available | 0 |
| RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines | Apr 18, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Bayes | Apr 18, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Baseline for Self-state Identification and Classification in Mental Health Data: CLPsych 2025 Task | Apr 18, 2025 | Attribute, Binary Classification | Unverified | 0 |
| Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization | Apr 18, 2025 | Action Localization, Anomaly Detection | Unverified | 0 |
| DIDS: Domain Impact-aware Data Sampling for Large Language Model Training | Apr 17, 2025 | Dimensionality Reduction, Language Modeling | Unverified | 0 |
| Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification | Apr 17, 2025 | Diversity, Gaussian Processes | Code Available | 0 |
| It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization | Apr 17, 2025 | All, Language Modeling | Unverified | 0 |
| Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge | Apr 17, 2025 | Knowledge Graphs, Language Modeling | Unverified | 0 |
| VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture | Apr 17, 2025 | Federated Learning, Language Modeling | Unverified | 0 |
| Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training | Apr 17, 2025 | Caption Generation, Hallucination | Unverified | 0 |
| Energy-Based Reward Models for Robust Language Model Alignment | Apr 17, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training | Apr 17, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images | Apr 17, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback | Apr 17, 2025 | All, Language Modeling | Unverified | 0 |
| DVLTA-VQA: Decoupled Vision-Language Modeling with Text-Guided Adaptation for Blind Video Quality Assessment | Apr 16, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Mixer Metaphors: audio interfaces for non-musical applications | Apr 16, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| BitNet b1.58 2B4T Technical Report | Apr 16, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach | Apr 16, 2025 | Diversity, Language Modeling | Unverified | 0 |
| Towards Conversational AI for Human-Machine Collaborative MLOps | Apr 16, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions | Apr 16, 2025 | Benchmarking, Language Modeling | Unverified | 0 |
| Generative Recommendation with Continuous-Token Diffusion | Apr 16, 2025 | Denoising, Language Modeling | Unverified | 0 |
| d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning | Apr 16, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Interpreting the linear structure of vision-language model embedding spaces | Apr 16, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Higher-Order Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions | Apr 16, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ReZero: Enhancing LLM search ability by trying one-more-time | Apr 15, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation | Apr 15, 2025 | Diagnostic, Image Segmentation | Code Available | 0 |
| Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning | Apr 15, 2025 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance | Apr 15, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis | Apr 15, 2025 | Decoder, Language Modeling | Unverified | 0 |
| ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings | Apr 15, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content | Apr 15, 2025 | Diversity, Language Modeling | Unverified | 0 |
| Looking beyond the next token | Apr 15, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Recommending Clinical Trials for Online Patient Cases using Artificial Intelligence | Apr 15, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Co-STAR: Collaborative Curriculum Self-Training with Adaptive Regularization for Source-Free Video Domain Adaptation | Apr 15, 2025 | Domain Adaptation, Language Modeling | Code Available | 0 |
| A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports | Apr 15, 2025 | Language Modeling, Language Modelling | Unverified | 0 |