| Improving Significant Wave Height Prediction Using Chronos Models | Apr 23, 2025 | Computational Efficiency, Language Modeling | Unverified | 0 |
| LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement | Apr 22, 2025 | Benchmarking, Language Modeling | Code Available | 1 |
| FaceInsight: A Multimodal Large Language Model for Face Perception | Apr 22, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhancing TCR-Peptide Interaction Prediction with Pretrained Language Models and Molecular Representations | Apr 22, 2025 | Benchmarking, Few-Shot Learning | Unverified | 0 |
| Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 | Apr 22, 2025 | Benchmarking, Language Modeling | Unverified | 0 |
| Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models | Apr 22, 2025 | Anomaly Detection, Computational Efficiency | Unverified | 0 |
| DATETIME: A new benchmark to measure LLM translation and reasoning capabilities | Apr 22, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| LLMs meet Federated Learning for Scalable and Secure IoT Management | Apr 22, 2025 | Computational Efficiency, Decision Making | Unverified | 0 |
| What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns | Apr 22, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Do It For Me vs. Do It With Me: Investigating User Perceptions of Different Paradigms of Automation in Copilots for Feature-Rich Software | Apr 22, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model Empowered Privacy-Protected Framework for PHI Annotation in Clinical Notes | Apr 22, 2025 | De-identification, Language Modeling | Unverified | 0 |
| LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning | Apr 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Speculative Sampling via Exponential Races | Apr 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Virology Capabilities Test (VCT): A Multimodal Virology Q&A Benchmark | Apr 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| RepliBench: Evaluating the Autonomous Replication Capabilities of Language Model Agents | Apr 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models | Apr 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions | Apr 21, 2025 | Ethics, Language Modeling | Unverified | 0 |
| Kuwain 1.5B: An Arabic SLM via Language Injection | Apr 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Automatic Text Summarization (ATS) for Research Documents in Sorani Kurdish | Apr 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| OmniV-Med: Scaling Medical Vision-Language Model for Universal Visual Understanding | Apr 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ResNetVLLM-2: Addressing ResNetVLLM's Multi-Modal Hallucinations | Apr 20, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines | Apr 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ResNetVLLM -- Multi-modal Vision LLM for the Video Understanding Task | Apr 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts | Apr 19, 2025 | Conversational Question Answering, Language Modeling | Unverified | 0 |
| Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management | Apr 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation | Apr 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling | Apr 19, 2025 | Diversity, Image Retrieval | Unverified | 0 |
| Large Language Model Enhanced Particle Swarm Optimization for Hyperparameter Tuning for Deep Learning Models | Apr 19, 2025 | Deep Learning, Language Modeling | Unverified | 0 |
| Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations | Apr 19, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Large Language Bayes | Apr 18, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| System of Agentic AI for the Discovery of Metal-Organic Frameworks | Apr 18, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Baseline for Self-state Identification and Classification in Mental Health Data: CLPsych 2025 Task | Apr 18, 2025 | Attribute, Binary Classification | Unverified | 0 |
| Learning to Attribute with Attention | Apr 18, 2025 | Attribute, Language Modeling | Code Available | 1 |
| A mean teacher algorithm for unlearning of language models | Apr 18, 2025 | Continual Learning, Language Modeling | Code Available | 0 |
| Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Apr 18, 2025 | image-classification, Image Classification | Unverified | 0 |
| PV-VLM: A Multimodal Vision-Language Approach Incorporating Sky Images for Intra-Hour Photovoltaic Power Forecasting | Apr 18, 2025 | energy management, Language Modeling | Unverified | 0 |
| Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Apr 18, 2025 | Anomaly Detection, Autonomous Driving | Code Available | 0 |
| RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines | Apr 18, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation | Apr 18, 2025 | Anomaly Segmentation, Language Modeling | Code Available | 0 |
| Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization | Apr 18, 2025 | Action Localization, Anomaly Detection | Unverified | 0 |
| VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture | Apr 17, 2025 | Federated Learning, Language Modeling | Unverified | 0 |
| Energy-Based Reward Models for Robust Language Model Alignment | Apr 17, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| DIDS: Domain Impact-aware Data Sampling for Large Language Model Training | Apr 17, 2025 | Dimensionality Reduction, Language Modeling | Unverified | 0 |
| It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization | Apr 17, 2025 | All, Language Modeling | Unverified | 0 |
| Perception Encoder: The best visual embeddings are not at the output of the network | Apr 17, 2025 | Depth Estimation, Language Modeling | Code Available | 8 |
| Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback | Apr 17, 2025 | All, Language Modeling | Unverified | 0 |
| Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training | Apr 17, 2025 | Caption Generation, Hallucination | Unverified | 0 |
| Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge | Apr 17, 2025 | Knowledge Graphs, Language Modeling | Unverified | 0 |
| ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images | Apr 17, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Chinese-Vicuna: A Chinese Instruction-following Llama-based Model | Apr 17, 2025 | Code Generation, CPU | Code Available | 7 |