| Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts | Apr 19, 2025 | Conversational Question AnsweringLanguage Modeling | —Unverified | 0 |
| Manipulating Multimodal Agents via Cross-Modal Prompt Injection | Apr 19, 2025 | Large Language Model | —Unverified | 0 |
| Large Language Model Enhanced Particle Swarm Optimization for Hyperparameter Tuning for Deep Learning Models | Apr 19, 2025 | Deep LearningLanguage Modeling | —Unverified | 0 |
| Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management | Apr 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation | Apr 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FGMP: Fine-Grained Mixed-Precision Weight and Activation Quantization for Hardware-Accelerated LLM Inference | Apr 19, 2025 | Large Language ModelQuantization | —Unverified | 0 |
| PV-VLM: A Multimodal Vision-Language Approach Incorporating Sky Images for Intra-Hour Photovoltaic Power Forecasting | Apr 18, 2025 | energy managementLanguage Modeling | —Unverified | 0 |
| Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization | Apr 18, 2025 | Action LocalizationAnomaly Detection | —Unverified | 0 |
| High-Throughput LLM inference on Heterogeneous Clusters | Apr 18, 2025 | Large Language ModelScheduling | —Unverified | 0 |
| System of Agentic AI for the Discovery of Metal-Organic Frameworks | Apr 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods | Apr 18, 2025 | Large Language Model | —Unverified | 0 |
| Large Language Bayes | Apr 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Apr 18, 2025 | Anomaly DetectionAutonomous Driving | CodeCode Available | 0 |
| Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation | Apr 18, 2025 | Anomaly SegmentationLanguage Modeling | CodeCode Available | 0 |
| RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines | Apr 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scaling sparse feature circuit finding for in-context learning | Apr 18, 2025 | In-Context LearningLarge Language Model | —Unverified | 0 |
| ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images | Apr 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback | Apr 17, 2025 | AllLanguage Modeling | —Unverified | 0 |
| Causal-Copilot: An Autonomous Causal Analysis Agent | Apr 17, 2025 | Causal DiscoveryCausal Inference | —Unverified | 0 |
| Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks | Apr 17, 2025 | Epistemic ReasoningLarge Language Model | CodeCode Available | 0 |
| EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery | Apr 17, 2025 | Large Language ModelMulti-Task Learning | —Unverified | 0 |
| Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification | Apr 17, 2025 | DiversityGaussian Processes | CodeCode Available | 0 |
| DIDS: Domain Impact-aware Data Sampling for Large Language Model Training | Apr 17, 2025 | Dimensionality ReductionLanguage Modeling | —Unverified | 0 |
| Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge | Apr 17, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures | Apr 16, 2025 | CPUGPU | —Unverified | 0 |
| Position: The Most Expensive Part of an LLM should be its Training Data | Apr 16, 2025 | Large Language ModelPosition | —Unverified | 0 |
| Towards Conversational AI for Human-Machine Collaborative MLOps | Apr 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification | Apr 16, 2025 | Large Language ModelSentiment Analysis | —Unverified | 0 |
| Generative Recommendation with Continuous-Token Diffusion | Apr 16, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| BitNet b1.58 2B4T Technical Report | Apr 16, 2025 | Computational EfficiencyCPU | —Unverified | 0 |
| Mixer Metaphors: audio interfaces for non-musical applications | Apr 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach | Apr 16, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM | Apr 16, 2025 | Large Language ModelText-to-Video Generation | —Unverified | 0 |
| When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers | Apr 15, 2025 | Binary ClassificationDomain Generalization | —Unverified | 0 |
| The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections | Apr 15, 2025 | Large Language Model | —Unverified | 0 |
| Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content | Apr 15, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Video Summarization with Large Language Models | Apr 15, 2025 | Large Language ModelVideo Summarization | —Unverified | 0 |
| ReZero: Enhancing LLM search ability by trying one-more-time | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Recommending Clinical Trials for Online Patient Cases using Artificial Intelligence | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Be A Doctor: Searching for Effective Medical Agent Architectures | Apr 15, 2025 | AutoMLDiagnostic | —Unverified | 0 |
| GraphicBench: A Planning Benchmark for Graphic Design with Language Agents | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating cybersecurity incidents using large language models in latest-generation wireless networks | Apr 14, 2025 | Binary ClassificationData Poisoning | —Unverified | 0 |
| SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning | Apr 14, 2025 | Large Language ModelRAG | —Unverified | 0 |
| GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design | Apr 14, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| SUMART: SUMmARizing Translation from Wordy to Concise Expression | Apr 14, 2025 | Large Language ModelTranslation | —Unverified | 0 |
| Automated Testing of COBOL to Java Transformation | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mavors: Multi-granularity Video Representation for Multimodal Large Language Model | Apr 14, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |