| Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach | Apr 16, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks | Apr 16, 2025 | High-Level SynthesisLarge Language Model | CodeCode Available | 1 |
| Position: The Most Expensive Part of an LLM should be its Training Data | Apr 16, 2025 | Large Language ModelPosition | —Unverified | 0 |
| Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures | Apr 16, 2025 | CPUGPU | —Unverified | 0 |
| Towards Conversational AI for Human-Machine Collaborative MLOps | Apr 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Recommending Clinical Trials for Online Patient Cases using Artificial Intelligence | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GraphicBench: A Planning Benchmark for Graphic Design with Language Agents | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Video Summarization with Large Language Models | Apr 15, 2025 | Large Language ModelVideo Summarization | —Unverified | 0 |
| When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers | Apr 15, 2025 | Binary ClassificationDomain Generalization | —Unverified | 0 |
| Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content | Apr 15, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| ReZero: Enhancing LLM search ability by trying one-more-time | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning | Apr 15, 2025 | Automated Theorem ProvingLarge Language Model | CodeCode Available | 3 |
| Learning to Be A Doctor: Searching for Effective Medical Agent Architectures | Apr 15, 2025 | AutoMLDiagnostic | —Unverified | 0 |
| The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections | Apr 15, 2025 | Large Language Model | —Unverified | 0 |
| Evaluation Report on MCP Servers | Apr 15, 2025 | Large Language Model | CodeCode Available | 3 |
| Transferable text data distillation by trajectory matching | Apr 14, 2025 | ARCLarge Language Model | —Unverified | 0 |
| A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating cybersecurity incidents using large language models in latest-generation wireless networks | Apr 14, 2025 | Binary ClassificationData Poisoning | —Unverified | 0 |
| LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current Benchmarks | Apr 14, 2025 | Large Language ModelMachine Unlearning | CodeCode Available | 0 |
| Mavors: Multi-granularity Video Representation for Multimodal Large Language Model | Apr 14, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Apr 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LangPert: Detecting and Handling Task-level Perturbations for Robust Object Rearrangement | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automated Testing of COBOL to Java Transformation | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |