| Automated Testing of COBOL to Java Transformation | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes | Apr 14, 2025 | Image GenerationLarge Language Model | CodeCode Available | 1 |
| Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design | Apr 14, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning | Apr 14, 2025 | Large Language ModelRAG | —Unverified | 0 |
| GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SUMART: SUMmARizing Translation from Wordy to Concise Expression | Apr 14, 2025 | Large Language ModelTranslation | —Unverified | 0 |
| SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model | Apr 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges? | Apr 13, 2025 | Large Language Modelscientific discovery | —Unverified | 0 |
| CheatAgent: Attacking LLM-Empowered Recommender Systems via LLM Agent | Apr 13, 2025 | Large Language ModelRecommendation Systems | —Unverified | 0 |
| Kongzi: A Historical Large Language Model with Fact Enhancement | Apr 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model | Apr 13, 2025 | DiagnosticLanguage Modeling | CodeCode Available | 2 |
| Migrating Code At Scale With LLMs At Google | Apr 13, 2025 | Large Language Model | —Unverified | 0 |
| UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents | Apr 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AgentDynEx: Nudging the Mechanics and Dynamics of Multi-Agent Simulations | Apr 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents | Apr 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fine-tuning a Large Language Model for Automating Computational Fluid Dynamics Simulations | Apr 13, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Large Language Model Empowered Recommendation Meets All-domain Continual Pre-Training | Apr 11, 2025 | AllLanguage Modeling | —Unverified | 0 |
| Spatial Audio Processing with Large Language Model on Wearable Devices | Apr 11, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning | Apr 11, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| A Reproducibility Study of Graph-Based Legal Case Retrieval | Apr 11, 2025 | Information RetrievalLarge Language Model | —Unverified | 0 |
| SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting | Apr 11, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions | Apr 11, 2025 | Contrastive LearningImage Captioning | —Unverified | 0 |
| MedRep: Medical Concept Representation for General Electronic Health Record Foundation Models | Apr 11, 2025 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Playpen: An Environment for Exploring Learning Through Conversational Interaction | Apr 11, 2025 | Instruction FollowingLarge Language Model | CodeCode Available | 0 |
| AI-University: An LLM-based platform for instructional alignment to scientific classrooms | Apr 11, 2025 | Large Language ModelRAG | CodeCode Available | 0 |
| Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents | Apr 11, 2025 | Large Language Model | —Unverified | 0 |
| Variability-Driven User-Story Generation using LLM and Triadic Concept Analysis | Apr 11, 2025 | Large Language ModelStory Generation | —Unverified | 0 |
| Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment | Apr 10, 2025 | AI AgentAttribute | —Unverified | 0 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Apr 10, 2025 | GPULarge Language Model | CodeCode Available | 1 |
| Enhancing Player Enjoyment with a Two-Tier DRL and LLM-Based Agent System for Fighting Games | Apr 10, 2025 | Deep Reinforcement LearningGame Design | —Unverified | 0 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts | Apr 10, 2025 | Graph GenerationLanguage Modeling | —Unverified | 0 |
| Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric | Apr 10, 2025 | FairnessLarge Language Model | CodeCode Available | 1 |
| Synthetic Fluency: Hallucinations, Confabulations, and the Creation of Irish Words in LLM-Generated Translations | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs | Apr 10, 2025 | Large Language Model | —Unverified | 0 |
| Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents | Apr 10, 2025 | AI AgentLarge Language Model | —Unverified | 0 |
| DeepGreen: Effective LLM-Driven Green-washing Monitoring System Designed for Empirical Testing -- Evidence from China | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Token Level Routing Inference System for Edge Devices | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Method for Storing Patterns in Neural Networks-Memorization and Recall of QR code Patterns- | Apr 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts | Apr 9, 2025 | Dialogue EvaluationLanguage Modeling | CodeCode Available | 0 |
| MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Apr 9, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 0 |
| PAYADOR: A Minimalist Approach to Grounding Language Models on Structured Data for Interactive Storytelling and Role-playing Games | Apr 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Multi-Phase Analysis of Blood Culture Stewardship: Machine Learning Prediction, Expert Recommendation Assessment, and LLM Automation | Apr 9, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning | Apr 9, 2025 | Action Unit DetectionAge Estimation | —Unverified | 0 |
| Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model | Apr 9, 2025 | Image Quality AssessmentImage Restoration | —Unverified | 0 |
| SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness | Apr 8, 2025 | ChatbotExtractive Summarization | CodeCode Available | 0 |
| ARLO: A Tailorable Approach for Transforming Natural Language Software Requirements into Architecture using LLMs | Apr 8, 2025 | Large Language Model | —Unverified | 0 |
| Are Generative AI Agents Effective Personalized Financial Advisors? | Apr 8, 2025 | Large Language Model | CodeCode Available | 0 |
| InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Control | Apr 8, 2025 | energy managementLanguage Modeling | —Unverified | 0 |
| DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation | Apr 7, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |