| A Large Language Model-based Multi-Agent Framework for Analog Circuits' Sizing Relationships Extraction | Jun 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection | Jun 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Jun 23, 2025 | GPULarge Language Model | CodeCode Available | 2 |
| ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Jun 22, 2025 | GPUImage Generation | CodeCode Available | 3 |
| Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective | Jun 22, 2025 | In-Context LearningLarge Language Model | CodeCode Available | 1 |
| Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms | Jun 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster | Jun 22, 2025 | DecoderImage Segmentation | CodeCode Available | 2 |
| Mechanistic Interpretability in the Presence of Architectural Obfuscation | Jun 22, 2025 | Large Language ModelPrivacy Preserving | CodeCode Available | 0 |
| JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent | Jun 21, 2025 | Instruction FollowingLarge Language Model | —Unverified | 0 |
| DreamJourney: Perpetual View Generation with Video Diffusion Models | Jun 21, 2025 | Image to 3DLarge Language Model | —Unverified | 0 |
| Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning? | Jun 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Research on Model Parallelism and Data Parallelism Optimization Methods in Large Language Model-Based Recommendation Systems | Jun 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Programmable-Room: Interactive Textured 3D Room Meshes Generation Empowered by Large Language Models | Jun 21, 2025 | AttributeImage Generation | —Unverified | 0 |
| DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Jun 21, 2025 | Autonomous DrivingDescriptive | CodeCode Available | 1 |
| OmniReflect: Discovering Transferable Constitutions for LLM agents via Neuro-Symbolic Reflections | Jun 20, 2025 | Computational EfficiencyLarge Language Model | —Unverified | 0 |
| Challenges in Grounding Language in the Real World | Jun 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Computational Approaches to Understanding Large Language Model Impact on Writing and Information Ecosystems | Jun 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Condition Number as a Scale-Invariant Proxy for Information Encoding in Neural Units | Jun 19, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support | Jun 19, 2025 | Large Language ModelSentence Embeddings | —Unverified | 0 |
| Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models | Jun 19, 2025 | Large Language ModelSafety Alignment | CodeCode Available | 1 |
| LLMs in Coding and their Impact on the Commercial Software Engineering Landscape | Jun 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Jun 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need | Jun 18, 2025 | GSM8KHumanEval | CodeCode Available | 0 |
| Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks | Jun 18, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments | Jun 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models | Jun 18, 2025 | Audio captioningLarge Language Model | CodeCode Available | 2 |
| deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses | Jun 18, 2025 | Large Language Model | —Unverified | 0 |
| SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning | Jun 18, 2025 | Caption GenerationDescriptive | CodeCode Available | 2 |
| LLM Agent for Hyper-Parameter Optimization | Jun 18, 2025 | Large Language Model | —Unverified | 0 |
| DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing | Jun 17, 2025 | AttributeDisentanglement | —Unverified | 0 |
| Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition | Jun 17, 2025 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| FEAST: A Flexible Mealtime-Assistance System Towards In-the-Wild Personalization | Jun 17, 2025 | Large Language Model | —Unverified | 0 |
| Utility-Driven Speculative Decoding for Mixture-of-Experts | Jun 17, 2025 | GPULarge Language Model | —Unverified | 0 |
| ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | Jun 17, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning | Jun 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unified Software Engineering agent as AI Software Engineer | Jun 17, 2025 | Large Language Model | —Unverified | 0 |
| Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs | Jun 17, 2025 | Data IntegrationLarge Language Model | —Unverified | 0 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Jun 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Jun 17, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR | Jun 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model | Jun 16, 2025 | Decision MakingFinancial Analysis | —Unverified | 0 |
| Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems | Jun 16, 2025 | Large Language Model | —Unverified | 0 |
| Calibrated Predictive Lower Bounds on Time-to-Unsafe-Sampling in LLMs | Jun 16, 2025 | Conformal PredictionLarge Language Model | —Unverified | 0 |
| ProfiLLM: An LLM-Based Framework for Implicit Profiling of Chatbot Users | Jun 16, 2025 | ChatbotLarge Language Model | —Unverified | 0 |
| EmoNews: A Spoken Dialogue System for Expressive News Conversations | Jun 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection | Jun 16, 2025 | Data AugmentationLarge Language Model | —Unverified | 0 |
| Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model | Jun 16, 2025 | Large Language Modelmultimodal interaction | CodeCode Available | 5 |
| VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation | Jun 16, 2025 | Data VisualizationLanguage Modeling | CodeCode Available | 0 |
| SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation | Jun 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries | Jun 14, 2025 | Bug fixingInference Optimization | —Unverified | 0 |