| Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis | Jun 10, 2025 | Domain AdaptationLarge Language Model | CodeCode Available | 1 |
| WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis | Jun 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Jun 10, 2025 | Binary ClassificationFinancial Analysis | CodeCode Available | 1 |
| MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning | Jun 10, 2025 | Allgraph construction | —Unverified | 0 |
| CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models | Jun 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models | Jun 10, 2025 | Backdoor AttackKeyword Spotting | —Unverified | 0 |
| From Pixels to Graphs: using Scene and Knowledge Graphs for HD-EPIC VQA Challenge | Jun 10, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Your Agent Can Defend Itself against Backdoor Attacks | Jun 10, 2025 | Large Language Model | —Unverified | 0 |
| LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs | Jun 10, 2025 | Large Language ModelMath | —Unverified | 0 |
| Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data | Jun 10, 2025 | Decision MakingInformation Retrieval | —Unverified | 0 |
| G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems | Jun 9, 2025 | Large Language Model | CodeCode Available | 3 |
| LLM Unlearning Should Be Form-Independent | Jun 9, 2025 | FormLarge Language Model | —Unverified | 0 |
| JavelinGuard: Low-Cost Transformer Architectures for LLM Security | Jun 9, 2025 | CPULarge Language Model | —Unverified | 0 |
| MiniCPM4: Ultra-Efficient LLMs on End Devices | Jun 9, 2025 | Large Language Model | CodeCode Available | 9 |
| DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO | Jun 9, 2025 | Data AugmentationLarge Language Model | —Unverified | 0 |
| Statistical Hypothesis Testing for Auditing Robustness in Language Models | Jun 9, 2025 | FairnessLarge Language Model | —Unverified | 0 |
| Event-Priori-Based Vision-Language Model for Efficient Visual Understanding | Jun 9, 2025 | Event-based visionLanguage Modeling | —Unverified | 0 |
| An Intelligent Fault Self-Healing Mechanism for Cloud AI Systems via Integration of Large Language Models and Deep Reinforcement Learning | Jun 9, 2025 | Deep Reinforcement LearningLarge Language Model | —Unverified | 0 |
| Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions | Jun 9, 2025 | Large Language ModelReinforcement Learning (RL) | CodeCode Available | 2 |
| SpatialLM: Training Large Language Models for Structured Indoor Modeling | Jun 9, 2025 | 3D Object DetectionLanguage Modeling | —Unverified | 0 |
| Language-Grounded Hierarchical Planning and Execution with Multi-Robot 3D Scene Graphs | Jun 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations | Jun 9, 2025 | Large Language ModelMultimodal Reasoning | —Unverified | 0 |
| How Benchmark Prediction from Fewer Data Misses the Mark | Jun 9, 2025 | Large Language ModelPrediction | CodeCode Available | 0 |
| QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA | Jun 9, 2025 | Large Language Model | —Unverified | 0 |
| Cognitive Weave: Synthesizing Abstracted Knowledge with a Spatio-Temporal Resonance Graph | Jun 9, 2025 | Large Language ModelQuestion Answering | CodeCode Available | 0 |
| AnnoDPO: Protein Functional Annotation Learning with Direct Preference Optimization | Jun 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Speech Recognition on TV Series with Video-guided Post-Correction | Jun 8, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Contextual Experience Replay for Self-Improvement of Language Agents | Jun 7, 2025 | Decision MakingLarge Language Model | —Unverified | 0 |
| An Agentic Framework for Autonomous Metamaterial Modeling and Inverse Design | Jun 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks | Jun 7, 2025 | Large Language ModelTask Planning | —Unverified | 0 |
| AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search | Jun 6, 2025 | Large Language ModelMath | CodeCode Available | 0 |
| PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time | Jun 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Jun 6, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms | Jun 6, 2025 | DiversityLarge Language Model | —Unverified | 0 |
| Prompting Wireless Networks: Reinforced In-Context Learning for Power Control | Jun 6, 2025 | Decision MakingIn-Context Learning | —Unverified | 0 |
| Hierarchical Debate-Based Large Language Model (LLM) for Complex Task Planning of 6G Network Management | Jun 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Training-Free Query Optimization via LLM-Based Plan Similarity | Jun 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage | Jun 6, 2025 | CPUGPU | —Unverified | 0 |
| HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions | Jun 6, 2025 | Large Language ModelText Generation | CodeCode Available | 0 |
| ScriptDoctor: Automatic Generation of PuzzleScript Games via Large Language Models and Tree Search | Jun 6, 2025 | Game DesignLarge Language Model | —Unverified | 0 |
| Voice Impression Control in Zero-Shot TTS | Jun 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias | Jun 6, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| Customizing Speech Recognition Model with Large Language Model Feedback | Jun 5, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data | Jun 5, 2025 | Drug DiscoveryLarge Language Model | CodeCode Available | 1 |
| E-bike agents: Large Language Model-Driven E-Bike Accident Analysis and Severity Prediction | Jun 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development | Jun 5, 2025 | Large Language Model | CodeCode Available | 7 |
| DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models | Jun 5, 2025 | BenchmarkingDiversity | —Unverified | 0 |
| Interpretable Multimodal Framework for Human-Centered Street Assessment: Integrating Visual-Language Models for Perceptual Urban Diagnostics | Jun 5, 2025 | Large Language Model | —Unverified | 0 |
| Parking, Perception, and Retail: Street-Level Determinants of Community Vitality in Harbin | Jun 5, 2025 | Large Language ModelMorphological Analysis | —Unverified | 0 |
| HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training | Jun 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |