| FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation | Jun 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Jun 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| From Human to Machine Psychology: A Conceptual Framework for Understanding Well-Being in Large Language Model | Jun 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship in DeepSeek | Jun 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Large Language Model Safety with Contrastive Representation Learning | Jun 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| VGR: Visual Grounded Reasoning | Jun 13, 2025 | Large Language ModelMath | —Unverified | 0 |
| Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning | Jun 13, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Semantic Preprocessing for LLM-based Malware Analysis | Jun 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study | Jun 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations | Jun 13, 2025 | Fraud DetectionLanguage Modeling | —Unverified | 0 |
| From Emergence to Control: Probing and Modulating Self-Reflection in Language Models | Jun 13, 2025 | Large Language ModelNavigate | CodeCode Available | 0 |
| The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs | Jun 13, 2025 | Large Language Model | —Unverified | 0 |
| SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks | Jun 13, 2025 | BenchmarkingLarge Language Model | CodeCode Available | 2 |
| Intelligent Automation for FDI Facilitation: Optimizing Tariff Exemption Processes with OCR And Large Language Models | Jun 12, 2025 | Large Language ModelOptical Character Recognition | —Unverified | 0 |
| LLM-as-a-Fuzzy-Judge: Fine-Tuning Large Language Models as a Clinical Evaluation Judge with Fuzzy Logic | Jun 12, 2025 | Large Language ModelPrompt Engineering | CodeCode Available | 0 |
| Nowcasting the euro area with social media data | Jun 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | Jun 12, 2025 | CPUGPU | —Unverified | 0 |
| Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills | Jun 12, 2025 | Large Language ModelTask Planning | —Unverified | 0 |
| Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework | Jun 12, 2025 | Adversarial AttackDiversity | —Unverified | 0 |
| Automated Validation of Textual Constraints Against AutomationML via LLMs and SHACL | Jun 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DanceChat: Large Language Model-Guided Music-to-Dance Generation | Jun 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Jun 12, 2025 | Large Language ModelStarcraft | CodeCode Available | 1 |
| Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding | Jun 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Slimming Down LLMs Without Losing Their Minds | Jun 12, 2025 | Computational EfficiencyGSM8K | —Unverified | 0 |
| Provably Learning from Language Feedback | Jun 12, 2025 | Large Language Model | —Unverified | 0 |
| AutoMind: Adaptive Knowledgeable Agent for Automated Data Science | Jun 12, 2025 | Code GenerationLarge Language Model | CodeCode Available | 2 |
| NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors | Jun 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Improving Named Entity Transcription with Contextual LLM-based Revision | Jun 12, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models | Jun 11, 2025 | Data AugmentationDecision Making | —Unverified | 0 |
| ADAgent: LLM Agent for Alzheimer's Disease Analysis with Collaborative Coordinator | Jun 11, 2025 | AI AgentLarge Language Model | —Unverified | 0 |
| DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision | Jun 11, 2025 | 3D GenerationLarge Language Model | —Unverified | 0 |
| Superstudent intelligence in thermodynamics | Jun 11, 2025 | Large Language Model | —Unverified | 0 |
| Prompt-Guided Latent Diffusion with Predictive Class Conditioning for 3D Prostate MRI Generation | Jun 11, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Towards Multi-modal Graph Large Language Model | Jun 11, 2025 | Graph LearningIn-Context Learning | —Unverified | 0 |
| V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning | Jun 11, 2025 | Action AnticipationLarge Language Model | CodeCode Available | 7 |
| Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA | Jun 11, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| Disclosure Audits for LLM Agents | Jun 11, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models | Jun 11, 2025 | Large Language ModelRed Teaming | —Unverified | 0 |
| Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information | Jun 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| XGraphRAG: Interactive Visual Analysis for Graph-based Retrieval-Augmented Generation | Jun 10, 2025 | graph constructionLanguage Modeling | CodeCode Available | 0 |
| The Predictive Brain: Neural Correlates of Word Expectancy Align with Large Language Model Prediction Probabilities | Jun 10, 2025 | EEGLanguage Modeling | —Unverified | 0 |
| SoK: Machine Unlearning for Large Language Models | Jun 10, 2025 | Large Language ModelMachine Unlearning | —Unverified | 0 |
| PHRASED: Phrase Dictionary Biasing for Speech Translation | Jun 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Fireworks Algorithm Equipped with an Explosion Mechanism based on Student's T-distribution | Jun 10, 2025 | Large Language Model | —Unverified | 0 |
| From Pixels to Graphs: using Scene and Knowledge Graphs for HD-EPIC VQA Challenge | Jun 10, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Towards Secure and Private Language Models for Nuclear Power Plants | Jun 10, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning | Jun 10, 2025 | Large Language Modelreinforcement-learning | CodeCode Available | 1 |
| SakugaFlow: A Stagewise Illustration Framework Emulating the Human Drawing Process and Providing Interactive Tutoring for Novice Drawing Skills | Jun 10, 2025 | AnatomyImage Generation | —Unverified | 0 |
| Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness | Jun 10, 2025 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis | Jun 10, 2025 | Domain AdaptationLarge Language Model | CodeCode Available | 1 |