| Autellix: An Efficient Serving Engine for LLM Agents as General Programs | Feb 19, 2025 | BlockingLanguage Modeling | —Unverified | 0 |
| Complex Ontology Matching with Large Language Model Embeddings | Feb 19, 2025 | Graph MatchingLanguage Modeling | —Unverified | 0 |
| Reproducing NevIR: Negation in Neural Information Retrieval | Feb 19, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| LLM should think and action as a human | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models | Feb 19, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | Feb 19, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health | Feb 19, 2025 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs | Feb 19, 2025 | Data AugmentationGraph Learning | CodeCode Available | 0 |
| Advanced simulation paradigm of human behaviour unveils complex financial systemic projection | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| User Intent to Use DeepSeek for Healthcare Purposes and their Trust in the Large Language Model: Multinational Survey Study | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OCCULT: Evaluating Large Language Models for Offensive Cyber Operation Capabilities | Feb 18, 2025 | Large Language ModelMultiple-choice | —Unverified | 0 |
| Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL | Feb 18, 2025 | counterfactualDeception Detection | —Unverified | 0 |
| Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents | Feb 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation | Feb 18, 2025 | global-optimizationLarge Language Model | —Unverified | 0 |
| Towards an automated workflow in materials science for combining multi-modal simulative and experimental information using data mining and large language models | Feb 18, 2025 | Information RetrievalLarge Language Model | —Unverified | 0 |
| G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation | Feb 18, 2025 | Collaborative FilteringExplainable Recommendation | CodeCode Available | 1 |
| Towards more Contextual Agents: An extractor-Generator Optimization Framework | Feb 18, 2025 | Large Language Model | —Unverified | 0 |
| STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models | Feb 18, 2025 | BenchmarkingLarge Language Model | —Unverified | 0 |
| Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Private Text Generation by Seeding Large Language Model Prompts | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models | Feb 18, 2025 | Large Language Model | —Unverified | 0 |
| You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Gesture-Aware Zero-Shot Speech Recognition for Patients with Language Disorders | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition | Feb 18, 2025 | Emotion RecognitionLarge Language Model | CodeCode Available | 1 |
| SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems | Feb 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Text-Image Interleaved Retrieval | Feb 18, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks | Feb 18, 2025 | graph constructionLarge Language Model | CodeCode Available | 3 |
| Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference | Feb 18, 2025 | GPULanguage Modeling | —Unverified | 0 |
| KL Penalty Control via Perturbation for Direct Preference Optimization | Feb 18, 2025 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design | Feb 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making | Feb 17, 2025 | Decision MakingEthics | —Unverified | 0 |
| Learning to Reason at the Frontier of Learnability | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NOTA: Multimodal Music Notation Understanding for Visual Large Language Model | Feb 17, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |
| ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unveiling Privacy Risks in LLM Agent Memory | Feb 17, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Locally-Deployed Chain-of-Thought (CoT) Reasoning Model in Chemical Engineering: Starting from 30 Experimental Data | Feb 17, 2025 | Gaussian ProcessesLanguage Modeling | —Unverified | 0 |
| Large Language Models Can Help Mitigate Barren Plateaus | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Connecting Large Language Model Agent to High Performance Computing Resource | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SmartLLM: Smart Contract Auditing using Custom Generative AI | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars | Feb 17, 2025 | Few-Shot LearningIn-Context Learning | CodeCode Available | 0 |
| Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation? | Feb 17, 2025 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs | Feb 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs | Feb 17, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Feb 17, 2025 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| SMART: Self-Aware Agent for Tool Overuse Mitigation | Feb 17, 2025 | GSM8KLarge Language Model | CodeCode Available | 1 |
| Aligning Sentence Simplification with ESL Learner's Proficiency for Language Acquisition | Feb 17, 2025 | DiversityLanguage Acquisition | CodeCode Available | 0 |
| video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model | Feb 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |