| Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning | Feb 19, 2025 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments | Feb 19, 2025 | Event SegmentationLanguage Modeling | —Unverified | 0 |
| AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain Recommendations | Feb 19, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis | Feb 19, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | Feb 19, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs | Feb 19, 2025 | Data AugmentationGraph Learning | CodeCode Available | 0 |
| Reproducing NevIR: Negation in Neural Information Retrieval | Feb 19, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models | Feb 19, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference | Feb 19, 2025 | Graph AttentionLarge Language Model | CodeCode Available | 0 |
| Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning | Feb 19, 2025 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| Autellix: An Efficient Serving Engine for LLM Agents as General Programs | Feb 19, 2025 | BlockingLanguage Modeling | —Unverified | 0 |
| Complex Ontology Matching with Large Language Model Embeddings | Feb 19, 2025 | Graph MatchingLanguage Modeling | —Unverified | 0 |
| Reflection of Episodes: Learning to Play Game from Expert and Self Experiences | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TALKPLAY: Multimodal Music Recommendation with Large Language Models | Feb 19, 2025 | Conversational RecommendationInstruction Following | —Unverified | 0 |
| LLM should think and action as a human | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL | Feb 18, 2025 | counterfactualDeception Detection | —Unverified | 0 |
| OCCULT: Evaluating Large Language Models for Offensive Cyber Operation Capabilities | Feb 18, 2025 | Large Language ModelMultiple-choice | —Unverified | 0 |
| User Intent to Use DeepSeek for Healthcare Purposes and their Trust in the Large Language Model: Multinational Survey Study | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Private Text Generation by Seeding Large Language Model Prompts | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models | Feb 18, 2025 | Large Language Model | —Unverified | 0 |
| Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models | Feb 18, 2025 | BenchmarkingLarge Language Model | —Unverified | 0 |
| Advanced simulation paradigm of human behaviour unveils complex financial systemic projection | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards more Contextual Agents: An extractor-Generator Optimization Framework | Feb 18, 2025 | Large Language Model | —Unverified | 0 |
| BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference | Feb 18, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Gesture-Aware Zero-Shot Speech Recognition for Patients with Language Disorders | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation | Feb 18, 2025 | global-optimizationLarge Language Model | —Unverified | 0 |
| SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems | Feb 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents | Feb 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| KL Penalty Control via Perturbation for Direct Preference Optimization | Feb 18, 2025 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| Towards an automated workflow in materials science for combining multi-modal simulative and experimental information using data mining and large language models | Feb 18, 2025 | Information RetrievalLarge Language Model | —Unverified | 0 |
| You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations | Feb 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ReviewEval: An Evaluation Framework for AI-Generated Reviews | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NOTA: Multimodal Music Notation Understanding for Visual Large Language Model | Feb 17, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |
| Learning to Reason at the Frontier of Learnability | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Accuracy Assessment of OpenAlex and Clarivate Scholar ID with an LLM-Assisted Benchmark | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Locally-Deployed Chain-of-Thought (CoT) Reasoning Model in Chemical Engineering: Starting from 30 Experimental Data | Feb 17, 2025 | Gaussian ProcessesLanguage Modeling | —Unverified | 0 |
| SmartLLM: Smart Contract Auditing using Custom Generative AI | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars | Feb 17, 2025 | Few-Shot LearningIn-Context Learning | CodeCode Available | 0 |
| TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing | Feb 17, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making | Feb 17, 2025 | Decision MakingEthics | —Unverified | 0 |
| SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities | Feb 17, 2025 | Large Language ModelMisinformation | —Unverified | 0 |
| Aligning Sentence Simplification with ESL Learner's Proficiency for Language Acquisition | Feb 17, 2025 | DiversityLanguage Acquisition | CodeCode Available | 0 |
| Connecting Large Language Model Agent to High Performance Computing Resource | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining | Feb 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs | Feb 17, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Competing LLM Agents in a Non-Cooperative Game of Opinion Polarisation | Feb 17, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Intelligent Mobile AI-Generated Content Services via Interactive Prompt Engineering and Dynamic Service Provisioning | Feb 17, 2025 | Deep Reinforcement LearningLarge Language Model | —Unverified | 0 |