| Understand User Opinions of Large Language Models via LLM-Powered In-the-Moment User Experience Interviews | Feb 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System | Feb 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Is Relevance Propagated from Retriever to Generator in RAG? | Feb 20, 2025 | Large Language Model, Question Answering | Unverified | 0 |
| Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models | Feb 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Optimizing Singular Spectrum for Large Language Model Compression | Feb 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Feb 20, 2025 | AutoML, Code Generation | Code Available | 1 |
| TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Feb 20, 2025 | Benchmarking, Code Generation | Code Available | 2 |
| Rapid Word Learning Through Meta In-Context Learning | Feb 20, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| HPS: Hard Preference Sampling for Human Preference Alignment | Feb 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation | Feb 20, 2025 | Generative Adversarial Network, Language Modeling | Code Available | 0 |
| Prompt-to-Leaderboard | Feb 20, 2025 | Chatbot, Language Modeling | Code Available | 3 |
| Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems | Feb 20, 2025 | Benchmarking, Decision Making | Unverified | 0 |
| STeCa: Step-level Trajectory Calibration for LLM Agent Learning | Feb 20, 2025 | Decision Making, Language Modeling | Code Available | 1 |
| SR-LLM: Rethinking the Structured Representation in Large Language Model | Feb 20, 2025 | Abstract Meaning Representation, Language Modeling | Unverified | 0 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | Blocking, Language Modeling | Code Available | 1 |
| Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems | Feb 19, 2025 | Collaborative Filtering, Conversational Recommendation | Code Available | 1 |
| Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments | Feb 19, 2025 | Event Segmentation, Language Modeling | Unverified | 0 |
| Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning | Feb 19, 2025 | Few-Shot Learning, Language Modeling | Unverified | 0 |
| Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning | Feb 19, 2025 | Common Sense Reasoning, Language Modeling | Unverified | 0 |
| DataSciBench: An LLM Agent Benchmark for Data Science | Feb 19, 2025 | Code Generation, Large Language Model | Code Available | 2 |
| LLM should think and action as a human | Feb 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | Feb 19, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs | Feb 19, 2025 | Data Augmentation, Graph Learning | Code Available | 0 |
| PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference | Feb 19, 2025 | Graph Attention, Large Language Model | Code Available | 0 |
| TALKPLAY: Multimodal Music Recommendation with Large Language Models | Feb 19, 2025 | Conversational Recommendation, Instruction Following | Unverified | 0 |