| Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning | Oct 31, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection | Oct 31, 2024 | Human-Object Interaction DetectionLarge Language Model | CodeCode Available | 1 |
| Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs | Oct 31, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 2 |
| Simulating User Agents for Embodied Conversational-AI | Oct 31, 2024 | Dataset GenerationLarge Language Model | —Unverified | 0 |
| ALISE: Accelerating Large Language Model Serving with Speculative Scheduling | Oct 31, 2024 | BlockingLanguage Modeling | —Unverified | 0 |
| From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents | Oct 31, 2024 | Action AnalysisConversational Web Navigation | —Unverified | 0 |
| EF-LLM: Energy Forecasting LLM with AI-assisted Automation, Enhanced Sparse Prediction, Hallucination Detection | Oct 30, 2024 | Continual LearningHallucination | —Unverified | 0 |
| A Theoretical Perspective for Speculative Decoding Algorithm | Oct 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dynamic Information Sub-Selection for Decision Support | Oct 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration | Oct 30, 2024 | Code GenerationLanguage Modeling | —Unverified | 0 |
| EMMA: End-to-End Multimodal Model for Autonomous Driving | Oct 30, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Beyond Ontology in Dialogue State Tracking for Goal-Oriented Chatbot | Oct 30, 2024 | ChatbotDialogue State Tracking | CodeCode Available | 0 |
| Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning | Oct 30, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation | Oct 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents | Oct 30, 2024 | Large Language ModelObject Rearrangement | —Unverified | 0 |
| PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language Interpretation | Oct 30, 2024 | Anomaly DetectionDescriptive | —Unverified | 0 |
| Toward Understanding In-context vs. In-weight Learning | Oct 30, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback | Oct 30, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Anticipating Future with Large Language Model for Simultaneous Machine Translation | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents | Oct 29, 2024 | Decision MakingIntent Discovery | —Unverified | 0 |
| SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Learning and Unlearning of Fabricated Knowledge in Language Models | Oct 29, 2024 | Data PoisoningLanguage Modeling | —Unverified | 0 |
| MARCO: Multi-Agent Real-time Chat Orchestration | Oct 29, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Actor-Critic Approach to Boosting Text-to-SQL Large Language Model | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games | Oct 28, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Oct 28, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Large Language Model Benchmarks in Medical Tasks | Oct 28, 2024 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Guided Prediction Toward Quantum Materials Synthesis | Oct 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training | Oct 28, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Sorting Out the Bad Seeds: Automatic Classification of Cryptocurrency Abuse Reports | Oct 28, 2024 | Large Language Model | —Unverified | 0 |
| BongLLaMA: LLaMA for Bangla Language | Oct 28, 2024 | BenchmarkingData Augmentation | —Unverified | 0 |
| Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback | Oct 28, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce | Oct 28, 2024 | Benchmarkinggraph construction | —Unverified | 0 |
| Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation | Oct 27, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| MedGo: A Chinese Medical Large Language Model | Oct 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sequential Large Language Model-Based Hyper-parameter Optimization | Oct 27, 2024 | Bayesian OptimizationBenchmarking | CodeCode Available | 0 |
| Implementation and Application of an Intelligibility Protocol for Interaction with an LLM | Oct 27, 2024 | Drug DiscoveryLarge Language Model | CodeCode Available | 0 |
| TrajAgent: An Agent Framework for Unified Trajectory Modelling | Oct 27, 2024 | Future predictionLanguage Modeling | CodeCode Available | 1 |
| R^3AG: First Workshop on Refined and Reliable Retrieval Augmented Generation | Oct 27, 2024 | Information RetrievalLanguage Modelling | —Unverified | 0 |
| SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement | Oct 26, 2024 | Large Language Model | CodeCode Available | 4 |
| Agentic Feedback Loop Modeling Improves Recommendation and User Simulation | Oct 26, 2024 | Large Language ModelUser Simulation | CodeCode Available | 1 |
| Cobblestone: Iterative Automation for Formal Verification | Oct 25, 2024 | Large Language Model | —Unverified | 0 |
| EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data | Oct 25, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model | Oct 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |