| Simulating User Agents for Embodied Conversational-AI | Oct 31, 2024 | Dataset GenerationLarge Language Model | —Unverified | 0 |
| Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach | Oct 31, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Matchmaker: Self-Improving Large Language Model Programs for Schema Matching | Oct 31, 2024 | Data IntegrationLanguage Modeling | —Unverified | 0 |
| From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents | Oct 31, 2024 | Action AnalysisConversational Web Navigation | —Unverified | 0 |
| Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking | Oct 31, 2024 | Data AugmentationDialogue State Tracking | —Unverified | 0 |
| A Theoretical Perspective for Speculative Decoding Algorithm | Oct 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Ontology in Dialogue State Tracking for Goal-Oriented Chatbot | Oct 30, 2024 | ChatbotDialogue State Tracking | CodeCode Available | 0 |
| Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation | Oct 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EF-LLM: Energy Forecasting LLM with AI-assisted Automation, Enhanced Sparse Prediction, Hallucination Detection | Oct 30, 2024 | Continual LearningHallucination | —Unverified | 0 |
| Toward Understanding In-context vs. In-weight Learning | Oct 30, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language Interpretation | Oct 30, 2024 | Anomaly DetectionDescriptive | —Unverified | 0 |
| Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration | Oct 30, 2024 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Dynamic Information Sub-Selection for Decision Support | Oct 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents | Oct 30, 2024 | Large Language ModelObject Rearrangement | —Unverified | 0 |
| EMMA: End-to-End Multimodal Model for Autonomous Driving | Oct 30, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Anticipating Future with Large Language Model for Simultaneous Machine Translation | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MARCO: Multi-Agent Real-time Chat Orchestration | Oct 29, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents | Oct 29, 2024 | Decision MakingIntent Discovery | —Unverified | 0 |
| Learning and Unlearning of Fabricated Knowledge in Language Models | Oct 29, 2024 | Data PoisoningLanguage Modeling | —Unverified | 0 |
| Sorting Out the Bad Seeds: Automatic Classification of Cryptocurrency Abuse Reports | Oct 28, 2024 | Large Language Model | —Unverified | 0 |
| Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback | Oct 28, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| An Actor-Critic Approach to Boosting Text-to-SQL Large Language Model | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce | Oct 28, 2024 | Benchmarkinggraph construction | —Unverified | 0 |
| Large Language Model Benchmarks in Medical Tasks | Oct 28, 2024 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training | Oct 28, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games | Oct 28, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BongLLaMA: LLaMA for Bangla Language | Oct 28, 2024 | BenchmarkingData Augmentation | —Unverified | 0 |
| ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Guided Prediction Toward Quantum Materials Synthesis | Oct 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MedGo: A Chinese Medical Large Language Model | Oct 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation | Oct 27, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Implementation and Application of an Intelligibility Protocol for Interaction with an LLM | Oct 27, 2024 | Drug DiscoveryLarge Language Model | CodeCode Available | 0 |
| Sequential Large Language Model-Based Hyper-parameter Optimization | Oct 27, 2024 | Bayesian OptimizationBenchmarking | CodeCode Available | 0 |
| R^3AG: First Workshop on Refined and Reliable Retrieval Augmented Generation | Oct 27, 2024 | Information RetrievalLanguage Modelling | —Unverified | 0 |
| IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Oct 25, 2024 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs | Oct 25, 2024 | BenchmarkingFairness | —Unverified | 0 |
| Cobblestone: Iterative Automation for Formal Verification | Oct 25, 2024 | Large Language Model | —Unverified | 0 |
| EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data | Oct 25, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model | Oct 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Provably Robust Watermarks for Open-Source Language Models | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Oct 24, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Unbounded: A Generative Infinite Game of Character Life Simulation | Oct 24, 2024 | Instruction FollowingLanguage Modelling | —Unverified | 0 |
| Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | Oct 24, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| The Stepwise Deception: Simulating the Evolution from True News to Fake News with LLM Agents | Oct 24, 2024 | Large Language ModelMisinformation | —Unverified | 0 |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Oct 24, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation | Oct 23, 2024 | GPULanguage Modeling | —Unverified | 0 |