SOTAVerified

Large Language Model

Papers

Showing 33513400 of 6097 papers

TitleStatusHype
Simulating User Agents for Embodied Conversational-AI0
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach0
Matchmaker: Self-Improving Large Language Model Programs for Schema Matching0
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents0
Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking0
A Theoretical Perspective for Speculative Decoding Algorithm0
Beyond Ontology in Dialogue State Tracking for Goal-Oriented ChatbotCode0
Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation0
EF-LLM: Energy Forecasting LLM with AI-assisted Automation, Enhanced Sparse Prediction, Hallucination Detection0
Toward Understanding In-context vs. In-weight Learning0
PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language Interpretation0
Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration0
Dynamic Information Sub-Selection for Decision Support0
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents0
EMMA: End-to-End Multimodal Model for Autonomous Driving0
Anticipating Future with Large Language Model for Simultaneous Machine Translation0
Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by BettingCode0
MARCO: Multi-Agent Real-time Chat Orchestration0
Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents0
Learning and Unlearning of Fabricated Knowledge in Language Models0
Sorting Out the Bad Seeds: Automatic Classification of Cryptocurrency Abuse Reports0
Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback0
An Actor-Critic Approach to Boosting Text-to-SQL Large Language Model0
Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce0
Large Language Model Benchmarks in Medical Tasks0
Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality0
Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training0
Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games0
Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring0
BongLLaMA: LLaMA for Bangla Language0
ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents0
Large Language Model-Guided Prediction Toward Quantum Materials SynthesisCode0
MedGo: A Chinese Medical Large Language Model0
Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation0
Implementation and Application of an Intelligibility Protocol for Interaction with an LLMCode0
Sequential Large Language Model-Based Hyper-parameter OptimizationCode0
R^3AG: First Workshop on Refined and Reliable Retrieval Augmented Generation0
IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation0
FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs0
Cobblestone: Iterative Automation for Formal Verification0
EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data0
Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model0
Provably Robust Watermarks for Open-Source Language Models0
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks0
Unbounded: A Generative Infinite Game of Character Life Simulation0
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms0
The Stepwise Deception: Simulating the Evolution from True News to Fake News with LLM Agents0
AlignCap: Aligning Speech Emotion Captioning to Human Preferences0
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs0
CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation0
Show:102550
← PrevPage 68 of 122Next →

No leaderboard results yet.