SOTAVerified

Large Language Model

Papers

Showing 23012350 of 6097 papers

TitleStatusHype
Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning0
EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI DetectionCode1
Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge GraphsCode2
Simulating User Agents for Embodied Conversational-AI0
ALISE: Accelerating Large Language Model Serving with Speculative Scheduling0
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents0
EF-LLM: Energy Forecasting LLM with AI-assisted Automation, Enhanced Sparse Prediction, Hallucination Detection0
A Theoretical Perspective for Speculative Decoding Algorithm0
Dynamic Information Sub-Selection for Decision Support0
Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration0
EMMA: End-to-End Multimodal Model for Autonomous Driving0
Beyond Ontology in Dialogue State Tracking for Goal-Oriented ChatbotCode0
Real-Time Personalization for LLM-based Recommendation with Customized In-Context LearningCode1
Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation0
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents0
PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language Interpretation0
Toward Understanding In-context vs. In-weight Learning0
Online Intrinsic Rewards for Decision Making Agents from Large Language Model FeedbackCode1
Anticipating Future with Large Language Model for Simultaneous Machine Translation0
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM GuidanceCode2
Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents0
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt TypesCode1
Protecting Privacy in Multimodal Large Language Models with MLLMU-BenchCode2
Learning and Unlearning of Fabricated Knowledge in Language Models0
MARCO: Multi-Agent Real-time Chat Orchestration0
Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by BettingCode0
An Actor-Critic Approach to Boosting Text-to-SQL Large Language Model0
Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games0
LLMCBench: Benchmarking Large Language Model Compression for Efficient DeploymentCode1
Large Language Model Benchmarks in Medical Tasks0
ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents0
Large Language Model-Guided Prediction Toward Quantum Materials SynthesisCode0
Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training0
Sorting Out the Bad Seeds: Automatic Classification of Cryptocurrency Abuse Reports0
BongLLaMA: LLaMA for Bangla Language0
Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback0
Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce0
Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality0
Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring0
Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation0
MedGo: A Chinese Medical Large Language Model0
Sequential Large Language Model-Based Hyper-parameter OptimizationCode0
Implementation and Application of an Intelligibility Protocol for Interaction with an LLMCode0
TrajAgent: An Agent Framework for Unified Trajectory ModellingCode1
R^3AG: First Workshop on Refined and Reliable Retrieval Augmented Generation0
SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative RefinementCode4
Agentic Feedback Loop Modeling Improves Recommendation and User SimulationCode1
Cobblestone: Iterative Automation for Formal Verification0
EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data0
Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model0
Show:102550
← PrevPage 47 of 122Next →

No leaderboard results yet.