SOTAVerified

Large Language Model

Papers

Showing 1001-1025 of 6097 papers

Title | Status | Hype
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Code | 1
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model | Code | 1
Inverse Constitutional AI: Compressing Preferences into Principles | Code | 1
LLMDet: A Third Party Large Language Models Generated Text Detection Tool | Code | 1
DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Code | 1
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Code | 1
ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Code | 1
The Machine Psychology of Cooperation: Can GPT models operationalise prompts for altruism, cooperation, competitiveness and selfishness in economic games? | Code | 1
LLMBind: A Unified Modality-Task Integration Framework | Code | 1
ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | Code | 1
ChatCounselor: A Large Language Models for Mental Health Support | Code | 1
LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Code | 1
ChatEDA: A Large Language Model Powered Autonomous Agent for EDA | Code | 1
Automatic Model Selection with Large Language Models for Reasoning | Code | 1
Do Large Language Model Benchmarks Test Reliability? | Code | 1
LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations | Code | 1
LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | Code | 1
Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models | Code | 1
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs | Code | 1
DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Code | 1
Automatic Evaluation of Attribution by Large Language Models | Code | 1
Prompting as Probing: Using Language Models for Knowledge Base Construction | Code | 1
DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines | Code | 1
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Code | 1
LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Code | 1
Page 41 of 244

No leaderboard results yet.