SOTAVerified

Large Language Model

Papers

Showing 451500 of 6097 papers

TitleStatusHype
biorecap: an R package for summarizing bioRxiv preprints with a local LLMCode2
Control Industrial Automation System with Large Language Model AgentsCode2
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language ModelsCode2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and EnhancementCode2
Large Language Model with Region-guided Referring and Grounding for CT Report GenerationCode2
BianCang: A Traditional Chinese Medicine Large Language ModelCode2
Alignment faking in large language modelsCode2
PsycoLLM: Enhancing LLM for Psychological Understanding and EvaluationCode2
Aligning to Thousands of Preferences via System Message GeneralizationCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
Efficient LLM Scheduling by Learning to RankCode2
Beyond Text: Frozen Large Language Models in Visual Signal ComprehensionCode2
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing DomainCode2
Explainable Fake News Detection With Large Language Model via Defense Among Competing WisdomCode2
RAGViz: Diagnose and Visualize Retrieval-Augmented GenerationCode2
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM GuidanceCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
Listen, Think, and UnderstandCode2
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model ApplicationCode2
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information ExtractionCode2
Drive Like a Human: Rethinking Autonomous Driving with Large Language ModelsCode2
RepairAgent: An Autonomous, LLM-Based Agent for Program RepairCode2
ARAGOG: Advanced RAG Output GradingCode2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language ModelCode2
KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAGCode2
Algorithm Evolution Using Large Language ModelCode2
CrackSQL: A Hybrid SQL Dialect Translation System Powered by Large Language ModelsCode2
500xCompressor: Generalized Prompt Compression for Large Language ModelsCode2
KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph CompletionCode2
Archon: An Architecture Search Framework for Inference-Time TechniquesCode2
AgentSims: An Open-Source Sandbox for Large Language Model EvaluationCode2
RoarGraph: A Projected Bipartite Graph for Efficient Cross-Modal Approximate Nearest Neighbor SearchCode2
Language Models Can Improve Event Prediction by Few-Shot Abductive ReasoningCode2
RS-Agent: Automating Remote Sensing Tasks through Intelligent AgentCode2
AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web PlatformsCode2
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model GenerationCode2
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
Jailbreaking Attack against Multimodal Large Language ModelCode2
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model ReasoningCode2
Jailbreak Vision Language Models via Bi-Modal Adversarial PromptCode2
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented GenerationCode2
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference OptimizationCode2
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application VulnerabilitiesCode2
Customization Assistant for Text-to-image GenerationCode2
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative DecodingCode2
Introducing Visual Perception Token into Multimodal Large Language ModelCode2
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You WantCode2
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry AreaCode2
Large Language Model Instruction Following: A Survey of Progresses and ChallengesCode2
Language Models can Solve Computer TasksCode2
Show:102550
← PrevPage 10 of 122Next →

No leaderboard results yet.