SOTAVerified

Large Language Model

Papers

Showing 12261250 of 6097 papers

TitleStatusHype
Detecting Hallucinations in Large Language Model Generation: A Token Probability ApproachCode1
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and DiagnosisCode1
On Diversified Preferences of Large Language Model AlignmentCode1
Democratizing Reasoning Ability: Tailored Learning from Large Language ModelCode1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output GenerationCode1
A Study of Generative Large Language Model for Medical Research and HealthcareCode1
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global MemoryCode1
AuditWen:An Open-Source Large Language Model for AuditCode1
Measuring General Intelligence with Generated GamesCode1
CONFLARE: CONFormal LArge language model REtrievalCode1
OntoChatGPT Information System: Ontology-Driven Structured Prompts for ChatGPT Meta-LearningCode1
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity EnvironmentsCode1
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
AstroAgents: A Multi-Agent AI for Hypothesis Generation from Mass Spectrometry DataCode1
Fairer Preferences Elicit Improved Human-Aligned Large Language Model JudgmentsCode1
Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providersCode1
Extensive Self-Contrast Enables Feedback-Free Language Model AlignmentCode1
CityBench: Evaluating the Capabilities of Large Language Models for Urban TasksCode1
Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language TranslationCode1
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM ReasoningCode1
ConSmax: Hardware-Friendly Alternative Softmax with Learnable ParametersCode1
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language ModelCode1
MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledgeCode1
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image SequencesCode1
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgentCode1
Show:102550
← PrevPage 50 of 244Next →

No leaderboard results yet.