SOTAVerified

Large Language Model

Papers

Showing 12011225 of 6097 papers

TitleStatusHype
Enhancing RL Safety with Counterfactual LLM ReasoningCode1
CompeteAI: Understanding the Competition Dynamics in Large Language Model-based AgentsCode1
Matching Patients to Clinical Trials with Large Language ModelsCode1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward PassesCode1
MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt TuningCode1
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music RetrievalCode1
RARR: Researching and Revising What Language Models Say, Using Language ModelsCode1
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric KnowledgeCode1
Composing Parameter-Efficient Modules with Arithmetic OperationsCode1
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEditCode1
Compositional Chain-of-Thought Prompting for Large Multimodal ModelsCode1
AttributionBench: How Hard is Automatic Attribution Evaluation?Code1
MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledgeCode1
Multi-label Sequential Sentence Classification via Large Language ModelCode1
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and DiagnosisCode1
CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-TuningCode1
Multimodal LLM-Guided Semantic Correction in Text-to-Image DiffusionCode1
Democratizing Reasoning Ability: Tailored Learning from Large Language ModelCode1
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity EnvironmentsCode1
Evaluating Retrieval Quality in Retrieval-Augmented GenerationCode1
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM FamilyCode1
Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code GenerationCode1
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
Making Language Models Better Tool Learners with Execution FeedbackCode1
Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability DetectionCode1
Show:102550
← PrevPage 49 of 244Next →

No leaderboard results yet.