SOTAVerified

Large Language Model

Papers

Showing 12011250 of 6097 papers

TitleStatusHype
Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge EncodingCode1
CompeteAI: Understanding the Competition Dynamics in Large Language Model-based AgentsCode1
A Large Language Model Enhanced Sequential Recommender for Joint Video and Comment RecommendationCode1
MEPNet: Medical Entity-balanced Prompting Network for Brain CT Report GenerationCode1
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific LiteratureCode1
Evaluating ChatGPT as a Recommender System: A Rigorous ApproachCode1
Memory Sharing for Large Language Model based AgentsCode1
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric KnowledgeCode1
Composing Parameter-Efficient Modules with Arithmetic OperationsCode1
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEditCode1
Compositional Chain-of-Thought Prompting for Large Multimodal ModelsCode1
AttributionBench: How Hard is Automatic Attribution Evaluation?Code1
Evaluating Retrieval Quality in Retrieval-Augmented GenerationCode1
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each BenchmarkCode1
MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR PredictionCode1
Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMsCode1
MindGPT: Interpreting What You See with Non-invasive Brain RecordingsCode1
CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-TuningCode1
Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU HeterogeneityCode1
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resourcesCode1
Meerkat: Audio-Visual Large Language Model for Grounding in Space and TimeCode1
MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion PerceptionCode1
Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code GenerationCode1
Excuse me, sir? Your language model is leaking (information)Code1
MedFILIP: Medical Fine-grained Language-Image Pre-trainingCode1
Detecting Hallucinations in Large Language Model Generation: A Token Probability ApproachCode1
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and DiagnosisCode1
On Diversified Preferences of Large Language Model AlignmentCode1
Democratizing Reasoning Ability: Tailored Learning from Large Language ModelCode1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output GenerationCode1
A Study of Generative Large Language Model for Medical Research and HealthcareCode1
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global MemoryCode1
AuditWen:An Open-Source Large Language Model for AuditCode1
Measuring General Intelligence with Generated GamesCode1
CONFLARE: CONFormal LArge language model REtrievalCode1
OntoChatGPT Information System: Ontology-Driven Structured Prompts for ChatGPT Meta-LearningCode1
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity EnvironmentsCode1
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
AstroAgents: A Multi-Agent AI for Hypothesis Generation from Mass Spectrometry DataCode1
Fairer Preferences Elicit Improved Human-Aligned Large Language Model JudgmentsCode1
Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providersCode1
Extensive Self-Contrast Enables Feedback-Free Language Model AlignmentCode1
CityBench: Evaluating the Capabilities of Large Language Models for Urban TasksCode1
Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language TranslationCode1
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM ReasoningCode1
ConSmax: Hardware-Friendly Alternative Softmax with Learnable ParametersCode1
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language ModelCode1
MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledgeCode1
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image SequencesCode1
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgentCode1
Show:102550
← PrevPage 25 of 122Next →

No leaderboard results yet.