SOTAVerified

Large Language Model

Papers

Showing 56015650 of 6097 papers

TitleStatusHype
Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language ModelsCode0
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documentsCode0
Automated title and abstract screening for scoping reviews using the GPT-4 Large Language ModelCode0
WaterDrum: Watermarking for Data-centric Unlearning MetricCode0
TruthEval: A Dataset to Evaluate LLM Truthfulness and ReliabilityCode0
Conversations in Galician: a Large Language Model for an Underrepresented LanguageCode0
Vamos: Versatile Action Models for Video UnderstandingCode0
Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language ModelsCode0
PeriGuru: A Peripheral Robotic Mobile App Operation Assistant based on GUI Image Understanding and Prompting with LLMCode0
Leveraging Content and Acoustic Representations for Speech Emotion RecognitionCode0
Can a Large Language Model Learn Matrix Functions In Context?Code0
The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented InterventionCode0
Can a large language model be a gaslighter?Code0
Conversational Feedback in Scripted versus Spontaneous Dialogues: A Comparative AnalysisCode0
Can AI Relate: Testing Large Language Model Response for Mental Health SupportCode0
SemEval-2017 Task 4: Sentiment Analysis in Twitter using BERTCode0
TULUN: Transparent and Adaptable Low-resource Machine TranslationCode0
A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error DetectionCode0
Exploiting the Vulnerability of Large Language Models via Defense-Aware Architectural BackdoorCode0
Personalized LLM for Generating Customized Responses to the Same Query from Different UsersCode0
Automated Privacy Information Annotation in Large Language Model InteractionsCode0
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language ModelsCode0
Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short OnesCode0
Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct FeaturesCode0
Scaling Reasoning can Improve Factuality in Large Language ModelsCode0
Computational Reasoning of Large Language ModelsCode0
Length Optimization in Conformal PredictionCode0
Can (A)I Change Your Mind?Code0
Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language ModelCode0
Conversational AI Powered by Large Language Models Amplifies False Memories in Witness InterviewsCode0
LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia LearningCode0
Controlling Large Language Model with Latent ActionsCode0
Expertise elevates AI usage: experimental evidence comparing laypeople and professional artistsCode0
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesisCode0
Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-LevelsCode0
The impact of responding to patient messages with large language model assistanceCode0
Physics Event Classification Using Large Language ModelsCode0
Variance Control via Weight Rescaling in LLM Pre-trainingCode0
LEAVS: An LLM-based Labeler for Abdominal CT SupervisionCode0
Learning to Verify Summary Facts with Fine-Grained LLM FeedbackCode0
PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario SimulationCode0
TutorGym: A Testbed for Evaluating AI Agents as Tutors and StudentsCode0
Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World ClustersCode0
Learning to Rank Context for Named Entity Recognition Using a Synthetic DatasetCode0
Controlled LLM Decoding via Discrete Auto-regressive BiasingCode0
Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry PriorsCode0
TwinBooster: Synergising Large Language Models with Barlow Twins and Gradient Boosting for Enhanced Molecular Property PredictionCode0
Expanding the Vocabulary of BERT for Knowledge Base ConstructionCode0
A multimodal LLM for the non-invasive decoding of spoken text from brain recordingsCode0
Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query ExpansionCode0
Show:102550
← PrevPage 113 of 122Next →

No leaderboard results yet.