SOTAVerified

Large Language Model

Papers

Showing 5601-5625 of 6097 papers

Title | Status | Hype
Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Code | 0
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | Code | 0
Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model | Code | 0
WaterDrum: Watermarking for Data-centric Unlearning Metric | Code | 0
TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability | Code | 0
Conversations in Galician: a Large Language Model for an Underrepresented Language | Code | 0
Vamos: Versatile Action Models for Video Understanding | Code | 0
Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models | Code | 0
PeriGuru: A Peripheral Robotic Mobile App Operation Assistant based on GUI Image Understanding and Prompting with LLM | Code | 0
Leveraging Content and Acoustic Representations for Speech Emotion Recognition | Code | 0
Can a Large Language Model Learn Matrix Functions In Context? | Code | 0
The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Code | 0
Can a large language model be a gaslighter? | Code | 0
Conversational Feedback in Scripted versus Spontaneous Dialogues: A Comparative Analysis | Code | 0
Can AI Relate: Testing Large Language Model Response for Mental Health Support | Code | 0
SemEval-2017 Task 4: Sentiment Analysis in Twitter using BERT | Code | 0
TULUN: Transparent and Adaptable Low-resource Machine Translation | Code | 0
A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error Detection | Code | 0
Exploiting the Vulnerability of Large Language Models via Defense-Aware Architectural Backdoor | Code | 0
Personalized LLM for Generating Customized Responses to the Same Query from Different Users | Code | 0
Automated Privacy Information Annotation in Large Language Model Interactions | Code | 0
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Code | 0
Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones | Code | 0
Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct Features | Code | 0
Scaling Reasoning can Improve Factuality in Large Language Models | Code | 0
Page 225 of 244

No leaderboard results yet.