SOTAVerified

Large Language Model

Papers

Showing 1551–1575 of 6097 papers

Title | Status | Hype
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries | | 0
From Human to Machine Psychology: A Conceptual Framework for Understanding Well-Being in Large Language Model | | 0
Improving Large Language Model Safety with Contrastive Representation Learning | Code | 0
From Emergence to Control: Probing and Modulating Self-Reflection in Language Models | Code | 0
Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning | | 0
Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study | | 0
FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations | | 0
The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs | | 0
Semantic Preprocessing for LLM-based Malware Analysis | | 0
VGR: Visual Grounded Reasoning | | 0
NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors | Code | 0
Nowcasting the euro area with social media data | | 0
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding | | 0
Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills | | 0
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | | 0
Intelligent Automation for FDI Facilitation: Optimizing Tariff Exemption Processes with OCR And Large Language Models | | 0
Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework | | 0
Automated Validation of Textual Constraints Against AutomationML via LLMs and SHACL | Code | 0
DanceChat: Large Language Model-Guided Music-to-Dance Generation | | 0
Provably Learning from Language Feedback | | 0
Improving Named Entity Transcription with Contextual LLM-based Revision | | 0
LLM-as-a-Fuzzy-Judge: Fine-Tuning Large Language Models as a Clinical Evaluation Judge with Fuzzy Logic | Code | 0
Slimming Down LLMs Without Losing Their Minds | | 0
Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA | Code | 0
DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision | | 0

No leaderboard results yet.