SOTAVerified

Large Language Model

Papers

Showing 151–175 of 6097 papers

Title | Status | Hype
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Code | 1
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation | Code | 3
Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship in DeepSeek | - | 0
From Human to Machine Psychology: A Conceptual Framework for Understanding Well-Being in Large Language Model | - | 0
Improving Large Language Model Safety with Contrastive Representation Learning | Code | 0
VGR: Visual Grounded Reasoning | - | 0
The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs | - | 0
Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning | - | 0
From Emergence to Control: Probing and Modulating Self-Reflection in Language Models | Code | 0
Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study | - | 0
FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations | - | 0
SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks | Code | 2
Semantic Preprocessing for LLM-based Malware Analysis | - | 0
Intelligent Automation for FDI Facilitation: Optimizing Tariff Exemption Processes with OCR And Large Language Models | - | 0
LLM-as-a-Fuzzy-Judge: Fine-Tuning Large Language Models as a Clinical Evaluation Judge with Fuzzy Logic | Code | 0
Nowcasting the euro area with social media data | - | 0
Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework | - | 0
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | - | 0
Automated Validation of Textual Constraints Against AutomationML via LLMs and SHACL | Code | 0
DanceChat: Large Language Model-Guided Music-to-Dance Generation | - | 0
A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Code | 1
Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills | - | 0
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding | - | 0
Provably Learning from Language Feedback | - | 0
NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors | Code | 0

No leaderboard results yet.