SOTAVerified

Large Language Model

Papers

Showing 151200 of 6097 papers

TitleStatusHype
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented GenerationCode3
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation TasksCode1
From Human to Machine Psychology: A Conceptual Framework for Understanding Well-Being in Large Language Model0
Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship in DeepSeek0
Improving Large Language Model Safety with Contrastive Representation LearningCode0
VGR: Visual Grounded Reasoning0
Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning0
Semantic Preprocessing for LLM-based Malware Analysis0
Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study0
FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations0
From Emergence to Control: Probing and Modulating Self-Reflection in Language ModelsCode0
The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs0
SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security TasksCode2
Intelligent Automation for FDI Facilitation: Optimizing Tariff Exemption Processes with OCR And Large Language Models0
LLM-as-a-Fuzzy-Judge: Fine-Tuning Large Language Models as a Clinical Evaluation Judge with Fuzzy LogicCode0
Nowcasting the euro area with social media data0
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices0
Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills0
Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework0
Automated Validation of Textual Constraints Against AutomationML via LLMs and SHACLCode0
DanceChat: Large Language Model-Guided Music-to-Dance Generation0
A Benchmark for Generalizing Across Diverse Team Strategies in Competitive PokémonCode1
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding0
Slimming Down LLMs Without Losing Their Minds0
Provably Learning from Language Feedback0
AutoMind: Adaptive Knowledgeable Agent for Automated Data ScienceCode2
NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI TutorsCode0
Improving Named Entity Transcription with Contextual LLM-based Revision0
Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models0
ADAgent: LLM Agent for Alzheimer's Disease Analysis with Collaborative Coordinator0
DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision0
Superstudent intelligence in thermodynamics0
Prompt-Guided Latent Diffusion with Predictive Class Conditioning for 3D Prostate MRI Generation0
Towards Multi-modal Graph Large Language Model0
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and PlanningCode7
Bridging the Gap Between Open-Source and Proprietary LLMs in Table QACode0
Disclosure Audits for LLM Agents0
GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models0
Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information0
XGraphRAG: Interactive Visual Analysis for Graph-based Retrieval-Augmented GenerationCode0
The Predictive Brain: Neural Correlates of Word Expectancy Align with Large Language Model Prediction Probabilities0
SoK: Machine Unlearning for Large Language Models0
PHRASED: Phrase Dictionary Biasing for Speech Translation0
Efficient Fireworks Algorithm Equipped with an Explosion Mechanism based on Student's T-distribution0
From Pixels to Graphs: using Scene and Knowledge Graphs for HD-EPIC VQA Challenge0
Towards Secure and Private Language Models for Nuclear Power Plants0
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM ReasoningCode1
SakugaFlow: A Stagewise Illustration Framework Emulating the Human Drawing Process and Providing Interactive Tutoring for Novice Drawing Skills0
Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness0
Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image AnalysisCode1
Show:102550
← PrevPage 4 of 122Next →

No leaderboard results yet.