SOTAVerified

Large Language Model

Papers

Showing 161170 of 6097 papers

TitleStatusHype
The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs0
SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security TasksCode2
Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study0
Intelligent Automation for FDI Facilitation: Optimizing Tariff Exemption Processes with OCR And Large Language Models0
LLM-as-a-Fuzzy-Judge: Fine-Tuning Large Language Models as a Clinical Evaluation Judge with Fuzzy LogicCode0
Nowcasting the euro area with social media data0
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices0
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding0
Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills0
Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework0
Show:102550
← PrevPage 17 of 610Next →

No leaderboard results yet.