SOTAVerified

Large Language Model

Papers

Showing 24512475 of 6097 papers

TitleStatusHype
Unveiling Biases in AI: ChatGPT's Political Economy Perspectives and Human Comparisons0
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMsCode0
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance0
QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation0
Leveraging Approximate Caching for Faster Retrieval-Augmented Generation0
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report EvaluationCode0
No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding0
Measuring temporal effects of agent knowledge by date-controlled tool use0
Architecture for a Trustworthy Quantum Chatbot0
AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management0
Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges0
AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services0
Leveraging Large Language Models to Address Data Scarcity in Machine Learning: Applications in Graphene SynthesisCode0
Better Process Supervision with Bi-directional Rewarding Signals0
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining0
ToolFuzz -- Automated Agent Tool Testing0
KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney DiseaseCode0
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks0
The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy0
Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability0
Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm0
PAIR: A Novel Large Language Model-Guided Selection Strategy for Evolutionary AlgorithmsCode0
Multimodal Stock Price Prediction: A Case Study of the Russian Securities Market0
Trust, Experience, and Innovation: Key Factors Shaping American Attitudes About AI0
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between ActionsCode0
Show:102550
← PrevPage 99 of 244Next →

No leaderboard results yet.