SOTAVerified

Large Language Model

Papers

Showing 24512500 of 6097 papers

TitleStatusHype
DETQUS: Decomposition-Enhanced Transformers for QUery-focused Summarization0
Leveraging Approximate Caching for Faster Retrieval-Augmented Generation0
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance0
Unveiling Biases in AI: ChatGPT's Political Economy Perspectives and Human Comparisons0
TPU-Gen: LLM-Driven Custom Tensor Processing Unit Generator0
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMsCode0
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report EvaluationCode0
ToolFuzz -- Automated Agent Tool Testing0
Architecture for a Trustworthy Quantum Chatbot0
AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management0
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks0
AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services0
Measuring temporal effects of agent knowledge by date-controlled tool use0
Better Process Supervision with Bi-directional Rewarding Signals0
Leveraging Large Language Models to Address Data Scarcity in Machine Learning: Applications in Graphene SynthesisCode0
Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges0
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining0
KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney DiseaseCode0
The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy0
Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm0
Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability0
PAIR: A Novel Large Language Model-Guided Selection Strategy for Evolutionary AlgorithmsCode0
Multimodal Stock Price Prediction: A Case Study of the Russian Securities Market0
Hierarchical Re-ranker Retriever (HRR)0
Towards Explainable Doctor Recommendation with Large Language Models0
DriveGen: Towards Infinite Diverse Traffic Scenarios with Large Models0
LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship PreservationCode0
Use Me Wisely: AI-Driven Assessment for LLM Prompting Skills Development0
BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt CompressionCode0
Text2Scenario: Text-Driven Scenario Generation for Autonomous Driving Test0
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between ActionsCode0
ATLaS: Agent Tuning via Learning Critical Steps0
Generator-Assistant Stepwise Rollback Framework for Large Language Model AgentCode0
Measuring Political Preferences in AI Systems: An Integrative Approach0
RedChronos: A Large Language Model-Based Log Analysis System for Insider Threat Detection in Enterprises0
Trust, Experience, and Innovation: Key Factors Shaping American Attitudes About AI0
Jailbreaking Safeguarded Text-to-Image Models via Large Language Models0
Can (A)I Change Your Mind?Code0
Can Large Language Models Help Experimental Design for Causal Discovery?0
Language-Guided Object Search in Agricultural Environments0
Using (Not so) Large Language Models for Generating Simulation Models in a Formal DSL -- A Study on Reaction Networks0
SHADE-AD: An LLM-Based Framework for Synthesizing Activity Data of Alzheimer's Patients0
LLMs as Educational Analysts: Transforming Multimodal Data Traces into Actionable Reading Assessment ReportsCode0
Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh0
KurTail : Kurtosis-based LLM Quantization0
Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language ModelCode0
Towards Refining Developer Questions using LLM-Based Named Entity Recognition for Developer Chatroom Conversations0
FunBench: Benchmarking Fundus Reading Skills of MLLMs0
Never too Prim to Swim: An LLM-Enhanced RL-based Adaptive S-Surface Controller for AUVs under Extreme Sea Conditions0
Leveraging Compute-in-Memory for Efficient Generative Model Inference in TPUs0
Show:102550
← PrevPage 50 of 122Next →

No leaderboard results yet.