SOTAVerified

Large Language Model

Papers

Showing 1651-1700 of 6097 papers

Title | Status | Hype
Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback | | 0
Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems | | 0
CogniPair: From LLM Chatbots to Conscious AI Agents -- GNWT-Based Multi-Agent Digital Twins for Social Pairing -- Dating & Hiring Applications | | 0
MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | | 0
Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | | 0
Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets | | 0
TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models | | 0
TestAgent: An Adaptive and Intelligent Expert for Human Assessment | | 0
Adaptive Graph Pruning for Multi-Agent Communication | Code | 0
TaxAgent: How Large Language Model Designs Fiscal Policy | | 0
MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | | 0
Hybrid AI for Responsive Multi-Turn Online Conversations with Novel Dynamic Routing and Feedback Adaptation | | 0
Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Code | 0
WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks | | 0
KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning | | 0
PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization | | 0
From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models | | 0
MLorc: Momentum Low-rank Compression for Large Language Model Adaptation | | 0
Why Gradients Rapidly Increase Near the End of Training | | 0
Image Generation from Contextually-Contradictory Prompts | | 0
LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback | | 0
COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents | | 0
PointT2I: LLM-based text-to-image generation via keypoints | | 0
EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | | 0
Bridging Subjective and Objective QoE: Operator-Level Aggregation Using LLM-Based Comment Analysis and Network MOS Comparison | | 0
Mamba Drafters for Speculative Decoding | | 0
OG-VLA: 3D-Aware Vision Language Action Model via Orthographic Image Generation | | 0
A Large Language Model-Supported Threat Modeling Framework for Transportation Cyber-Physical Systems | | 0
HADA: Human-AI Agent Decision Alignment Architecture | | 0
Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | Code | 0
Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems | Code | 0
Organizational Adaptation to Generative AI in Cybersecurity: A Systematic Review | | 0
SALE : Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling | Code | 0
Artificial Empathy: AI based Mental Health | | 0
From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning | | 0
A Reward-driven Automated Webshell Malicious-code Generator for Red-teaming | | 0
Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings | | 0
Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering | | 0
A Red Teaming Roadmap Towards System-Level Safety | | 0
SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems | | 0
FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation | Code | 0
HardTests: Synthesizing High-Quality Test Cases for LLM Coding | | 0
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | | 0
RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward | | 0
Beyond Exponential Decay: Rethinking Error Accumulation in Large Language Models | | 0
Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | Code | 0
CREFT: Sequential Multi-Agent LLM for Character Relation Extraction | | 0
Intuitionistic Fuzzy Sets for Large Language Model Data Annotation: A Novel Approach to Side-by-Side Preference Labeling | | 0
When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs | | 0
MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform | | 0
