SOTAVerified

Large Language Model

Papers

Showing 22512300 of 6097 papers

TitleStatusHype
SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning0
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis0
Factored Agents: Decoupling In-Context Learning and Memorization for Robust Tool Use0
Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey0
Arch-LLM: Taming LLMs for Neural Architecture Generation via Unsupervised Discrete Representation Learning0
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning0
PharmAgents: Building a Virtual Pharma with Large Language Model Agents0
Penrose Tiled Low-Rank Compression and Section-Wise Q&A Fine-Tuning: A General Framework for Domain-Specific Large Language Model Adaptation0
DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos0
Token-Driven GammaTune: Adaptive Calibration for Enhanced Speculative Decoding0
Resona: Improving Context Copying in Linear Recurrence Models with Retrieval0
Generalization Bias in Large Language Model Summarization of Scientific Research0
Exploring the Effectiveness of Multi-stage Fine-tuning for Cross-encoder Re-rankersCode0
VALLR: Visual ASR Language Model for Lip Reading0
EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit DialoguesCode0
Socially Constructed Treatment Plans: Analyzing Online Peer Interactions to Understand How Patients Navigate Complex Medical Conditions0
LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning0
Boosting Large Language Models with Mask Fine-TuningCode0
RocketPPA: Code-Level Power, Performance, and Area Prediction via LLM and Mixture of Experts0
Leveraging Large Language Models for Risk Assessment in Hyperconnected Logistic Hub Network Deployment0
Debate-Driven Multi-Agent LLMs for Phishing Email Detection0
A Multi-Modal Knowledge-Enhanced Framework for Vessel Trajectory Prediction0
MemInsight: Autonomous Memory Augmentation for LLM Agents0
Controlling Large Language Model with Latent ActionsCode0
Using large language models to produce literature reviews: Usages and systematic biases of microphysics parametrizations in 2699 publications0
Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation0
RedditESS: A Mental Health Social Support Interaction Dataset -- Understanding Effective Social Support to Refine AI-Driven Support Tools0
Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing0
CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing0
RALLRec+: Retrieval Augmented Large Language Model Recommendation with ReasoningCode0
InfoBid: A Simulation Framework for Studying Information Disclosure in Auctions with Large Language Model-based Agents0
Dynamic Pyramid Network for Efficient Multimodal Large Language ModelCode0
Operating Room Workflow Analysis via Reasoning Segmentation over Digital Twins0
Synthesizing world models for bilevel planning0
MoRE-LLM: Mixture of Rule Experts Guided by a Large Language ModelCode0
A Multilingual, Culture-First Approach to Addressing Misgendering in LLM ApplicationsCode0
D4R -- Exploring and Querying Relational Graphs Using Natural Language and Large Language Models -- the Case of Historical Documents0
Cross-Modal Prototype Allocation: Unsupervised Slide Representation Learning via Patch-Text Contrast in Computational Pathology0
Exploring the Effect of Robotic Embodiment and Empathetic Tone of LLMs on Empathy Elicitation0
1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training0
Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery0
Membership Inference Attacks on Large-Scale Models: A Survey0
SemEval-2025 Task 9: The Food Hazard Detection Challenge0
A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition0
OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching0
PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping0
ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation0
Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning0
FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs0
Iterative Hypothesis Generation for Scientific Discovery with Monte Carlo Nash Equilibrium Self-Refining Trees0
Show:102550
← PrevPage 46 of 122Next →

No leaderboard results yet.