SOTAVerified

Large Language Model

Papers

Showing 451500 of 6097 papers

TitleStatusHype
Think or Not? Exploring Thinking Efficiency in Large Reasoning Models via an Information-Theoretic LensCode1
Single-agent or Multi-agent Systems? Why Not Both?0
HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning0
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-trainingCode11
Multi-agent Systems for Misinformation Lifecycle : Detection, Correction And Source Identification0
UniTTS: An end-to-end TTS system without decoupling of acoustic and semantic informationCode1
ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning ModelsCode0
Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question AnsweringCode0
How do Scaling Laws Apply to Knowledge Graph Engineering Tasks? The Impact of Model Size on Large Language Model Performance0
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel DecodingCode2
EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human ActionsCode0
Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning0
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning0
Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation QualityCode0
SD-MAD: Sign-Driven Few-shot Multi-Anomaly Detection in Medical Images0
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning0
INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling0
DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation0
Incremental Sequence Classification with Temporal Consistency0
CASTILLO: Characterizing Response Length Distributions of Large Language ModelsCode0
AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-trainingCode0
Large Language Model-Empowered Interactive Load Forecasting0
ChemMLLM: Chemical Multimodal Large Language ModelCode1
Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts0
Scalable and Interpretable Contextual Bandits: A Literature Review and Retail Offer Prototype0
Continually Self-Improving Language Models for Bariatric Surgery Question--Answering0
A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial OptimizationCode1
PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models0
CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning0
Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions0
Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine0
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks0
Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification0
Reward Is Enough: LLMs Are In-Context Reinforcement Learners0
Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector0
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation0
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction0
CRAKEN: Cybersecurity LLM Agent with Knowledge-Based ExecutionCode1
AutoData: A Multi-Agent System for Open Web Data CollectionCode0
How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following BehaviorCode1
Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition0
Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval0
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling0
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model EditingCode0
Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling0
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective0
Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question AnsweringCode0
X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic SystemCode0
CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment0
Web-Shepherd: Advancing PRMs for Reinforcing Web AgentsCode2
Show:102550
← PrevPage 10 of 122Next →

No leaderboard results yet.