SOTAVerified

Large Language Model

Papers

Showing 18011850 of 6097 papers

TitleStatusHype
ProgRM: Build Better GUI Agents with Progress Rewards0
HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning0
Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence0
CASTILLO: Characterizing Response Length Distributions of Large Language ModelsCode0
Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions0
Incremental Sequence Classification with Temporal Consistency0
EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human ActionsCode0
Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question AnsweringCode0
Scalable and Interpretable Contextual Bandits: A Literature Review and Retail Offer Prototype0
Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine0
Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation QualityCode0
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning0
DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation0
CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning0
Large Language Model-Empowered Interactive Load Forecasting0
Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning0
How do Scaling Laws Apply to Knowledge Graph Engineering Tasks? The Impact of Model Size on Large Language Model Performance0
Continually Self-Improving Language Models for Bariatric Surgery Question--Answering0
AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-trainingCode0
ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning ModelsCode0
PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models0
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning0
Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts0
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks0
SD-MAD: Sign-Driven Few-shot Multi-Anomaly Detection in Medical Images0
INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling0
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction0
Reward Is Enough: LLMs Are In-Context Reinforcement Learners0
Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector0
Bridging Sign and Spoken Languages: Pseudo Gloss Generation for Sign Language Translation0
Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions0
Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling0
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective0
Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification0
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response TheoryCode0
X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic SystemCode0
Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning0
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model EditingCode0
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling0
Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question AnsweringCode0
CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment0
Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition0
AutoData: A Multi-Agent System for Open Web Data CollectionCode0
Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval0
Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning0
Privacy-Preserving Conformal Prediction Under Local Differential PrivacyCode0
ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMsCode0
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation0
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning0
Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning0
Show:102550
← PrevPage 37 of 122Next →

No leaderboard results yet.