SOTAVerified

Large Language Model

Papers

Showing 5175 of 6097 papers

TitleStatusHype
Early Signs of Steganographic Capabilities in Frontier LLMsCode0
OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question AnsweringCode0
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMsCode1
Dataset Distillation via Vision-Language Category PrototypeCode1
Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning0
Thought-Augmented Planning for LLM-Powered Interactive Recommender AgentCode0
Where, What, Why: Towards Explainable Driver Attention PredictionCode1
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and GrounderCode1
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval0
ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence AlignmentCode0
A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis0
ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation0
Large Language Model Agent for Modular Task Execution in Drug Discovery0
AgentStealth: Reinforcing Large Language Model for Anonymizing User-generated TextCode0
Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language ModelsCode0
MT2-CSD: A New Dataset and Multi-Semantic Knowledge Fusion Method for Conversational Stance Detection0
mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at ScaleCode0
Can "consciousness" be observed from large language model (LLM) internal states? Dissecting LLM representations obtained from Theory of Mind test with Integrated Information Theory and Span Representation analysis0
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test0
Multimodal Prompt Alignment for Facial Expression Recognition0
MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification0
HumanOmniV2: From Understanding to Omni-Modal Reasoning with ContextCode2
GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding0
OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic TypographyCode0
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and EditingCode5
Show:102550
← PrevPage 3 of 244Next →

No leaderboard results yet.