SOTAVerified

Language Modeling

Papers

Showing 16511700 of 14182 papers

TitleStatusHype
Reinforced Large Language Model is a formal theorem proverCode0
Logical forms complement probability in understanding language model (and human) performance0
AIDE: Agentically Improve Visual Language Model with Domain Experts0
On Mechanistic Circuits for Extractive Question-Answering0
LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search0
E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection0
Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation0
TANTE: Time-Adaptive Operator Learning via Neural Taylor Expansion0
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image ClassificationCode2
SelfElicit: Your Language Model Secretly Knows Where is the Relevant EvidenceCode1
Contextual Subspace Manifold Projection for Structural Refinement of Large Language Model Representations0
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model0
Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples0
LLM Pretraining with Continuous Concepts0
QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval0
AI-VERDE: A Gateway for Egalitarian Access to Large Language Model-Based Resources For Educational Institutions0
MetaSC: Test-Time Safety Specification Optimization for Language ModelsCode0
MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI ClassificationCode1
ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model0
Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems0
JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed MetadataCode1
Small Language Model Makes an Effective Long Text ExtractorCode1
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy OptimizationCode0
RomanLens: Latent Romanization and its role in Multilinguality in LLMs0
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn MoreCode0
Auditing Prompt Caching in Language Model APIsCode0
Implicit Language Models are RNNs: Balancing Parallelization and ExpressivityCode1
AppVLM: A Lightweight Vision Language Model for Online App Control0
Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLMCode4
K-ON: Stacking Knowledge On the Head Layer of Large Language Model0
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought TemplatesCode4
Recent Advances in Discrete Speech Tokens: A Review0
Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation0
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation LearningCode1
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoECode1
Rationalization Models for Text-to-SQL0
μnit Scaling: Simple and Scalable FP8 LLM Training0
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language ModelsCode0
Investigating Compositional Reasoning in Time Series Foundation Models0
Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform0
Effective Black-Box Multi-Faceted Attacks Breach Vision Large Language Model Guardrails0
Enabling Autoregressive Models to Fill In Masked Tokens0
Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education0
Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks0
ScaffoldGPT: A Scaffold-based GPT Model for Drug Optimization0
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot ControlCode1
RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative Gastrointestinal Cancer Care0
UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and UnderstandingCode1
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech SystemCode11
Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging0
Show:102550
← PrevPage 34 of 284Next →

No leaderboard results yet.